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CHAPTER 1 


Introduction to Calculus 



1.1 Velocity and Distance 



The right way to begin a calculus book is with calculus. This chapter will jump 
directly into the two problems that the subject was invented to solve. You will see 
what the questions are, and you will see an important part of the answer. There are 
plenty of good things left for the other chapters, so why not get started? 

The book begins with an example that is familiar to everybody who drives a car. 
It is calculus in action — the driver sees it happening. The example is the relation 
between the speedometer and the odometer , One measures the speed (or velocity); 
the other measures the distance traveled , We will write v for the velocity, and / for 
how far the car has gone. The two instruments sit together on the dashboard: 



fig. 1.1 Velocity v and total distance / (at one instant of time). 

Notice that the units of measurement are different for v and /, The distance / is 
measured in kilometers or miles (it is easier to say miles). The velocity v is measured 
in km/hr or mites per hour. A unit of time enters the velocity but not the distance. 
Every formula to compute v from / will have / divided by time. 

The central question of calculus i$ the relation between v and /, 
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Can you find v if you know f t and vice versa, and how? If we know the velocity over 
the whole history of the car, we should be able to compute the total distance traveled. 
In other words, if the speedometer record is complete but the odometer is missing, 
its information could be recovered. One way to do it (without calculus) is to put in 
a new odometer and drive the car all over again at the right speeds. That seems like 
a hard way; calculus may be easier. But the point is that the information is there. 
If we know everything about v, there must be a method to find /. 

What happens in the opposite direction, when / is known? If you have a complete 
record of distance, could you recover the complete velocity? In principle you could drive 
the car, repeat the history, and read off the speed. Again there must be a better way. 

The whole subject of calculus is built on the relation between v and /, The question 
we are raising here is not some kind of joke, alter which the book will get serious 
and the mathematics will get started. On the contrary, / am serious now — and the 
mathematics has already started. We need to know how to find the velocity from a 
record of the distance, (That is called differentiation, and it is the central idea of 
differential calculus ,) We also want to compute the distance from a history of the 
velocity, (That is integration , and it is the goal of integral calculus.) 

Differentiation goes from / to u; integration goes from v to /. We look first 
at examples in which these pairs can be computed and understood, 

CONSTANT VELOCITY 

Suppose the velocity is fixed at u = 60 (miles per hour). Then / increases at this 
constant rate. After two hours the distance is /= 120 (miles). After four hours 
/ = 240 and after t hours /= 6Gt. We say that / increases linearly with time~its 
graph is a straight fine. 


velocity u(f) 


60 


■ ; | V — 60 

Area i 240 I 


4 — i — I time t 

2 4 


distance /(f) 


240 - 



240 

Slope = — = 60 


time / 


Hg. 1.2 Constant velocity v = 60 and linearly increasing distance / = 60f. 


Notice that this example starts the car at full velocity. No time is spent picking up 
speed. (The velocity is a "step function.”) Notice also that the distance starts at zero; 
the car is new. Those decisions make the graphs of v and / as neat as possible. One 
is the horizontal line v = 60. The other is the sloping line / = 60f. This u, /, f relation 
needs algebra hut not calculus: 

if v is constant and f starts at zero then f= vt. 

The opposite is also true. When / increases linearly, v is constant. The division by 
time gives the slope . The distance is f x = 120 miles when the time is t t = 2 hours. 
Later f 2 = 240 at f 2 = 4. At both points, the ratio f/t is 60 miles/hour. Geometrically, 
the velocity is the slope of the distance graph : 

change in distance vt 

slope = — : “ — : — — = u, 

change in time t 
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Fig, 1.3 Straight lines /= 20 + 60f (slope 60) and /= — 30/ (slope — 30). 


The slope of the /- graph gives the v-graph. Figure 1.3 shows two more possibilities: 

L The distance starts at 20 instead of 0. The distance formula changes from 60t 
to 20+ 60t. The number 20 cancels when wc compute change in distance — so 
the slope is still 60, 

2. When v is negative, the graph of / goes downward. The car goes backward and 
the slope of /= - 30f is v = - 30, 

I don't think speedometers go below zero. But driving backwards, it's not that safe 
to watch. If you go fast enough, Toyota says they measure "absolute values” — the 
speedometer reads + 30 when the velocity is — 30. For the odometer, as far as I know 
it just stops. It should go backward. t 

VELOCITY vs. DISTANCE: SLOPE vs , AREA 

How do you compute f from u? The point of the question is to see / = vt on the 
graphs. We want to start with the graph of v and discover the graph of /, Amazingly, 
the opposite of slope is area. 

The distance f is the area under the v~graph, When v is constant, the region under 
the graph is a rectangle. Its height is t\ its width is i , and its area is v times t. This is 
integration, to go from v to / by computing the area. We are glimpsing two of the 
central facts of calculus. 


1A The slope of the f-graph gives the velocity v. The area under the v-graph 
gives the distance /. 


That is certainly not obvious, and I hesitated a long time before I wrote it down in 
this first section. The best way to understand it is to look first at more examples. The 
whole point of calculus is to deal with velocities that are not constant, and from now 
on v has several values. 

EXAMPLE (Forward and back) There is a motion that you will understand right away. 
The car goes forward with velocity V, and comes back at the same speed. To say it 
more correctly, the velocity in the second part is — V. If the forward part lasts until 
t = 3, and the backward part continues to r = 6, the car will come back where it started. 
The total distance after both parts will be / = 0, 


tThis actually happened in Ferris BueUer's Day Off, when the hero borrowed his father's sports 
car and ran up the mileage. At home he raised the ear and drove in reverse. I forget if it 
worked. 
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o(t) - slope of/(0 

area 

3V 

3 6 


area 

-3V 




Hg. 14 Velocities + V and — V give motion forward and back, ending at /(6) = 0. 


The u-graph shows velocities + V and - K The distance starts up with slope + V 
and reaches /= 3K Then the car starts backward. The distance goes down with slope 
V and returns to /= 0 at t = 6. 

Notice what that means. The total area “under” the thgraph is zero! A negative 
velocity makes the distance graph go downward (negative slope). The car is moving 
backward. Area below the axis in the v-graph is counted as negative. 

FUNCTIONS 


This forward-back example gives practice with a crucially important idea — the con- 
cept of a “function.” We seize this golden opportunity to explain functions: 

The number v[t) is the value of the function v at the time t. 

The time £ is the input to the function. The velocity [?(£) at that time is the output . 
Most people say “r of t” when they read u(£). The number “v of 2” is the velocity 
when t = 2. The forward-back example has v(2)= + V and v{4) — — V. The function 
contains the whole history, like a memory bank that has a record of v at each t. 

It is simple to convert forward-back motion into a formula. Here is u(t): 

\ + v if 0<t<3 

u(t) = l ? if t = 3 

- V if 3 < t < 6 

\ 

The right side contains the instructions for finding i?{£). The input t is converted into 
the output + V or — K The velocity u(r) depends on £. In this case the function is 
“discontinuous,” because the needle jumps at t — 3. The velocity is not defined at that 
instant. There is no v(3 ). (You might argue that v is zero at the jump, but that leads 
to trouble.) The graph of / has a corner, and we can’t give its slope. 

The problem also involves a second function, namely the distance. The principle 
behind f(t) is the same: /(r) is the distance at time f. It is the net distance forward, 
and again the instructions change at / = 3. In the forward motion, f(t) equals Vt as 
before. In the backward half, a calculation is built into the formula for f(t): 

(Vt if 0 ^ t ^ 3 

/(f)— < 

[V(6 ~f) if 3 *0^6 


At the switching time the right side gives two instructions (one on each line). This 
would be bad except that they agree: /(3)=3Kf The distance function is “con- 


fA function is only allowed one value f(t) or /;(r) at each time r 
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tmuous.” There is no jump in / even when there is a jump in v . After t = 3 the distance 
decreases because of - Vl At t = 6 the second instruction correctly gives /{6} = 0. 

Notice something more. The functions were given by graphs before they were given 
by formulas. The graphs tell you / and v at every time f—sometimes more clearly 
than the formulas. The values /(f) and u(f) can also be given by tables or equations 
or a set of instructions, (In some way all functions are instructions — the function 
tells how to find / at time t.) Part of knowing / is knowing all its inputs and 
outputs — its domain and range : 

The domain of a function is the set of inputs. The range is the set of outputs. 

The domain of / consists of all times 0 ^ t ^ 6, The range consists of all distances 
0^/(f)^3K. (The range of v contains only the two velocities + V and — V.) 
We mention now, and repeat later, that every “linear” function has a formula 
/(f) = vt+ C. Its graph is a line and v is the slope. The constant C moves the line up 
and down. It adjusts the line to go through any desired starting point. 

SUMMARY: MORE ABOUT FUNCTIONS 

May I collect together the ideas brought out by this example? We had two functions 
v and /. One was velocity, the other was distance. Each function had a domain , 
and a range , and most important a graph . For the /-graph we studied the slope 
{which agreed with v). For the n-graph we studied the area (which agreed with /). 
Calculus produces functions in pairs, and the best thing a book can do early is to 
show you more of them. 


in 

f input t -► 

function f 

-*■ output f(t) 1 

in 

the 

i input 2 — > 

function v 

► output i:{2) J 

> the 

domain 

l input 7 — ► 

m = 2t + 6 /(7) = 20 J 

range 


Note about the definition of a function , The idea behind the symbol /(f) is absolutely 
crucial to mathematics. Words don’t do it justice! By definition, a function is a “rule” 
that assigns one member of the range to each member of the domain. Or, a function 
is a set of pairs (f, /(f)) with no t appearing twice. (These are “ordered pairs” because 
we write t before /(f).) Both of those definitions are correct — but somehow they are 
too passive. 

In practice what matters is the active part. The number /(f) is produced from the 
number f. We read a graph, plug into a formula, solve an equation, run a computer 
program. The input t is “mapped” to the output /(f), which changes as t changes. 
Calculus is about the rate of change. This rate is our other function v. 



HQ- 1-5 Subtracting 2 from / affects the range. Subtracting 2 from t affects the domain. 
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It is quite hard at the beginning, and not automatic, to see the difference between 
/(f) — 2 and j\t — 2 ), Those are both new functions, created out of the original /(r). 
In /(r) — 2, we subtract 2 from all the distances. That moves the whole graph down. 
In /{r — 2), we subtract 2 from the time. That moves the graph over to the right. 
Figure L5 shows both movements, starting from /(t) = 2t L The formula to find 
/{r - 2) is 2(r - 2) + 1, which is 2r - 3. 

A graphing calculator also moves the graph, when you change the viewing window. 
You can pick any rectangle A ^ f ^ B, C ^/(f) ^ D. The screen shows that part of 
the graph. But on the calculator, the function f(t) remains the same. It is the axes that 
get renumbered. In our figures the axes stay the same and the function is changed. 

There arc two more basic ways to change a function. (We are always creating new 
functions — that is what mathematics is all about.) Instead of subtracting or adding, 
we can multiply the distance by 2. Figure 1,6 shows 2/(f). And instead of shifting the 
time, we can speed it up. The function becomes f(2t). Everything happens twice as 
fast (and takes half as long). On the calculator those changes correspond to a 
"zoom ' — on the / axis or the i axis. Wc soon come back to zooms. 



Fig. 1.6 Doubling the distance or speeding up the time doubles the slope. 


A A EXERCISES 


Each section of the book contains read-through questions. They 
allow you to outline the section yourself— more actively than 
reading a summary. This is probably the best way to remember 
the important ideas. 

Starting from /[0) = O at constant velocity r, the distance 
function is fit)— o - When f{t} = 55t the velocity is 
v = b . When f{i) = 55t + 1000 the velocity is still c 
and the starting value is f(0) = d r [n each case i is the 

e of the graph of /. When f is negative, the graph 
of g goes downward. In that case area in the r-graph 
counts as h 

Forward motion from /(0) = 0 to / (2) = 10 has i = < 

Then backward motion to /[4) = 0 has v = j The dis- 
tance function is fit) = 5f for 0 ^ t ^ 2 and then f[t) — k 


(not — 5r). The slopes are i and m . The distance 
/(3)-_n_. The area under the r-graph up to time 1.5 is 
o . The domain of / is the time interval p . and the 
range is the distance interval Q The range of y(t) is only 


The value of fit) = 3r + 1 at t = 2 is /( 2) = s . The value 
19 equals fi \ i The difference /( 4) —f(\)= u . That 
is the change in distance, when 4 — 1 is the change in v . 
The ratio of those changes equals w , which is the x 
of the graph. The formula for /(f) + 2 is 3f + 3 whereas 
f{i + 2) equals y , Those functions have the same i 
as /: the graph of f(t) + 2 is shifted a and f(t + 2) is 
shifted B . The formula for /(5r) is c . The formula 
for 5 fit) is D r The slope has jumped from 3 to E 
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The set of inputs to a function is its F , The set of 
outputs is its G . The functions /(f) = 7 + 3(f — 2) and 
f(t) = vt + C are H . Their graphs are I with slopes 
equal to J and K . They are the same function, if 
v = i and C = M . 


Draw the velocity graph that goes with each distance graph. 



2 4 6 1 2 



10 20 30 


2b 


T IT 3 T 


3 Write down three-part formulas for the velocities u(f) in 
Problem 2, starting from u(f) = 2 for 0 < t < 10* 


Draw the distance graph that goes with each velocity graph. 
Start from /= 0 at t — 0 and mark the distance. 


(jJ 

o 

V 





n 

1 




1 2 3 

i n 


13a 


14 

40 - 

V 







1 2 

-40 - 


14a 



15 Write down formulas for i?(f) in Problem 14, starting with 
v- — 40 for 0 < £ < 1. Find the average velocities to t = 2.5 
and t - 37. 

16 Give 3- part formulas for the areas f(t) under r(r) in 13, 


4 The distance in lb starts with /(f) = 10 - 10f for 0 ^ f ^ L 17 The distance in 14a starts with f(t) = -40; for 0 ^ t ^ I. 

Give a formula for the second part. Find /(r) in the other part, which passes through /= Oat t = 2 . 


5 In the middle of graph 2a find /(15) and /{ 12) and /(r) t 

6 In graph 2b find f(\AT). If 7=3 what is /( 4)? 

7 Find the average speed between t = 0 and t = 5 in graph 
la. What is the speed at t = 5? 

8 What is the average speed between f = 0 and f = 2 in graph 

I b? The average speed is zero between t = { and t = __ , 

9 (recommended) A car goes at speed v = 20 into a brick 
wall at distance /= 4. Give two-part formulas for u(f) and 
/(f) (before and after), and draw the graphs* 

10 Draw any reasonable graphs of t?(f) and /(f) when 

(a) the driver backs up, stops to shift gear, then goes fast; 

(b) the driver slows to 55 for a police car; 

(c) in a rough gear change, the car accelerates in jumps; 

(d) the driver waits for a light that turns green. 

II Your bank account earns simple interest on the opening 
balance /(G). What are the interest rates per year? 



+- H * t 1 \ h 

12 12 


18 Draw the velocity and distance graphs if u(f) = 8 for 
0 < t < 2 S /(f) = 20 + f for 2 ^ f ^ 3. 

19 Draw rough graphs of y = y/x and y = ^/x — 4 and 
y = yfx — 4. They are “half-parabolas" with infinite slope at 
the start* 

20 What is the break-even point if x yearbooks cost 
SI 200 + 30x to produce and the income is 40x? The slope of 

the cost line is (cost per additional book). If it goes 

above you can't break even. 

21 What are the domains and ranges of the distance functions 
in 14a and 14b — all values of f and /(f) if /(0) = 0? 

22 What is the range of t?(f) in 14b? Why is £ = 1 not in the 
domain of u(f) in 14a? 

Problems 23-28 involve Unear functions /(t) = vt + C. Find the 
constants v and C 

23 What linear function has /(0) = 3and /(2)=— 11? 

24 Find two linear functions whose domain is 0 < £ < 2 and 
whose range is 1 ^/(f) ^ 9. 

25 Find the linear function with /(l) = 4 and slope 6. 

26 What functions have f[t + 1) = /(f) + 2? 

27 Find the linear function with /{f + 2) =/(() + 6 and 

/(l) = 10. 


12 The earth's population is growing at v= 100 million a 
year, starting from /= 52 billion in 1990* Graph /(f) and find 
/(200G). 


28 Find the only /= vt that has /(2f) = 4 f(t). Show that every 

/=iof 2 has this property. To go times as far in 

twice the time, you must accelerate. 
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29 Sketch the graph of /(f) = [5 — 2t| (absolute value) for 
|t[ ^ 2 and find its slopes and range. 

30 Sketch the graph of f(t) ~ 4 — t — j4 — f| for 2 ^ t ^ 5 and 
find its slope and range. 

31 Suppose v — 8 up to time T, and after that v = — 2. Starting 
from zero, when does / return to zero? Give formulas for u(t) 
and /(t). 

32 Suppose v — 3 up to time T— 4. What new velocity will 
lead to /(7) = 30 if /{ 0) = 0? Give formulas for u(t) and f(t). 

33 What function F(C) converts Celsius temperature C to 

Fahrenheit temperature F? The slope is , which is 

the number of Fahrenheit degrees equivalent to 1°C. 

34 What function C(F) converts Fahrenheit to Celsius (or 
Centigrade), and what is its slope? 

35 What function converts the weight w in grams to the 
weight f(w) in kilograms? Interpret the slope of f(w). 

36 (Newspaper of March 1989) Ten hours after the accident 
the alcohoi reading was .061. Blood alcohol is eliminated at 
.015 per hour. What was the reading at the time of the acci- 
dent? How much later would it drop to .04 {the maximum set 
by the Coast Guard)? The usual limit on drivers is . 10 percent 

Which points between t — 0 and t = 5 can be in the domain of 
/(f)? With this domain find the range in 37-42. 

37 /(i) = 38 /(<) = Uy/rri 

39 j p (f) = |r_4| (absolute value) 40 /(f) = l/(t — 4). 2 
41 /(*) = 2' 42/(0 = 2"' 

43 (a) Draw the graph of / (r) = if + 3 with domain 0 ^ t < 2. 
Then give a formula and graph for 

(b) /(t)+l (c) /(t+1) 

<<i) 4/w (e) 

44 (a) Draw the graph of (7(f) = step function = {0 for t < 0, 
1 for f £ 0}. Then draw 

(b) (7(0 + 2 (c) (7{f -h 2) 

(d) 31/(0 (e) 1/(30. 


45 (a) Draw the graph of /(r) = t + 1 for -1 ^ r ^ 1. Find 
the domain, range, slope, and formula for 

(b) 2/(0 (c) /(*- 3) (d) -/CO (e)/(-r). 

46 If /(0 = t — 1 what are 2/(30 an d /(I — 0 and /(2 — 1 )? 

47 In the forward -back example find /(i 73 and f(\ T). Verify 
that those agree with the areas “under'’ the t>-graph in 
Figure 1.4, 

48 Find formulas for the outputs f Y {t) and / 2 (0 which come 
from the input f: 

(1) inside = input *3 (2) inside «- input + 6 

output = inside + 3 output «- inside * 3 

Note BASIC and FORTRAN (and calculus itself) use = 
instead of But the symbol «- or i= is in some ways better. 
The instruction t «- t + 6 produces a new t equal to the old t 
plus six. The equation t = t + 6 is not intended, 

49 Your computer can add and multiply. Starting with the 
number 1 and the input called f, give a list of instructions to 
lead to these outputs: 

m = t 2 + t / 2 (0=/i(/i(0) AfrWiG + i* 

50 In fifty words or less explain what a function is. 

The last questions are challenging bat possible. 

51 If /(r) = 3t — 1 for 0 ^ t ^ 2 give formulas (with domain) 
and find the slopes of these six functions: 

(a) f(t + 2) (b) f{t) + 2 (c) 2/(r) 

(d) /(2t) (e) /(— f) (f) /(/(0). 

52 For f(t) = vt + C find the formulas and slopes of 

(a) 3/(0+ 1 (b)/(3* + I) (c) 2/(4() 

(d) /(— o (e) m-m (o rum 

53 (hardest) The forward -back function is /(f) = 2f for 

0 ^ t ^ 3, /(t) = 12 - 2t for 3 < t < 6. Graph and find 

its four -part formula. First try t = 1.5 and 3. 

54 (a) Why is the letter X not the graph of a function? 

(b) Which capital letters are the graphs of functions? 

(c) Draw graphs of their slopes. 


1 .2 Calculus Without Limits 


The next page is going to reveal one of the key ideas behind calculus. The discussion 
is just about numbers — functions and slopes can wait. The numbers are not even 
special, they can be any numbers. The crucial point is to look at their differences: 

Suppose the numbers are/=0 2 6 7 4 9 

Their differences are v = 2 4 1—35 

The differences are printed in between, to show 2 — 0 = 2 and 6 — 2 — 4 and 7 — 6 = L 
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Notice how 4 — 7 gives a negative answer —3. The numbers in / can go up or down, 
the differences in u can be positive or negative. The idea behind calculus comes when 
you add up those differences : 

2 + 4 + 1 - 3 + 5=9 

The sum of differences is 9. This is the last number on the top line (in /). Is this an 
accident, or is this always true? If we stop earlier, after 2 + 4 + 1, we get the 7 in /. 
Test any prediction on a second example: 

Suppose the numbers are /= 1 3 7 8 5 10 

Their differences are v = 2 4 1—35 

The /’ s are increased by 1. The differences are exactly the same — no change. The 
sum of differences is stili 9. But the last f is now 10. That prediction is not right, we 
don’t always get the last /, 

The first f is now 1. The answer 9 (the sum of differences) is 10— I, the last f 
minus the first f. What happens when we change the /’ s in the middle? 

Suppose the numbers are /= 1 5 12 7 10 

Their differences are v = 4 7 —53 

The differences add to 4 + 7 — 5 + 3 = 9. This is still 10 - I. No matter what /’ s we 
choose or how many, the sum of differences is controlled by the first / and last /. 
If this is always true, there must be a dear reason why the middle f's cancel out. 

The sum of differences is (5 - 1 ) + (12 - 5) + (7 - 1 2) + (10 - 7) = 10 - 1 . 

The 5’s cancel, the I2’s cancel, and the 7’s cancel. It is only 10 — 1 that doesn’t cancel. 
This is the key to calculus! 


IB The differences of the f's add up to (f tst — / first ). 


EXAMPLE! The numbers grow linearly: /= 2 3 4 5 6 7 

Their differences are constant: v = 1 1 1 1 1 

The sum of differences is certainly 5. This agrees with 7 — 2 =f aa — / first . The numbers 
in v remind us of constant velocity. The numbers in / remind us of a straight line 
J'=vt + C. This example has u= l and the f’s start at 2. The straight line would 
come from /= f + 2. 

EXAMPLE 2 The numbers are squares: /= 0 1 4 9 16 

Their differences grow linearly: i>= 1 3 5 7 

1 + 3 + 5 + 7 agrees with 4 Z = 16. It is a beautiful fact that the first j odd numbers 
always add up to j 2 . The u’s are the odd numbers, the f’s are perfect squares. 

Note The letter j is sometimes useful to tell which number in / we are looking at. 
For this example the zeroth number is f 0 = 0 and the yth number is f = j 2 . This is a 
part of algebra, to give a formula for the f's instead of a list of numbers. We can also 
use j to tell which difference we are looking at. The first v is the first odd number 
t>! = I. The ;th difference is the jth odd number Vj = 2 j — 1. (Thus v 4 is 8 - 1 = 7.) It 
is better to start the differences with j= 1, since there is no zeroth odd number i? 0 . 

With this notation the yth difference is v j =f j —fj- l . Sooner or later you will get 
comfortable with subscripts like j and J — 1, but it can be later. The important point 
is that the sum of the i>’s equals — if.rsi - We now connect the v’s to slopes and the 
f’s to areas. 
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Figure 1.7 shows a natural way to graph Example 2, with the odd numbers in v and 
the squares in /. Notice an important difference between the r-graph and the /-graph. 
The graph of / is "piecewise linear," We plotted the numbers in / and connected 
them by straight lines. The graph of v is "piecewise constant " We plotted the differ- 
ences as constant over each piece. This reminds us of the distance-velocity graphs, 
when the distance /(f) ts a straight line and the velocity u(t) is a horizontal line. 

Now make the connection to slopes: 


The slope of the /graph is 


distance up 
distance across 


change in / 
change in t 


Over each piece, the change in f (across) is 1 . The change in / (upward) is the difference 
that we are calling v. The ratio is the slope vj 1 or just v. The slope makes a sudden 

change at the breakpoints f= 1,2,3 At those special points the slope of the 

/graph is not defined — we connected the e’s by vertical lines but this is very 
debatable. The main idea is that between the breakpoints, the slope of /(f) is u(f). 

Now make the connection to areas: 


The total area under the i-praph in / asl — / firsl . 

This area, underneath the staircase in Figure 1.7, is composed of rectangles. The base 
of every rectangle is 1. The heights of the rectangles are the e’s. So the areas also 
equal the u’s, and the total area is the sum of the u’s. This area is / sst — / fjrst . 

Even more is true. We could start at any time and end at any later time 
— not necessarily at the special times f = 0, 1,2, 3, 4. Suppose we stop at f = 3.5. 
Only half of the last rectangular area (under v = 7) will be counted. The total area is 
1 + 3 + 5 + 2(7) = 1 2.5. This still agrees with / as , — / first = 1 2.5 — 0. At this new ending 
time f = 3.5, we are only halfway up the last step in the /-graph. Halfway between 
9 and 16 is 12.5. 


1C The u’s are slopes of f{t). The area under the v-graph is /(( Md )— /(f 51art ). 


This is nothing less than the Fundamental Theorem of Calculus. But we have only 
used algebra (no curved graphs and no calculations involving limits). For now the 
Theorem is restricted to piecewise linear /(r) and piecewise constant c(f). In Chapter 5 
that restriction will be overcome. 

Notice that a proof of 1 + 3 + 5 + 7 = 4 2 is suggested by Figure 1.7a. The triangle 
under the dotted line has the same area as the four rectangles under the staircase. 
The area of the triangle is £■ base * height = ^-4 ■ 8, which is the perfect square 4 2 . 
When there are j rectangles instead of 4, we get j’j'2j=j 2 for the area. 
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The next examples show other patterns, where / and v increase exponentially or 
oscillate around zero. I hope you like them but I don’t think you have to learn them. 
They are like the special functions 2' and sin t and cos t — except they go in steps. 
You get a first look at the important functions of calculus, but you only need algebra. 
Calculus is needed for a steadily changing velocity, when the graph of f is curved. 

The last example will be income tax — which really does go, in steps. Then Sec- 
tion 1.3 will introduce the slope of a curve. The crucial step for curves is working 
with limits . That will take us from algebra to calculus. 

EXPONENTIAL VELOCITY AND DISTANCE 

Start with the numbers /= 1, 2,4, 8, 16. These are “powers of 2 ” They start with the 
zeroth power, which is 2° = 1. The exponential starts at 1 and not 0, After / steps there 
are j factors of 2, and f equals 2K Please recognize the difference between 2 j and j 2 
andlK The numbers 2 j grow linearly, the numbers j 2 grow quadratically, the numbers 
2 J grow exponentially. At ; = 10 these are 20 and 100 and 1024. The exponential 2 j 
quickly becomes much larger than the others. 

The differences of /= 1, 2, 4, 8, 16 are exactly v = 1, 2, 4, 8. We get the same beauti- 
ful numbers. When the f s are powers of 2, so are the u’s. The formula Uj-^2' -1 is 
slightly different from fj — 2\ because the first v is numbered (Then —2°= 1. 
The zeroth power of every number is 1, except that 0° is meaningless,) The two graphs 
in Figure LS use the same numbers but they look different, because / is piecewise 
linear and v is piecewise constant. 



1 2 3 4 12 3 4 

H0. 1.8 The velocity and distance grow exponentially (powers of 2). 


Where will calculus come in? It works with the smooth curve /(t) = 2 r , This expo- 
nential growth is critically important for population and money in a bank and the 
national debt. You can spot it by the following test: v{t) is proportional to /(£). 

Remark The function 2 r is trickier than t 2 . For f—t 2 the slope is v = 2 1, It is 
proportional to f and not f 2 . For /=2 r the slope is v — c2\ and we won’t find the 
constant c = ,693 ... until Chapter 6. (The number c is the natural logarithm of 2.) 
Problem 37 estimates c with a calculator — the important thing is that it’s constant. 

OSCILLATING VELOCITY AND DISTANCE 

We have seen a forward-back motion, velocity V followed by — K That is oscillation 
of the simplest kind. The graph of f goes linearly up and linearly down. Figure 1.9 
shows another oscillation that returns to zero, but the path is more interesting. 

The numbers in / are now 0, 1, 1,0, -1, — 1, 0. Since f 6 = 0 the motion brings us 
back to the start. The whole oscillation can be repeated. 
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The differences in v are l, 0, — 1* —1,0, 1, They add up to zero, which agrees with 
yiasi It Is the same oscillation as in / (and also repeatable), but shifted in time* 
The /-graph resembles (roughly) a sine curve . The u-graph resembles (even more 
roughly) a cosine curve * The waveforms in nature are smooth curves, while these are 
“digitized”— the way a digital watch goes forward in jumps. You recognize that the 
change from analog to digital brought the computer revolution. The same revolution 
is coming in CD players. Digital signals (off or on, 0 or I) seem to win every time. 

The piecewise v and / start again at t = 6* The ordinary sine and cosine repeat at 
t — 2n. A repeating motion is periodic— here the “period” is 6 or 2 ti. (With t in degrees 
the period is 360 — a full circle* The period becomes 2 n when angles are measured in 
radians * We virtually always use radians — which are degrees times 2^/360.) A watch 
has a period of 12 hours. If the dial shows am and pm, the period is * 



Fig. 1.9 Piecewise constant “cosine” and piecewise linear “sine.” They both repeat. 


A SHORT BURST OF SPEED 

The next example is a car that is driven fast for a short time. The speed is V until 
the distance reaches /= 1 , when the car suddenly stops. The graph of / goes up 
linearly with slope V, and then across with slope zero: 

f V up to t = T (Vt up to t=T 

v{t)= J f(t)={ 

[0 after t = T [1 after t=T 

This is another example of “function notation.” Notice the general time t and the 
particular stopping time T. The distance is /(r). The domain of / (the inputs) includes 
all times t ^ 0. The range of / (the outputs) includes all distances 0 1 . 

Figure 1.10 allows us to compare three cars — a Jeep and a Corvette and a Maserati. 
They have different speeds but they all reach /= 1. So the areas under the u-graphs 
are all 1. The rectangles have height V and base T = 1/K 



Hg.l A0 Bursts of speed with = FcT c = = 1. Step function has infinite slope. 

Optional remark It is natural to think about faster and faster speeds, which means 
steeper slopes. The /-graph reaches 1 in shorter times* The extreme case is a step 
function, when the graph of / goes straight up. This is the unit step U(t)> which is 
zero up to t = 0 and jumps immediately to V = 1 for t > 0* 
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What is the slope of the step function ? It is zero except at the jump. At that moment, 
which is r = 0, the slope is infinite. We don’t have an ordinary velocity u(t) — instead 
we have an impulse that makes the car jump. The graph is a spike over the single 
point t = 0, and it is often denoted by 6 — so the slope of the step function is called 
a “ delta function'' The area under the infinite spike is 1. 

You are absolutely not responsible for the theory of delta functions! Calculus is 
about curves, not jumps. 

Our last example is a real-world application of slopes and rates — to explain ‘'how 
taxes work.” Note especially the difference between tax rates and tax brackets and 
total tax. The rates are t>, the brackets are on x, the total tax is /. 

EXAMPLE 3 Income tax is piecewise linear. The slopes are the tax rates . 1 5, .28, .3 1 . 

Suppose you are single with taxable income of x dollars (Form 1040, line 37 — after 
all deductions). These are the 1991 instructions from the Internal Revenue Service: 

If x is not over $20,350, the tax is 15% of x. 

If $20,350 $x ^ $49,300, the tax is $3052.50 + 28% of the amount over $20,350. 

If x is over $49,300, the tax is $1 1,158.50 + 31% of the amount over $49,300. 

The first bracket is 0 < x $20,350. (The IRS never uses this symbol but I think 
it is OK here. We know what it means.) The second bracket is $20,350 < x ^ $49,300. 
The top bracket x Js $49,300 pays tax at the top rate of 31%. But only the income in 
that bracket is taxed at that rate. 

Figure 1.11 shows the rates and the brackets and the tax due. Those are not average 
rates, they are marginal rates. Total tax divided by total income would be the average 
rate. The marginal rate of .28 or .31 gives the tax on each additional dollar of income — 
it is the slope at the point x. Tax is like area or distance — it adds up. Tax rate is like 
slope or velocity — it depends where you are. This is often unclear in the news media. 



Fig. 1.11 The tax rate is i>, the total tax is /. Tax brackets end at breakpoints. 


Question What is the equation for the straight line in the top bracket? 

Answer The bracket begins at x = $49,300 when the tax is /(x) = $l 1,158.50. The 
slope of the line is the tax rate .31. When we know a point on the line and the slope, 
we know the equation. This is important enough to be highlighted. 


ID For x in the top bracket the tax is /(x) = $11,1 58.50 + .31 (x — $49,300). 
This is the tax on $49,300 plus the extra tax on extra income. 


Section 2.3 presents this “point-slope equation" for any straight line. Here you see it 
for one specific example. Where does the number $1 1,158.50 come from? It is the tax 
at the end of the middle bracket, so it is the tax at the start of the top bracket. 
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Figure LI l also shows a distance- velocity example. The distance at t = 2 is 
/( 2) = 40 miles. After that time the velocity is 60 miles per hour. So the line with 
slope 60 on the /-graph has the equation 

/(f) = starting distance + extra distance = 40 + 60(£ - 2). 

The starting point is (2, 40), The new speed 60 multiplies the extra time t — 2. The 
point-slope equation makes sense. We now review this section, with comments. 

Central idea Start with any numbers in /, Their differences go in v . Then the sum 
of those differences is /i ast “/first- 

Subscript notation The numbers are / 0 , f u ... and the first difference is v x =f x — / 0 . 
A typical number is fj and the j'th difference is — fj^ x . When those differences 
are added, all /’ s in the middle (like f x ) cancel out: 

r l + v 2 + = (/ 1 -/o) + </2 _ /l) + "' + (/ =fj _ /o‘ 

Examples fj=j oij 2 or 2K Then = 1 (constant) or 2j — 1 (odd numbers) or 2 J_1 . 

Functions Connect the /’ s to be piecewise linear. Then the slope v is piecewise 
constant. The area under the u-graph from any £ start to any r end equals /(f end ) — /(f alart ). 

Units Distance in miles and velocity in miles per hour. Tax in dollars and tax rate 
in (dollars paid) /(dollars earned). Tax rate is a percentage like .28, with no units. 


1.2 EXERCISES 


Read-through questions 

Start with the numbers /= 1, 6, 2, 5. Their differences are 
v = ° . The sum of those differences is *> . This is equal 

to /„! minus ,, c . The numbers 6 and 2 have no effect on 

this answer, because in (6 — 1) -f- (2 — 6) + (5 — 2) the numbers 
6 and 2 d . The slope of the line between /( 0) = l and 
/(l) = 6 is * . The equation of that line is fit) — f . 

With distances 1, 5, 25 at unit times, the velocities are 
0 . These are the h of the /-graph. The slope of the 

tax graph is the tax * , If /(t) is the postage cost for f 

ounces or t grams, the slope is the I per * . For 

distances 0, 1,4,9 the velocities are i The sum of the 
first j odd numbers is /}= m . Then f lf> is » and the 
velocity i? 10 is o . 

The piecewise linear sine has slopes p . Those form a 
piecewise Q cosine. Both functions have r equal to 
6, which means that f{t + 6) = * for every i. The veloci- 

ties v= L 2*4, 8, ... have vj = l . In that case / 0 = 1 and 
f s = u . The sum of 1, 2, 4, 8, 16 is v . The difference 
2 J — 2 J ~ 1 equals w . After a burst of speed V to time T, 
the distance is * If f{T) — 1 and V increases, the burst 
lasts only to T= v . When V approaches infinity, f(t) 
approaches a * function. The velocities approach a 
* function, which is concentrated at t = 0 but has area 

B under its graph. The slope of a step function is c . 


Problems 1-4 are about numbers / and differences v. 

1 From the numbers /= 0, 2, 7, 10 find the differences v and 

the sum of the three d s. Write down another / that leads 
to the same u*s. For /=G, 3, 12, 10 the sum of the v’s is 
still . 

2 Starting from /= 1, 3, 2, 4 draw the /’graph (linear pieces) 
and the u-graph. What are the areas “under” the i>graph that 
add to 4 — 1? If the next number in / is 1 1, what is the area 
under the next u? 

3 From u=l,2, 1,0, — 1 find the /’ s starting at / 0 = 3. 

Graph v and / The maximum value of / occurs when 
v = , Where is the maximum / when v= 1,2, 1, “1? 

4 For f=\,b, c f 7 find the differences and add 

them up. Do the same for /= a, b, c, 7. Do the same for 
/= a, b , c, d. 

Problems 5-11 are about linear fuoctions and constant slopes. 

5 Write down the slopes of these linear functions: 

(a)/(f)=l.L (h) /(£)= 1 - 2t (c) /(r)^4 + 5(t-6). 

Compute /( 6) and /( 7) for each function and confirm that 
/(7) — /( 6) equals the slope, 

6 If /(f) = 5 -h 3(i — 1) and g(f)= 1.5 + 2.5(f- 1) what is 
h(t) = /(*) — g(0? Find the slopes of /, g, and h . 
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7 Suppose t?(f ) = 2 for f < 5 and t?(£) = 3 for £ > 5. 

(a) If /(0) = 0 find a two-part formula for /(f). 

(b) Check that /( 10) equals the area under the graph of 
u(0 (two rectangles) up to t = 10, 

8 Suppose u(f) = 10 for f < 1/10, r(£) = 0 for f > 1/10, Start- 
ing from /(0)= 1 find /(f) in two pieces, 

9 Suppose g(f) = 2f + 1 and /(f) = 4r. Find g (3) and /(g(3)) 
and /(g(r)). How is the slope of /(g(f)) related to the slopes 
of / andg? 

10 For the same functions, what are /(3) and g(/(3)} and 

£(/{*))? When t is changed to 4 £, distance increases 

times as fast and the velocity is multiplied by . 

11 Compute /( 6) and /(8) for the functions in Problem 5. 
Confirm that the slopes v agree with 

, _ /(B)-/ (6) _ change in / 

SOpC 8-6 change in t 1 

Problems 12-18 are based on Example 3 about income taxes. 

12 What are the income taxes on x = $10,000 and 
x = $30,000 and x = $50,000? 

13 What is the equation for income tax /(x) in the second 
bracket $20,350 < x < $49,300? How is the number 11,158.50 
connected with the other numbers in the tax instructions? 

14 Write the tax function F(x) for a married couple if the IRS 
treats them as two single taxpayers each with taxable income 
x/2. (This is not done,) 

15 In the 15% bracket, with 5% state tax as a deduction, the 

combined rate is not 20% but . Think about the tax 

on an extra $100. 

16 A piecewise linear function is continuous when /(f) at the 
end of each interval equals /(f) at the start of the following 
interval. If — up to f = 1 and v(t) = 2 for t > 1, define 
/ beyond t = 1 so it is (a) continuous (b) discontinuous, 

(c) Define a tax function /(x) with rates .15 and .28 so you 
would lose by earning an extra dollar beyond the breakpoint. 

17 The difference between a tax credit and a deduction from 
income is the difference between fix) — c and f(x — d). Which 
is more desirable, a credit of c = $l0Q0 or a deduction of 
d — $1000, and why? Sketch the tax graphs when f{x) = .15x. 

18 The average tax rate on the taxable income x is a(x) = 

/(x)/x. This is the slope between (0, 0) and the point (x, /(x)). 
Draw a rough graph of a(x). The average rate a is below the 
marginal rate v because _ r . 

Problems 19-30 involve numbers / 0 , /, / 2 , ... and tbeir differ- 
ences Vj=fj— /-i* They give practice with subscripts 0, ..., j. 

19 Find the velocities v lt v 2t and formulas for Vj and fy 

(a) /= 1,3, 5, 7... (b)/=0, 1,0,1,... (c) /= 0, i, i, J, ... 


20 Find / , / 2 , h and a formula for / with / 0 — 0: 

(a) u = I, 2, 4, 8, ... (b) » = -l,U-U,... 

21 The areas of these nested squares are l 2 , 2 2 , 3 2 What 

are the areas of the L-shaped bands (the differences between 
squares)? How does the figure show that 1 + 3 + 5 + 7 = 4 2 ? 




22 From the area under the staircase (by rectangles and then 
by triangles) show that the first j whole numbers l to j add 
up to i/ 2 + ij. Find 1 + 2 + -■* + 100. 

23 If 17=1*3,5,... then If 17=1,1,1,... then / = 

. Add those to find the sum of 2, 4, 6, ..., 2/ Divide 

by 2 to find the sum of 1, 2, 3 / (Compare Problem 22.) 

24 True (with reason) or false (with example). 

(a) When the /’ s are increasing so are the u’s. 

(b) When the u’s are increasing so are the /’s. 

(c) When the /* s are periodic so are the i7’s. 

(d) When the u’s are periodic so are the f s. 

25 If /(f) = t 2 , compute /{ 99) and /(101). Between those 
times, what is the increase in / divided by the increase in f? 

26 If f{t) = f 1 + f, compute /(99) and /( 101). Between those 
times, what is the increase in / divided hy the increase in f? 

27 If ft = j 2 +/ + 1 find a formula for 

28 Suppose the increase by 4 at every step. Show by 
example and then by algebra that the “second difference* 
fj + 1 - Vi +fj - 1 equals 4. 

29 Suppose f 0 = 0 and the e’s are 1, i, i, {, { For 

which j does / = 5? 

30 Show that — 2/+/-1 always equals u i+1 — vj. If 

v is velocity then a stands for . 

Problems 31-34 involve periodic /’s and u’s (like sinf and 
cos f). 

31 For the discrete sine /= 0, 1, 1, 0, — 1, — 1, 0 find the 
second differences a l =f 2 — 2 f x + / 0 and a 2 —f z — lf 2 + f x and 

Compare aj with /. 

32 If the sequence Vi , v 2 , has period 6 and w lt w 2 , ... has 
period 10, what is the period of i7 4 + w lt v 2 + w 2i ...? 

33 Draw the graph of /(f) starting from f Q — 0 when v = 1, 
-1,-1, 1. If v has period 4 find /( 12), /(13),/(100.1). 
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34 Graph /(f) from / 0 = 0 to / 4 — 4 when v = 1, 2, 1, 0. If v 
has period 4, find /( 12) and /(14) and /(16)* Why doesn't/ 
have period 4? 


44 Graph the square wave U(f) — U(t — 1)* If this is the veloc- 
ity graph the distance /(f). If this is the distance /(f), 
graph the velocity. 


Problems 35^42 are about exponential i2s and 

35 Find the v*s for /= 1, 3, 9, 27* Predict r 4 and Vj. Algebra 
gives 3 J — 3 j ~' =(3 — l)3 J ~ 1 . 

36 Find 1 + 2 + 4 + — +32 and also 1 + i + i + — + Jj. 


37 Estimate the slope of /(t) = 2* at £ = 0. Use a calculator 
to compute (increase in /)/(increase in f) when f is small: 


m-m 

f 


2 - 1 

~r 


and : 


2 1 — 1 


and 


2 01 - 1 
.01 


and 


2^01 _ i 
.001 


38 Suppose / 0 = 1 and Vj = 2/_ lr Find / 4 * 

39 (a) From f=Uhi*k find ri,u 2l and predict t^* 

(b) Check / 3 -f 0 = v A + v 2 + v 3 and f } — /}_ x = v } , 

40 Suppose v s = rK Show that fj = (r j+1 — l)/(r- 1) starts 
from /o = 1 and has fj — f } _ ^ = Vj. (Then this is the correct 
/ = 1 + r + * ■ ■ + r s = sum of a geometric series.) 

41 From / = (— 1) J compute u j+ What is v x + v 2 + ■■■ + 17 ? 

42 Estimate the slope of f(t) = e t at t = 0 , Use a calculator 
that knows e (or else take e = 2.78) to compute 


45 Two bursts of speed lead to the same distance /= 10: 

v — to f = .001 v = V to f = . 

As ao the limit of the /(f)*s is . 

46 Draw the staircase function C/(t) + U{t — I) + U(t - 2). Its 

slope is a sum of three functions. 

47 Which capital letters like L are the graphs of functions 
when steps are allowed? The slope of L is minus a delta func- 
tion. Graph the slopes of the others. 

48 Write a subroutine FINDV whose input is a sequence 

/ 0 , /j ? .*>, f H and whose output is Include 

graphical output if possible. Test on fj = 2j and j 2 and 2K 

49 Write a subroutine FINDF whose input is v l7 and 
/o, and whose output is / 0 ,/ 1? .**,/*. The default value of / 0 
is zero. Include graphical output if possible* Test — j, 

50 If FINDV is applied to the output of FINDF, what 
sequence is returned? If FINDF is applied to the output of 
FINDV, what sequence is returned? Watch / 0 . 


m-n 0 ) 

f 


e-\ 

1 


and 


<?■*-! 


and 


<r 01 - 1 
*01 


51 Arrange 2 j and j 2 and 2 j and ^fj in increasing order 
(a) when j is large: j = 9 (b) when j is small: j = i. 


Problems 43-47 are about 17(f) — step from 0 to 1 at f = 0. 

43 Graph the four functions U(t — t) and U{t) — 2 and U(3t) 
and 4[/(f). Then graph /(r) — 4t/(3f — 1) — 2. 


52 The average age of your family since 1970 is a piecewise 
linear function ^(t). Is it continuous or does it jump? What 
is its slope? Graph it the best you can. 



1.3 The Velocity at an Instant 



We have arrived at the central problems that calculus was invented to solve* There 
are two questions, in opposite directions, and I hope you could see them coming. 

1* If the velocity is changing, how can you compute the distance traveled! 

2, If the graph of f(t) is not a straight line, w hat is its slope ? 

Find the distance from the velocity, find the velocity from the distance* Our goal is 
to do both — but not in one section* Calculus may be a good course, but it is not 
magic* The first step is to let the velocity change in the steadiest possible way. 

Question 1 Suppose the velocity at each time t is u(t) = 2t. Find /( r)* 

With [7- 2t, a physicist would say that the acceleration is constant (it equals 2)* The 
driver steps on the gas, the car accelerates, and the speedometer goes steadily up* 
The distance goes up too — faster and faster* If we measure r in seconds and v in feet 
per second, the distance / comes out in feet. After 10 seconds the speed is 20 feet 
per second. After 44 seconds the speed is 88 feet/second (which is 60 miles/hour). 
The acceleration is clear, hut how far has the car gone! 
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Question 2 The distance traveled by time t is fit) = I 2 . Find the velocity t>(f). 

The graph of /(t) = t 2 is on the right of Figure 1.12. It is a parabola. The curve starts 
at zero, when the car is new. At t = 5 the distance is /= 25. By t = 10, f reaches 100. 

Velocity is distance divided by time, but what happens when the speed is changing? 
Dividing / = 100 by t = 10 gives c = 10 — the average velocity over the first ten 
seconds. Dividing /= 121 by t= 11 gives the average speed over 11 seconds. But how 
do we find the instantaneous velocity — the reading on the speedometer at the exact 
instant when t- 10? 





Fig. 1.12 The velocity u = 2ms linear. The distance /= t 1 is quadratic. 


I hope you see the problem. As the car goes faster, the graph of t 2 gets steeper — 
because more distance is covered in each second. The average velocity between t — 10 
and t= 11 is a good approximation — but only an approximation— to the speed at 
the moment t - 10. Averages are easy to find: 

distance at f = 10 is /(10) = 10 2 = 100 distance at r = 11 is /(ll) = ll 2 = 121 


average velocity is 


mr -/go) 

11-10 


121 - 100 
1 


= 21 . 


The car covered 21 feet in that 1 second. Its average speed was 21 feet/second. Since 
it was gaining speed, the velocity at the beginning of that second was below 21. 


Geometrically, what is the average? It is a slope, but not the slope of the curve. 
The average velocity is the slope of a straight line . The line goes between two points 
on the curve in Figure 1.12. When we compute an average, we pretend the velocity 
is constant — so we go back to the easiest case. It only requires a division of distance 
by time: 


average velocity = 


change in / 
change in t 


(1) 


Calculus and the Law You enter a highway at 1 : 00. If you exit 150 miles away at 
3 : 00, your average speed is 75 miles per hour. Fm not sure if the police can give you 
a ticket. You could say to the judge, "‘When was I doing 75?” The police would have 
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to admit that they have no idea — but they would have a definite feeling that you 
must have been doing 75 sometime, t 


We return to the central problem — computing v(10) at the instant f = 10. The 
average velocity over the next second is 21. We can also find the average over the 
half-second between t = 10.0 and t = 10.5* Divide the change in distance by the change 
in time: 


/(10.5) -/{ 10*0) _ (10*5) 2 - (10.0) 2 
10.5-10.0 *5 


110*25-100 

.5 


20.5. 


That average of 20.5 is closer to the speed at t = 10. It is still not exact* 

The way to find u(10) is to keep reducing the time interval This is the basis for 
Chapter 2, and the key to differential calculus. Find the slope between points that are 
closer and closer on the curve. The “limit” is the slope at a single point. 

Algebra gives the average velocity between t = 10 and any later time t - 10 + h. 
The distance increases from 10 2 to (10 + ft) 2 . The change in time is ft. So divide: 

(10 + ft ) 2 - 10 2 100 + 20ft + ft 2 - 100 ^ , 

iw ra gt = 7 = r = 20 + ft. (2) 


This formula fits our previous calculations. The interval from t= 10 to t= 11 had 
ft = 1 , and the average was 20+ ft ~ 21* When the time step was h — % t the average 
was 20 + £ = 20*5* Over a millionth of a second the average will be 20 plus 
1/1,000,000 — which is very near 20. 


Conclusion: The velocity at t= 10 is v = 20. Tftar is the slope of the curve. It agrees 
with the u-graph on the left side of Figure 1,12, which also has r{10) = 20, 

We now show that the two graphs match at all times. If /(r) = f 2 then v{t)~ 2 1. 
You are seeing the key computation of calculus, and we can put it into words before 
equations. Compute the distance at time t + ft, subtract the distance at time t, and 
divide by ft* That gives the average velocity: 


= 


/(f + ft)-/(f)_(t + ft) 2 -^ _ t 2 + 2fft + ft 3 - t 2 


= 2r + ft. 


(3) 


This fits the previous calculation, where t was 10. The average was 20 + ft* Now the 
average is 2r + ft. It depends on the time step ft, because the velocity is changing. But 
we can see what happens as ft approaches zero * The average is closer and closer to 
the speedometer reading of 2t, at the exact moment when the clock shows time t: 


IE As ft approaches zero, the average velocity 2t + ft approaches v(t) - 2f, 


Note The computation (3) shows how calculus needs algebra. If we want the whole 
u-graph, we have to let time be a 41 variable It is represented by the letter f. Numbers 
are enough at the specific time t = 10 and the specific step ft — 1 — but algebra gets 
beyond that. The average between any t and any r + ft is 2t + ft. Please don't hesitate 
to put back numbers for the letters — that checks the algebra* 


tThis is our first encounter with the much despised “Mean Value Theorem.” If the judge can 
prove the theorem, you are dead. A few u-graphs and /-graphs will confuse the situation 
(possibly also a delta function). 
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There is also a step beyond algebra! Calculus requires the limit of the average. As 
h shrinks to zero, the points on the graph come closer. “Average over an interval” 
becomes “velocity at an instant.” The general theory of limits is not particularly 
simple, but here we don’t need it. (It isn’t particularly hard either.) In this example 
the limiting value is easy to identify. The average 2 t + h approaches 2t, as h -*■ 0. 

What remains to do in this section? We answered Question 2 — to find velocity 
from distance. We have not answered Question 1 . If r(t) = 2 1 increases linearly with 
time, what is the distance? This goes in the opposite direction (it is integration). 

The Fundamental Theorem of Calculus says that no new work is necessary. If the 
slope of f{t) leads to £>(r), then the area under that v-graph leads back to the f-graph. 
The odometer readings f=t 2 produced speedometer readings v = 2 1. By the Funda- 
mental Theorem, the area under 2r should be t 2 . But we have certainly not proved 
any fundamental theorems, so it is better to be safe — by actually computing the area. 

Fortunately, it is the area of a triangle. The base of the triangle is t and the height 
is v = 2f. The area agrees with f(t): 

area = j(base)(height) = £(f)(2t) = t 2 . (4) 

EXAMPLE 1 The graphs are shifted in time. The car doesn’t start until t = 1 . Therefore 
v = 0 and /= 0 up to that time. After the car starts we have v = 2(r - 1) and 
f=(t — l) 2 . You see how the time delay of 1 enters the formulas. Figure 1.13 shows 
how it affects the graphs. 



EXAMPLE 2 The acceleration changes from 2 to another constant a. The velocity 
changes from v — 2t to v = at. The acceleration is the slope of the velocity curve\ The 
distance is also proportional to a. but notice the factor 

acceleration a velocity v = at -*=> distance / = %at 2 . 

If a equals 1, then v — t and /= j t 2 . That is one of the most famous pairs in calculus. 
If a equals the gravitational constant g, then v = gt is the velocity of a falling body. 
The speed doesn't depend on the mass (tested by Galileo at the Leaning Tower of 
Pisa). Maybe he saw the distance /= \gt 2 more easily than the speed v = gt. Anyway, 
this is the most famous pair in physics. 
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EXAMPLE 3 Suppose f(t) = 3t + t 2 . The average velocity from t to ( + k is 

_ fit + h) ~f(t) _ 3{t + k) + (t + h) 2 - 3 1 - t 2 
h h 

The change in distance has an extra (coming from 3(r + /i) minus 3f). The velocity 
contains an additional 3 (coming from 3h divided by h). When 3t is added to the 
distance, 3 is added to the velocity. If Galileo had thrown a weight instead of dropping 
it, the starting velocity v G would have added v 0 t to the distance. 

FUNCTIONS ACROSS TIME 

The idea of slope is not difficult — for one straight line. Divide the change in / by 
the change in r. In Chapter 2, divide the change in y by the change in x , Experience 
shows that the hard part is to see what happens to the slope as the line moves. 
Figure 1.14a shows the line between points A and B on the curve. This is a “secant 
line.” Its slope is an average velocity. What calculus does is to bring that point B 
down the curve toward A . 





fig. T14 Slope of line, slope of curve. Two velocity graphs. Which is whichl 


Question 1 What happens to the “change in /” — -the height of B above A? 
Answer The change in / decreases to zero. So does the change in r. 

Question 2 As B approaches A , does the slope of the line increase or decrease? 
Answer I am not going to answer that question. It is too important. Draw another 
secant line with B closer to A, Compare the slopes. 

This question was created by Steve Monk at the University of Washington — where 
57% of the class gave the right answer. Probably 97% would have found the right 
slope from a formula. Figure 1,14b shows the opposite problem. We know the veloc- 
ity, not the distance. But calculus answers questions about both functions. 

Question 3 Which car is going faster at time t = 3/4? 

Answer Car C has higher speed. Car D has greater acceleration. 

Question 4 If the cars start together, is D catching up to C at the end? Between 
t = j and r= 1, do the cars get closer or further apart? 

Answer This time more than half the class got it wrong. You won’t but you can see 
why they did. You have to look at the speed graph and imagine the distance graph. 
When car C is going faster, the distance between them . 



1.3 The Velocity at an Instant 


21 


To repeat: The cars start together, but they don’t finish together. They reach the 
same speed at t = 1, not the same distance. Car C went faster. You really should draw 
their distance graphs, to see how they bend. 

These problems help to emphasize one more point. Finding the speed (or slope) is 
entirely different from finding the distance (or area): 

1- To find the slope of the / graph at a particular time f, you dorit have to know 
the whole history. 

2. To find the area under the u-graph up to a particular time f, you do have to 
know the whole history. 

A short record of distance is enough to recover u(f). Point B moves toward point A . 
The problem of slope is local — the speed is completely decided hy f(t) near point A . 

In contrast, a short record of speed is not enough to recover the total distance. We 
have to know what the mileage was earlier. Otherwise we can only know the increase 
in mileage, not the total. 


1.3 EXERCISES 


Read-thTough questions 

Between the distances /( 2) = 100 and /(6) = 200, the average 
velocity is q . If f(t)=^t 2 then f(6)= b and 

/(8) = c . The average velocity in between is d . The 

instantaneous velocities at t — 6 and t = 8 are and 

1 

The average velocity is computed from f(t) and f(t + h) by 
v^c = A If f(t) = t 2 then t? fw = h . From t=\ to 
t = 1.1 the average is [ The instantaneous velocity 
is the 1 of i? 8Vt . If the distance is f(t) = jQt 2 then the 
velocity is u(t) — * and the acceleration is 1 

On the graph of /(/), the average velocity between A and 
B is the slope of m . The velocity at A is found by n 
The velocity at B is found by £ . When the velocity is 
positive, the distance is P , When the velocity is increas- 
ing, the car is Q . 

1 Compute the average velocity between r = 5 and t = 8: 

(a) f{t) = 6f (b) f{t) = 6t + 2 

(c) f(t) = $at 2 (d)/(f)=r-f 2 

(e) /(0 = 6 (0 ■>(/) = 2r 

2 For the same functions compute [f(t + h)—f(t)]/h. This 
depends on t and k. Find the limit as h 0. 

3 If the odometer reads f{t) = t 2 + t (f in miles or kilo* 
meters, t in hours), find the average speed between 

(a) t—\ and t - 2 

(b) t = l and f= 1.1 

(c) r = 1 and r = 1 + h 

(d) t = 1 and t - .9 (note h = — .l) 


4 For the same f(t) — t 2 + f, find the average speed between 

(a) t = 0 and 1 (b) t — 0 and \ (c) t = 0 and h. 

5 In the answer to 3(c), find the limit as h -► 0. What does 
that limit tell us? 

6 Set h = 0 in your answer to 4(c). Draw the graph of 
f{t) = f 2 4. r and show its slope at t = 0. 

7 Draw the graph of t?(f)=l+2t. From geometry find 
the area under it from 0 to f. Find the slope of that area 
function f{t). 

S Draw the graphs of t>(0 — 3 — 2 1 and the area /(r). 

9 True or false 

(a) If the distance f(t) is positive, so is t?(r). 

(b) If the distance f(t ) is increasing, so is u{t). 

(c) If /(f) is positive, ir(f) is increasing. 

(d) If t?(f) is positive, /(r) is increasing. 

10 If f(t) = 6t 2 find the slope of the /-graph and also the 

^graph. The slope of the y-graph is the . 

11 If fit) = t z what is the average velocity between r = .9 and 
t =■ 1.1? What is the average between t — h and t -F h ? 

12 (a) Show that for /(f) = \at 2 the average velocity between 
t — h and r 4-' h is exactly the velocity at t. 

(b) The area under t?(r) = at from t - h to t + h is exactly 
the base 2 h times . 

13 Find /(r) from t?(r) = 20t if /(0) = 12. Also if /( 1) = 12. 

14 True or false , for any distance curves. 

(a) The slope of the line from >1 to 5 is the average velocity 
between those points. 
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(b) Secant lines have smaller slopes than the curve, 

(c) If /(f) and F(f) start together and finish together, the 
average velocities are equal. 

(d) If i>(/) and V(t) start together and finish together, the 
increases in distance are equal. 

15 When you jump up and fall back your height is y = 2t — t 2 
in the right units. 

(a) Graph this parabola and its slope. 

(b) Find the time in the air and maximum height. 

(c) Prove: Half the time you are above y = 

Basketball players “hang” in the air partly because of (c). 

16 Graph /(t) = t 2 and g(t)=/(t)-2 and A(f)=/(2f), all 
from t = 0 to t — 1. Find the velocities. 

17 (Recommended) An up and down velocity is v(t) = 2t for 
t < 3, u(f) = 12 — 2/ for / ^ 3. Draw the piecewise parabola 
/(t). Check that /( 6) = area under the graph of u(f). 

18 Suppose v(t) — t for t ^ 2 and u(t) = 2 for t ^ 2. Draw the 
graph of f(t) out to t = 3. 

19 Draw /(/) up to t — 4 when i>(/) increases linearly from 

(a) 0 to 2 (b) — 1 to 1 (c) — 2 to 0. 

20 (Recommended) Suppose t>(/) is the piecewise linear sine 
function of Section 1,2. (In Figure 1.8 it was the distance.) 


Find the area under t;(f) between t = 0 and t = 1, 2, 3, 4, 5, 6, 
Plot those points /(l), ..., /( 6) and draw the complete piece- 
wise parabola f(t\ 

21 Draw the graph of /(f) = |I — 1 2 \ for 0</<2. Find a 
three-part formula for i>(f). 

22 Draw the graphs of f(t) for these velocities (to t= 2): 

(a) v(t) = l-t 

(b) v(t) = \\-t\ 

(c) + 

23 When does /(/) = t 2 — 3t reach 10? Find the average 
velocity up to that time and the instantaneous velocity at that 
time. 

24 If = bt + c* what is r(f)? What is the slope of 
r{/)7 When does /(f) equal 41, if a = b = c = 1? 

25 If f{t) = t 2 then i?(£) — 2t. Does the speeded-up function 
/(4f) have velocity t?(4f ) or 4v(t) or 4u(4t)? 

26 If f(t) — f — t 2 find »(t) and /(3f). Does the slope of /(3t) 
equal i>(3r) or 3u(r) or 3i?(3f)? 

27 For f(t) = t 2 find i7, vc (t) between 0 and f. Graph u BVC (/) 
and u(r). 

28 If you know the average velocity t\ Vfi (f), how can you find 
the distance /(f)? Start from /(0) = 0. 


1 .4 Circular Motion 


This section introduces completely new distances and velocities — the sines and cosines 
from trigonometry. As I write that last word, I ask myself how much trigonometry it 
is essential to know. There will be the basic picture of a right triangle, with sides cos t 
and sin / and I. There will also be the crucial equation (cos t) 2 + (sin f) 2 = 1, which 
is Pythagoras’ law a 1 + b 2 = c 2 . The squares of two sides add to the square of the 
hypotenuse (and the 1 is really l 2 ). Nothing else is needed immediately. If you don't 
know trigonometry, don’t stop — an important part can be learned now. 

You will recognize the wavy graphs of the sine and cosine. We intend to find the 
slopes of those graphs . That can be done without using the formulas for sin(x + y) 
and cos(x + y) — which later give the same slopes in a more algebraic way. Here it is 
only basic things that are needed.! And anyway, how complicated can a triangle be? 

Remark You might think trigonometry is only for surveyors and navigators (people 
with triangles). Not at all! By far the biggest applications are to rotation and vibration 
and oscillation . It is fantastic that sines and cosines are so perfect for “repeating 
motion” — around a circle or up and down. 


tSines and cosines are so important that I added a review of trigonometry in Section 1.5. But 
the concepts in this section can be more valuable than formulas. 
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Fig. 1.15 As the angle t changes, the graphs show the sides of the right triangle. 


Our underlying goal is to offer one more example in which the velocity can be 
computed by common sense. Calculus is mainly an extension of common sense, but 
here that extension is not needed. We will find the slope of the sine curve. The straight 
line / = vt was easy and the parabola / = \ax % was harder. The new example also 
involves realistic motion, seen every day. We start with circular motion , in which the 
position is given and the velocity will be found. 

A bait goes around a circle of radius one. The center is at x = 0, y = 0 (the origin). 
The x and y coordinates satisfy x 2 + y 2 = l 2 , to keep the ball on the circle. We specify 
its position in Figure 1.16a by giving its angle with the horizontal. And we make the 
ball travel with constant speed, by requiring that the angle is equal to the time r. The 
ball goes counterclockwise. At time 1 it reaches the point where the angle equals 1. 
The angle is measured in radians rather than degrees, so a full circle is completed at 
i = In instead of t = 360. 

The ball starts on the x axis, where the angle is zero. Now find it at time f: 

The hall is at the point where x = cos t and y = sin t. 

This is where trigonometry is useful. The cosine oscillates between 1 and - 1, as the 
ball goes from far right to far left and back again. The sine also oscillates between 1 
and —1, starting from sin 0 = 0. At time te/2 the sine (the height) increases to one. 
The cosine is zero and the ball reaches the top point x = 0, y = 1 . At time it the cosine 
is - 1 and the sine is back to zero — the coordinates are (-1, 0). At t = 2n the circle 
is complete (the angle is also 2tt) t and .v = cos 2n = h y — sin In = 0, 
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Important point: The distance around the circle (its circumference) is 2 nr — 27e, 
because the radius is L The ball travels a distance 2n in a time 2n r The speed equals 
1. It remains to find the velocity, which involves not only speed but direction , 

Degrees vs . radians A full circle is 360 degrees and 2n radians. Therefore 

1 radian = 360/2rc degrees & 573 degrees 

1 degree = 27e/360 radians & .01745 radians 

Radians were invented to avoid those numbers! The speed is exactly 1, reaching t 
radians at time t. The speed would be .01745, if the ball only reached t degrees. The 
ball would complete the circle at time T = 360. We cannot accept the division of the 
circle into 360 pieces {by whom?), which produces these numbers. 

To check degree mode vs. radian mode, verify that sin 1° ^ .017 and sin 1 ^ .84. 

VELOCITY OF THE BALL 

At time f, which direction is the ball going? Calculus watches the motion between t 
and t + h. For a ball on a string, we don’t need calculus — just let go. The direction 
of motion is tangent to the circle . With no force to keep it on the circle, the ball goes 
off on a tangent . If the ball is the moon, the force is gravity. If it is a hammer swinging 
around on a chain, the force is from the center. When the thrower lets go, the hammer 
takes off — and it is an art to pick the right moment. (I once saw a friend hit by a 
hammer at MIT. He survived, but the thrower quit track,) Calculus will find that 
same tangent direction, when the points at t and t + h come close. 

The u velocity triangle” is in Figure 1.16b. It is the same as the position triangle, 
but rotated through 90°, The hypotenuse is tangent to the circle, in the direction the 
ball is moving. Its length equals 1 (the speed). The angle t still appears, but now it is 
the angle with the vertical. The upward component of velocity is cos f, when the upward 
component of position is sin f. That is our common sense calculation, based on a 
figure rather than a formula. The rest of this section depends on it — and we check 
v = cos t at special points. 

At the starting time f = 0, the movement is all upward. The height is sin 0 = 0 and 
the upward velocity is cos 0 = 1. At time n/2, the ball reaches the top. The height is 
sin nj2 = 1 and the upward velocity is cos rc/2 = 0. At that instant the ball is not 
moving up or down. 

The horizontal velocity contains a minus sign. At first the ball travels to the left. 
The value of x is cos £, but the speed in the x direction is - sin r. Half of trigonometry 
is in that figure (the good half), and you see how sin 2 f + cos 3 r= 1 is so basic. 
That equation applies to position and velocity, at every time. 

Application of plane geometry: The right triangles in Figure 1.16 are the same size 
and shape. They look congruent and they are — the angle t above the ball equals the 
angle t at the center. That is because the three angles at the ball add to 180°. 

OSCILLATION: UP AND DOWN MOTION 

We now use circular motion to study straight-line motion. That line will be the y axis. 
Instead of a ball going around a circle, a mass will move up and down. It oscillates 
between y = 1 and y = — 1. The mass is the “ shadow of the bail” as we explain in a 
moment. 
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There is a jumpy oscillation that we do not want, with v= 1 and v = — L That 
“bang-bang 1 ’ velocity is like a billiard ball, bouncing between two walls without 
slowing down. If the distance between the walls is 2, then at t = 4 the ball is back to 
the start. The distance graph is a zigzag (or sawtooth) from Section 1.2. 

We prefer a smoother motion. Instead of velocities that jump between + 1 and — 1, 
a real oscillation s/ows down to zero and gradually builds up speed again. The mass 
is on a spring, whicb pulls it back. The velocity drops to zero as the spring is fully 
stretched. Then v is negative, as the mass goes the same distance in the opposite 
direction. Simple harmonic motion is the most important back and forth motion, 
while f- vt and / = ^ at 2 are the most important one-way motions. 



Rg.1 .17 Circular motion of the ball and harmonic motion of the mass (its shadow). 


How do we describe this oscillation? The best way is to match it with the ball on 
the circle. The height of the bail will be the height of the mass. The “shadow of the 
hall” goes up and down, level with the ball. As the ball passes the top of the 
circle, the mass stops at the top and starts down. As the ball goes around the bottom, 
the mass stops and turns back up the y axis. Halfway up (or down), the speed is 1. 

Figure 1.17a shows the mass at a typical time t. The height is y =/(r) = sin r, level 
with the ball. This height oscillates between /= 1 and /= — 1. But the mass does not 
move with constant speed. The speed of the mass is changing although the speed of 
the bail is always 1. The time for a full cycle is still 2 n, but within that cycle the mass 
speeds up and slows down. The problem is to find the changing velocity u. Since the 
distance is /= sin f, the velocity will be the slope of the sine curve . 

THE SLOPE OF THE SINE CURVE 

At the top and bottom (t = n/2 and t = 3rc/2) the ball changes direction and v = 0. 
The slope at the top and bottom of the sine curve is zero.i At time zero, when the ball 
is going straight up, the slope of the sine curve is v = 1 . At t = n, when the ball and 
mass and /-graph are going down, the velocity is v = — 1 . The mass goes fastest at 
the center. The mass goes slowest (in fact it stops) when the height reaches a maximum 
or minimum. The velocity triangle yields v at every time r. 

To find the upward velocity of the mass, look at the upward velocity of the ball. 
Those velocities are the same! The mass and ball stay level, and we know v from 
circular motion: The upward velocity is v — cos f. 


tThat looks easy but you will see later that it is extremely important. At a maximum or 
minimum the slope is zero. The curve levels off. 
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Figure 1.18 shows the result we want. On the right, /= sin t gives the height. On 
the left is the velocity v = cos t . That velocity is the slope of the /-curve. The height 
and velocity (red lines) are oscillating together, but they are out of phase— just as 
the position triangle and velocity triangle were at right angles. This is absolutely 
fantastic, that in calculus the two most famous functions of trigonometry form a pair: 
The slope of the sirte curve is given by the cosine curve . 

When the distance is f{t) = sin t, the velocity is v(t) = cos t. 

Admission of guilt: The slope of sin t was not computed in the standard way. 
Previously we compared (t + h ) 2 with t 2 , and divided that distance by h . This average 
velocity approached the slope 2t as h became small. For sin t we could have done the 
same: 

. change in sin f sin(t + A) — sin f 

average velocity = — ; - . (1) 

change in t h 


This is where we need the formula for sin (f + h), coming soon. Somehow the ratio in 
(1) should approach cos r as h -► 0, (It does.) The sine and cosine fit the same pattern 
as t 2 and 2r — our shortcut was to watch the shadow of motion around a circle. 




Fig. 1.1 fl v = cos t when /= sin t (red); v = — sin t when /= cos f (black). 


Question 1 What if the ball goes twice as fast , to reach angle It at time r? 

Answer The speed is now 2. The time for a full circle is only n. The ball's position 
is x = cos 2t and y = sin 2r. The velocity is still tangent to the circle — but the tangent 
is at angle 2t where the ball is. Therefore cos 2t enters the upward velocity and 
— sin2t enters the horizontal velocity. The difference is that the velocity triangle is 
twice as big . The upward velocity is not cos 2t but 2 cos It. The horizontal velocity 
is - 2 sin 2t. Notice these 2’s! 

Question 2 What is the area under the cosine curve from t = 0 to t = tt/2? 

You can answer that, if you accept the Fundamental Theorem of Calculus — 
computing areas is the opposite of computing slopes. The slope of sin f is cos t y so the 
area under cos t is the increase in sin t. No reason to believe that yet, but we use it 
anyway. 

From sin 0 = 0 to sin jt/2 = 1, the increase is 1. Please realize the power of calculus. 
No other method could compute the area under a cosine curve so fast. 
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THE SLOPE OF THE COSINE CURVE 

I cannot resist uncovering another distance and velocity (another f-v pair) with no 
extra work. This time / is the cosine. The time dock starts at the top of the circle . 
The old time t = n}2 is now t = 0. The dotted lines in Figure 1,18 show the new start. 
But the shadow has exactly the same motion — the hall keeps going around the circle, 
and the mass follows it up and down. The /-graph and u-graph are still correct, both 
with a time shift of tt/2. 

The new /-graph is the cosine. The new u-graph is minus the sine . The slope of the 
cosine curve follows the negative of the sine curve. That is another famous pair, twins 
of the first: 


When the distance is f(t) = cos (, the velocity is u(t) = — sin t. 

You could see that coming, by watching the ball go left and right (instead of up and 
down). Its distance across is /= cos t . Its velocity across is v = - sin r. That twin pair 
completes the calculus in Chapter 1 (trigonometry to come). We review the ideas: 

v is the velocity 

the slope of the distance curve 

the limit of average velocity over a short time 

the derivative of /. 

/ is the distance 

the area under the velocity curve 

the Omit of total distance over many short times 

the integral of v. 

Differential calculus: Compute v from /. Integral calculus: Compute f from v. 

With constant velocity, / equals vt. With constant acceleration, v= at and / = ±at 2 . 
In harmonic motion, v = cos t and /= sin t. One part of our goal is to extend that 
list — for which we need the tools of calculus. Another and more important part is 
to put these ideas to use. 

Before the chapter ends, may I add a note about the book and the course? The 
book is more personal than usual, and I hope readers will approve. What I write is 
very close to what I would say, if you were in this room. The sentences are spoken 
before they are written. t Calculus is alive and moving forward — it needs to be taught 
that way. 

One new part of the subject has come with the computer. It works with a finite 
step h, not an “infinitesimal'’ limit. What it can do, it does quickly — even if it cannot 
find exact slopes or areas. The result is an overwhelming growth in the range of 
problems that can be solved. We landed on the moon because / and v were so 
accurate. (The moon's orbit has sines and cosines, the spacecraft starts with v = at 
and f=\at 2 . Only the computer can account for the atmosphere and the sun's gravity 
and the changing mass of the spacecraft.) Modern mathematics is a combination of 
exact formulas and approximate computations . Neither part can be ignored, and 1 
hope you will see numerically what we derive algebraically. The exercises are to help 
you master both parts. 


tOn television you know immediately when the words are live. The same with writing. 
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The course has made a quick start — not with an abstract discussion of sets or 
functions or limits, but with the concrete questions that led to those ideas. You have 
seen a distance function / and a limit v of average velocities. We will meet more 
functions and more limits (and their definitions!) but it is crucial to study important 
examples early. There is a lot to do, but the course has definitely begun. 


1.4 EXERCISES 


Read-through questions 

A ball at angle r on the unit circle has coordinates x = o 
and y = b , It completes a full circle at f = c , Its speed 
is d . Its velocity points in the direction of the a . 
which is f to the radius coming out from the center. The 
upward velocity is a and the horizontal velocity is h . 

A mass going up and down level with the ball has height 
/(f) = * . This is called simple J motion. The velocity 

is u(f ) = fc . When t = njl the height is /= > and the 

velocity h v = m . If a speeded-up mass reaches / — sin 2 1 
at time f, its velocity is v = n , A shadow traveling under 
the ball has / — cos r and u — o . When / is distance — 
area = integral, v is p = g = > 

1 For a ball going around a unit circle with speed 1, 

(a) how long does it take for 5 revolutions? 

(b) at time / = 3n/2 where is the ball? 

(c) at f = 22 where is the ball (approximately)? 

2 For the same motion find the exact x and y coordinates 
at t = 2n/3, At what time would the ball hit the x axis, ir it 
goes off on the tangent at r = 2 tt/3? 

3 A ball goes around a circle of radius 4. At time f (when it 
reaches angle r) find 

(a) its x and y coordinates 

(b) the speed and the distance traveled 

(c) the vertical and horizontal velocity. 

4 On a circle of radius R find the x and y coordinates at 
time f (and angle r)> Draw the velocity triangle and find the 
x and y velocities, 

5 A ball travels around a unit circle (raoius 1) with speed 3, 
starting from angle zero. At time f , 

(a) what angle does it reach? 

(b) what are its x and y coordinates? 

(c) what are its x and y velocities? This part is harder. 

6 If another ball stays ;r/2 radians ahead of the ball with 
speed 3, find its angle, its x and y coordinates, and its vertical 
velocity at time t. 


7 A mass moves on the x axis under or over the original 
ball (on the unit circle with speed 1). What is the position 
x — /(f)? Find x and v at t = rc/4. Plot x and u up to t = n. 

8 Does the new mass (under or over the ball) meet the old 
mass (level with the ball)? What is the distance between 
the masses at time f? 

9 Draw graphs of /(f) = cos3f and cos 27rf and 2n cos f, 
marking the time axes. How long until each / repeats? 

10 Draw graphs of /=sin(r + n) and u = cos(f + ?r). This 
oscillation stays level with what ball? 

11 Draw graphs of /=sin(tf/2 — f) and v= — cos(ti/2 — f). 
This oscillation stays level with a ball going which way start* 
ing where? 

12 Draw a graph of /(f) = sin f + cos f> Estimate its greatest 
height (maximum /) and the time it reaches that height. By 
computing f 1 2 3 4 5 6 check your estimate, 

13 How fast should you run across the circle to meet the ball 
again? It travels at speed 1. 

14 A mass falls from the top of the unit circle when the ball 
of speed 1 passes by. What acceleration a is necessary to meet 
the ball at the bottom? 


Find the area under v =cos t from the change in /=sin r: 

15 from r = 0 to f = n J6 from t = 0 to t = 71/6 

17 from f = 0 to f = 2n 18 from f = ti/ 2 to t = 3ti/2, 

19 The distance curve /=sin4f yields the velocity curve 
v = 4 cos 4f. Explain both 4's, 

20 The distance curve /= 2 cos 3r yields the velocity curve 
v = — 6 sin 3f. Explain the —6. 

21 The velocity curve y = cos4r yields the distance curve 
/= l sin 4f. Explain the j. 

22 The velocity v = 5 sin 5f yields what distance? 
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23 Find the slope of the sine curve at f = nj 3 from v = cos t. 
Then find an average slope by dividing sin n/2 — sin nj 3 by 
the time difference n/2 — jt/3. 

24 The slope of /=sint at f — 0 is cos 0—1. Compute 
average slopes (sin t)jt for t = 1, .1, .01, .001. 

Tbe ball at x = cos t, y — sin t circles (1) counterclockwise 
(2) with radius 1 (3) starting from x = 1, y = 0 (4) at speed 1. 
Find (1)(2)(3)(4) for the motions 25-30. 

25 x = cos 3(, y = — sin 3r 

26 x = 3 cos 4t, y = 3 sin 4t 

27 x — 5 sin 2t, y = 5 cos 2t 

28 x = 1 + cos t, y — sin t 

29 x = cos (( + 1), y = sin (f + 1) 

30 x = cos(— f), y = sin(— t) 


The oscillation x = 0, y = sin t goes (1) up and down (2) between 
— 1 and 1 (3) starting from x = G, y = 0 (4) at velocity 
v = cos t. Find (I)(2)(3)(4) for the oscillations 31-36. 

31 x = cos f, y = 0 32 x = 0, y = sin 5t 

33 x = 0, y = 2 sin (t + 0) 34 x = cos £, y = cos t 

35 x — 0, y = — 2 cos {t 36 x = cos 2 t> y = sin 2 £ 

37 If the ball on the unit circle reaches t degrees at time t, 
find its position and speed and upward velocity. 

38 Choose the number k so that x cos kt, y = sin kt com- 
pletes a rotation at t = L Find the speed and upward velocity. 

39 If a pitcher doesn’t pause before starting to throw, a balk 
is called. The American League decided mathematically that 
there is always a stop between backward and forward motion, 
even if the time is too short to see it* (Therefore no balk.) Is 
that true? 



1.5 A Review of Trigonometry 



Trigonometry begins with a right triangle. The size of the triangle is not as important 
as the angles. We focus on one particular angle — call it 9 — and on the ratios between 
the three sides x, y, r. The ratios don’t change if the triangle is scaled to another 
size. Three sides give six ratios, which are the basic functions of trigonometry: 



x 


Fig. 1.19 


cos 9 = - = 

near side 

sec 9 = 

r 

i 

r 

hypotenuse 

X 

cos 9 

sin 9 = - = 

opposite side 

esc 9 — 

r 

1 

r 

hypotenuse 

y 

sin 9 

tan 9 - - = 

opposite side 

cot 9 = 

X 

1 

X 

near side 


y 

tan 9 


Of course those six ratios are not independent. The three on the right come directly 
from the three on the left. And the tangent is the sine divided by the cosine: 


tan 9 = 


sin 9 
cos 9 


y/r 

x l T 


l 

X 


Note that “tangent of an angle” and “tangent to a circle” and “tangent line to a 
graph” are different uses of the same word. As the cosine of 9 goes to zero, the tangent 
of 9 goes to infinity. The side x becomes zero, 9 approaches 90°, and the triangle is 
infinitely steep. The sine of 90° is y/r = L 
Triangles have a serious limitation. They are excellent for angles up to 90°, and 
they are OK up to 180°, but after that they fail. We cannot put a 240° angle into a 
triangle. Therefore we change now to a circle. 
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Rg, 1,20 Trigonometry on a circle. Compare 2 sin 9 with sin 20 and tan 0 (periods In, n , n). 


Angles are measured from the positive x axis (counterclockwise). Thus 90° is 
straight up, 180° is to the left, and 360° is in the same direction as 0°. (Then 450° is 
the same as 90°,) Each angle yields a point on the circle of radius r. The coordinates 
x and y of that point can be negative (but never r). As the point goes around the 
circle, the six ratios cos 0, sin 0, tan 0, ... trace out six graphs. The cosine waveform 
is the same as the sine waveform— just shifted by 90°, 

One more change comes with the move to a circle. Degrees are out. Radians are 
in. The distance around the whole circle is 2rcr, The distance around to other points 
is 0r, We measure the angle by that multiple 8. For a half-circle the distance is nr, 
so the angle is n radians — which is 180°. A quarter-circle is njl radians or 90°, 
The distance around to angle 8 is r times 8. 

When r = 1 this is the ultimate in simplicity: The distance is 0, A 45° angle is 5 of 
a circle and 2rc/8 radians — and the length of the circular arc is 2rc/8, Similarly for l u : 

360° = 2ti radians 1° = 2 ti/ 36Q radians 1 radian = 360/2 tt degrees. 

An angle going clockwise is negative. The angle — n/2 is -60° and takes us £ of the 
wrong way around the circle. What is the effect on the six functions? 

Certainly the radius r is not changed when we go to -0, Also x is not changed 
(see Figure 1,20a), But y reverses sign, because -0 is below the axis when +0 is 
above. This change in y affects yjr and yjx but not x/r: 

cos (- 0 ) = cos 0 sin(- 0 )= - sin 0 ian(“ 0 )= - tan 0 , 

The cosine is even (no change). The sine and tangent are odd (change sign). 

The same point is 5 of the right way around. Therefore f of 2n radians (or 300°) 
gives the same direction as — tt/ 3 radians or -60°. A difference of In makes no 
difference to x, y , r. Thus sin 0 and cos 8 and the other four functions have period 2 tt. 
We can go five times or a hundred times around the circle, adding 10 n or 200n to 
the angle, and the six functions repeat themselves. 

EXAMPLE Evaluate the six trigonometric functions at 0 ~ 2 tt/ 3 (or 0 = — 47e/3), 

This angle is shown in Figure 1.20a (where r = 1), The ratios are 

cos 0 — x/r ~ - 1/2 sin 0 = y/r = y^3/2 tan 0 = yjx = - 

sec 0 = - 2 esc 0 = 2/^/3 cot 0 = —1/^/3 

Those numbers illustrate basic facts about the sizes of four functions: 

|cos 0 | < 1 |sin 0 | 1 | sec 9\ ^ 1 [esc 0 | > 1 . 

The tangent and cotangent can fall anywhere, as long as cot 8= 1/tan 0, 
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The numbers reveal more* The tangent — yfl is the ratio of sine to cosine. The 
secant -2 is 1/cos 0. Their squares are 3 and 4 (differing by 1)* That may not seem 
remarkable* but it is. There are three relationships in the squares of those six numbers, 
and they are the key identities of trigonometry: 

cos 2 6 + sin 2 0=1 1 + tan 2 6 = sec 2 6 cot 2 9 + 1 = esc 2 9 

Everything flows from the Pythagoras formula x 2 + y 2 = r 2 . Dividing by r 2 gives 
{xM 2 + (yM 2 = T That is cos 2 9 + sin 2 0=1* Dividing by jc 3 gives the second identity, 
which is 1 + (yjx) 2 = (r/x) 2 , Dividing by y 2 gives the third. All three will be needed 
throughout the book — and the first one has to be unforgettable. 

DISTANCES AND ADDITION FORMULAS 

To compute the distance between points we stay with Pythagoras. The points are in 
Figure l*21a. They are known by their x and y coordinates, and d is the distance 
between them. The third point completes a right triangle. 

For the x distance along the bottom we don’t need help. It is x 2 “ (or |x 2 “ x x ) 
since distances can’t be negative). The distance up the side is |y 2 ^ yil Pythagoras 
immediately gives the distance d\ 

distance between points = d= v /(x 2 - x^ 2 + (y 2 ~ y^ 2 . (1) 



X - COS 5 

y - sin s 



Distance between paints and equal distances in two circles. 


By applying this distance formula in two identical circles, we discover the cosine 
of s — t. (Subtracting angles is important.) In Figure l,21b ? the distance squared is 

d 1 = (change in x) 2 + (change in y) 2 

= (cos s — cos t) 2 + (sin s - sin f) 2 . (2) 

Figure 1.21c shows the same circle and triangle (but rotated). The same distance 
squared is 

d 2 - (cos (s - £) - l) 2 + (sin (s - r)) 2 . (3) 

Now multiply out the squares in equations (2) and (3). Whenever (cosine) 2 +{sine) 2 

appears* replace it by 1. The distances are the same, so (2) = (3): 

(2) = 1 + 1 — 2 cos s cos ( — 2 sin s sin t 

(3) = 1 + 1 - 2 cos(s— £)- 
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After canceling 1 + 1 and then —2, we have the “addition formula ” for cos($ — t): 

The cosine of s - t equals cos s cos t + sin s sin t. (4) 

The cosine of s+ t equals cos s cos t - sin s sin t. (5) 

The easiest is t = 0. Then cos t = 1 and sin t = 0, The equations reduce to cos s = cos s. 

To go from (4) to (5) in all cases, replace f by — t. No change in cos t , but a "‘minus” 

appears with the sine. In the special case s = t t we have cos (t + t) — 
(cos t)(cos t) — (sin r)(sin t). This is a much-used formula for cos 2t: 

Double angle : cos It = cos 2 1 — sin 2 1 = 2 cos 1 1 - 1 = 1—2 sin 2 f, (6) 

I am constantly using cos 2 f + sin 2 r= I, to switch between sines and cosines. 

We also need addition formulas and double-angle formulas for the sine of s - 1 
and s + t and 2r. For that we connect sine to cosine, rather than (sine) 2 to (cosine) 2 . 
The connection goes back to the ratio yfr in our original triangle. This is the sine of 
the angle 6 and also the cosine of the complementary angle n/2 — 0: 

sin 8 = cos (ti/ 2 - 0) and cos 0 = sin (n/2 — 6 ). (7) 

The complementary angle is n/2 - 6 because the two angles add to n/2 (a right angle). 
By making this connection in Problem 19, formulas (4-5-6) move from cosines to 
sines: 


sin [s-t) = sin s cos t - cos s sin f 

(8) 

sin(5 + t) = sin s cos t + cos 5 sin t 

(9) 

sin It = sin {r + t) = 2 sin r cos r 

(10) 


I want to stop with these ten formulas, even if more are possible. Trigonometry is 
full of identities that connect its six functions — basically because all those functions 
come from a single right triangle. The x, y, r ratios and the equation x 1 + y 2 = r 2 can 
be rewritten in many ways. But you have now seen the formulas that are needed by 
calculus. f They give derivatives in Chapter 2 and integrals in Chapter 5. And it is 
typical of our subject to add something of its own — a limit in which an angle 
approaches zero. The essence of calculus is In that limit. 


Review of the ten formulas Figure 1 .22 shows d 2 = (0 - 1) 2 + (1 - v / 3 / 2 ) 2 . 


cos - 
6 

= cos - cos - + sin 

- sin 

3 

(s-f) 

sin - 
6 

= sin - cos - - cos 
2 3 

sin 

2 

5n 

n n 

71 

7t 

(s+ t) 

5tt 

7E n 

n . 

cos — 
6 

= cos - cos - - sin 
2 3 

— sin 
2 

3 

sin — 
6 

= sin - cos - + cos 
2 3 

- sin 
2 

. n 

j n t , jt 



(2f) 


71 71 


CO. 2 - 

= cos - sin - 

3 3 



s,n2 J 

= 2 sin - cos - 
3 3 


n 

cos - 
6 

= sin ^ = >/3/2 



(H 

. n 
sin - 
6 

= cos - = 1/2 

3 



tCalculus turns (6) around to cos 2 r = i(l + cos It) and sin 2 1 = ^(1 — cos 2f). 
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1.5 EXERCISES 


Read-through questions 

Starting with a o triangle, the six basic functions are the 
b of the sides. Two ratios (the cosine x(r and the c ) 
are below 1. Two ratios (the secant r/x and the d ) are 
above 1, Two ratios fthe o and the f ) can take any 
value. The six functions are defined for all angles 0, by chang- 
ing from a triangle to a o , 

The angle 0 is measured in h . A full circle is 0 = I . 


when the distance around is 2 nr. The distance to angle 9 is 
J . All six functions have period fc . Going clockwise 

changes the sign of 9 and * and rn . Since cos(— 0) = 
cos 9 , the cosine is n 


9 Find the distance d from (1, 0) to (i, v /3/2) and show on 
a circle why 6 d is less than 2n. 

10 In Figure L22 compute d 1 and (with calculator) 124. Why 
is Yld close to and below 2n1 

11 Decide whether these equations are true or false: 
sin S 1 + cos 9 


(a) 

(b) 


1 — cos 9 sin 9 
sec 9 + esc 0 


= sin 9 + cos 9 


tan 9 + cot 9 
(c) cos 0 — sec 0 = sin 9 tan 0 


Coming from x 2 + y 2 = r 2 are the three identities 
sin 2 9 + cos 2 0=1 and o and p , (Divide by r 2 and 
q and The distance from (2, 5) to (3, 4) is 

d = > . The distance from (I, 0) to (cos (s — f), sin(s — r)) 

leads to the addition formula cos fs — t) = t . Changing 
the sign of t gives cos (s + £) = u . Choosing s = t gives 
cos 2f = v or w . Therefore i(l +cos 2r) = 
a formula needed in calculus. 


(d) sin (2?i — 9) = sin 9 

12 Simplify sinfrc — 0), cos(tc — 9\ sin(7r/2 + 0), cos(tt/2 + 0), 

13 From the formula for cos(2r + t) find cos 3 1 in terms of 
cos f. 

14 From the formula for sin(2f -hf) find sin 3 1 in terms of 
sin t 


1 In a 60-60-60 triangle show why sin 30° =-J. 

2 Convert tt, 3n f —n/4 to degrees and 60°, 90°, 270° to 
radians. What angles between 0 and 2n correspond to 
0 = 480° and 0 = -l°? 

3 Draw graphs of tan 9 and cot 9 from 0 to In. What is their 
(shortest) period? 

4 Show that cos 20 and cos 2 0 have period n and draw them 
on the same graph. 

5 At 0 = 3n/2 compute the six basic functions and check 
cos 2 0 + sin 2 9, sec 2 0 — tan 2 9, esc 2 9 — cot 2 9. 

6 Prepare a table showing the values of the six basic func- 
tions at 9 s= 0, n/4, n/3, nl% n. 

7 The area of a circle is nr 1 . What is the area of the sector 

that has angle 0? It is a fraction of the whole area, 

8 Find the distance from (1, 0) to (0, 1) along (a) a straight 
line (b) a quarter-circle (c) a semicircle centered at (i, i). 


15 By averaging cos(s — t) and cos(s 4- 0 in (4-5) find a for- 
mula for cos s cos t. Find a similar formula for sin s sin L 

16 Show that (cos 1 4 i sin t) 2 = cos It + i sin 2f, if i 2 = — 1. 

17 Draw cos 0 and sec 9 on the same graph. Find all points 
where cos 0 = sec 9. 

18 Find all angles s and t between 0 and 2n where sin (s + 1) = 
sin s + sin t. 

19 Complementary angles have sin0= cos(tc/2 — 0). Write 
sin(s + f) as cos(rc/2 — $ — t) and apply formula (4) 
with n/2 — s instead of s . In this way derive the addition 
formula (9). 

20 If formula (9) is true, how do you prove (8)? 

21 Check the addition formulas (4-5) and (8-9) for 
s - t = nj 4. 

22 Use (5) and (9) to find a formula for tan(s + f). 
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In 23-28 find every 9 that satisfies the equation. 

23 sin0 = — 1 24 sec0 = — 2 

25 sin $ = cos $ 26 sin 6 - 9 

27 sec 2 9 + esc 2 0=1 28 tan 0 = 0 

29 Rewrite cos $ + sin 9 as sin(0 + <j>) by choosing the 
correct 4 'phase angle” <j>. (Make the equation correct at 
0 = 0. Square both sides to check.) 

30 Match a sin x + b cos x with A sin (x + <j>). From equation 

(9) show that a — A cos $ and b = A sin <£. Square and add to 
find A = . Divide to find tan = bja. 

31 Draw the base of a triangle from the origin 0 = (0, 0) to 
P — (a, 0) + The third corner is at Q = (b cos 0, b sin 0). What 
are the side lengths OP and Og? From the distance formula 


(1) show that the side PQ has length 

d 1 = a 2 + b 1 — lab cos $ (law of cosines)* 

32 Extend the same triangle to a parallelogram with its fourth 
comer at R = (a + b cos 9, b sin 0). Find the length squared of 
the other diagonal OR. 

Draw graphs for equations 33-36, and mark three points, 

33 y = sin 2x 34 y = 2 sin nx 

35 y — i cos lux 36 y — sin x + cos x 

37 Which of the six trigonometric functions are infinite at 
what angles? 

38 Draw rough graphs or computer graphs of fsinf and 
sin 4f sin t from 0 to 2 k* 



1.6 A Thousand Points of Light 



The graphs on the back cover of the book show y = sin n* This is very different 
from y — sin x * The graph of sin x is one continuous curve. By the time it reaches 
x = 10,000, the curve has gone up and down 10,000/2 tt times* Those 1591 oscillations 
would be so crowded that you couldn't see anything* The graph of sin n has picked 
10,000 points from the curve — and for some reason those points seem to lie on more 
than 40 separate sine curves. 

The second graph shows the first 1000 points* They don't seem to lie on sine curves. 
Most people see hexagons* But they are the same thousand points ! It is hard to believe 
that the graphs are the same, but I have learned what to do* Tilt the second graph 
and look from the side at a narrow angle * Now the first graph appears* You see 
“diamonds.” The narrow angle compresses the x axis — back to the scale of the first 
graph* 



The effect of scale is something we don’t think of. We understand it for maps* 
Computers can zoom in or zoom out — those are changes of scale* What our eyes see 
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depends on what is ‘'close.” We think we see sine curves in the 10,000 point graph, 
and they raise several questions: 

1. Which points are near {0, 0)? 

2* How many sine curves are there? 

3. Where does the middle curve, going upward from (0, 0), come back to zero? 

A point near (0, 0) really means that sin n is close to zero. That is certainly not true 
of sin 1 (1 is one radian!). In fact sin 1 is up the axis at ,84, at the start of the seventh 
sine curve. Similarly sin 2 is .91 and sin 3 is ,14. (The numbers 3 and .14 make us 
think of %. The sine of 3 equals the sine of n - 3. Then sin .14 is near .14.) Similarly 
sin 4, sin 5, . .., sin 21 are not especially close to zero. 

The first point to come close is sin 22. This is because 22/7 is near n. Then 22 is 
close to 7 tt, whose sine is zero: 

sin 22 - sin(7?r - 22) ^ sin(- .01) ^ - .01. 

That is the first point to the right of (0, 0) and slightly below. You can see it on 
graph 1, and more clearly on graph 2. It begins a curve downward. 

The next point to come close is sin 44. This is because 44 is just past 14 tt. 

44 % 14te T .02 so sin 44 ^ sin .02 ^ ,02. 

This point (44, sin 44) starts the middle sine curve , Next is (88, sin 88). 

Now we know something. There are 44 curves . They begin near the heights sin 0, 
sin 1, . .., sin 43. Of these 44 curves, 22 start upward and 22 start downward. I was 
confused at first, because I could only hnd 42 curves. The reason is that sin 1 1 equals 
- 0,99999 and sin 33 equals .9999. Those are so close to the bottom and top that you 
can't see their curves. The sine of 11 is near -1 because sin 22 is near zero. It is 
almost impossible to follow a single curve past the top — coming back down it is not 
the curve you think it is. 

The points on the middle curve are at n = 0 and 44 and 88 and every number 44N. 
Where does that curve come back to zero? In other words, when does 44N come 
very dose to a multiple of rc? We know that 44 is 14 jt H- ,02. More exactly 44 is 
14 ti + .0177. So we multiply .0177 until we reach n: 

if N = 7U/.0177 then 44 N = (\4n + .0177)iV ~ 14 nN + rc. 

This gives N = 177.5. At that point 44A r = 7810. This is half the period of the sine 
curve. The sine of 7810 is very near zero. 

If you follow the middle sine curve, you will see it come back to zero above 7810. 
The actual points on that curve have n = 44* 177 and n = 44 - 178, with sines just 
above and below zero. Halfway between is n = 7810. The equation for the middie sine 
curve is y = sin (te-v/ 78 1 0). Its period is 15,620 — beyond our graph. 

Question The fourth point on that middle curve looks the same as the fourth point 
coming down from sin 3. What is this “double point?” 

Answer 4 times 44 is 176. On the curve going up. the point is (176, sin 176). On the 
curve coming down it is (179, sin 179). The sines of 176 and 179 differ only by .00003. 

The second graph spreads out this double point. Look above 176 and 179, at the 
center of a hexagon. You can follow the sine curve all the way across graph 2, 

Only a little question remains. Why does graph 2 have hexagons? / don't know. 
The problem is with your eyes. To understand the hexagons, Doug Hardin plotted 
points on straight lines as well as sine curves. Graph 3 shows y = fractional part of 
nilx. Then he made a second copy, turned it over, and placed it on top. That 
produced graph 4 — with hexagons. Graphs 3 and 4 are on the next page. 
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This is called a Moire pattern. If you can get a transparent copy of graph 3, and 
turn it slowly over the original, you will see fantastic hexagons. They conic from 
interference between periodic patterns — in our case 44/7 and 25/4 and 19/3 are near 
2 tt. This interference is an enemy of printers, when color screens don't line up. It can 
cause vertical lines on a TV, Also in making cloth, operators get dizzy from seeing 
Moire patterns move. There are good applications in engineering and optics -but 
we have to get back to calculus. 
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Software is available for calculus courses — a lot of it. The packages keep getting 
better. Which program to use (if any) depends on cost and convenience and purpose. 
How to use it is a much harder question. These pages identify some of the goals, and 
also particular packages and calculators. Then we make a beginning (this is still 
Chapter 1) on the connection of computing to calculus. 

The discussion will be informal. It makes no sense to copy the manual. Our aim 
is to support, with examples and information, the effort to use computing to help 
learning. 

For calculus, the greatest advantage of the computer is to offer graphics. You see 
the function, not just the formula. As you watch, /(x) reaches a maximum or a 
minimum or zero. A separate graph shows its derivative. Those statements are not 
100% true, as everybody learns right away as soon as a few functions are typed in. 
But the power to see this subject is enormous, because it is adjustable. If wc don't 
like the picture we change to a new viewing window. 

This is computer-based graphics. It combines numerical computation with 
graphical computation. You get pictures as well as numbers — a powerful combination. 
The computer offers the experience of actually working with a function. The domain 
and range are not just abstract ideas. You choose them. May I give a few examples, 

EXAMPLE 1 Certainly x i equals 3 A when v = 3, Do those graphs ever meet again } 
At this point we don't know the full meaning of 3\ except when x is a nice number. 
(Neither does the computer.) Checking at x — 2 and 4, the function x A is smaller 
both times: 2 3 is below' 3 2 and 4 3 = 64 is below 3* = 81. If x A is always less than 3* 
we ought to know — these are among the basic functions of mathematics. 
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The computer will answer numerically or graphically. At our command, it solves 
x 3 “ 3 X At another command, it plots both functions — this shows more. The screen 
proves a point of logic (or mathematics) that escaped us. If the graphs cross once, 
they must cross again — because 3* is higher at 2 and 4. A crossing point near 2.5 is 
seen by zooming in. I am less interested in the exact number than its position — it 
comes before x = 3 rather than after. 

A few conclusions from such a basic example: 

1. A supercomputer is not necessary. 

2. High-level programming is not necessary. 

3. We can do mathematics without completely understanding it. 

The third point doesn’t sound so good. Write it differently: We can learn mathematics 
while doing it . The hardest part of teaching calculus is to turn it from a spectator 
sport into a workout. The computer makes that possible. 

EXAMPLE 2 (mental computer) Compare x 1 2 with 2 X . The functions meet at x — 2. 
Where do they meet again? Is it before or after 2? 

That is mental computing because the answer happens to be a whole number (4). 
Now we are on a different track. Does an accident like 2 4 = 4 2 ever happen again? 
Can the machine tell us about integers? Perhaps it can plot the solutions of x b = b x . 
I asked Mathematica for a formula, hoping to discover x as a function of b — but the 
program just gave back the equation. For once the machine typed HELP instead of 
the user. 

Well, mathematics is not helpless. I am proud of calculus. There is a new exercise 
at the end of Section 6.4, to show that we never see whole numbers again. 

EXAMPLE 3 Find the number b for which x b - b x has only one solution (at x — b). 

When b is 3, the second solution is below 3. When b is 2, the second solution (4) is 
above 2. If we move b from 2 to 3, there must be a special “double point” — where 
the graphs barely touch but don’t cross. For that particular b — and only for that 
one value — the curve x b never goes above b x . 

This special point b can be found with computer-based graphics. In many ways it 
is the “ center point of calculus .” Since the curves touch but don’t cross, they are 
tangent. They have the same slope at the double point. Calculus was created to work 
with slopes, and we already know the slope of x 2 . Soon comes x b . Eventually we 
discover the slope of fe x , and identify the most important number in calculus. 

The point is that this number can be discovered first by experiment. 

EXAMPLE 4 Graph y(x) = e* - x*. Locate its minimum. 

The next example was proposed by Don Small. Solve x 4 — 1 lx 3 + 5x - 2 = 0. The 
first tool is algebra — try to factor the polynomial. That succeeds for quadratics, and 
then gets extremely hard. Even if the computer can do algebra better than we can, 
factoring is seldom the way to go. In reality we have two good choices: 

1- (Mathematics) Use the derivative. Solve by Newton s method. 

2. ( Graphics ) Plot the function and zoom in. 

Both will be done by the computer. Both have potential problems! Newton’s method 

is fast, but that means it can fail fast. (It is usually terrific.) Plotting the graph is also 
fast — but solutions can be outside the viewing window. This particular function is 
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zero only once, in the standard window from -10 to 10, The graph seems to be 
leaving zero, but mathematics again predicts a second crossing point. So we zoom 
out before we zoom in. 

The me of the zoom is the best part of graphing , Not only do we choose the domain 
and range, we change them. The viewing window is controlled hy four numbers. They 
can be the limits A ^ x ^ B and C < y < Z>. They can be the coordinates of two 
opposite corners: (A, C) and (B, D). They can be the center position (a, b ) and the 
scale factors c and d . Clicking on opposite comers of the zoom box is the fastest way, 
unless the center is unchanged and we only need to give scale factors. (Even faster: 
Use the default factors,) Section 3,4 discusses the centering transform and zoom 
transform — a change of picture on the screen and a change of variable within the 
function. 

EXAMPLE 5 Find all real solutions to x 4 - 1 lx 3 + 5x - 2 = 0, 

EXAMPLES Zoom out and in on the graphs of y = cos40x and y = xsin(l/x). 
Describe what you see. 

EXAMPLE 7 What does y = (tan x — sin x)/x 3 become at x = 0? For small x the 
machine eventually can’t separate tan x from sin x. It may give y = 0, Can you get 
close enough to see the limit ofy? 

For these examples, and for most computer exercises in this book, a menu-driven 
system is entirely adequate. There is a list of commands to choose from. The user 
provides a formula for y(x), and many functions are built in. A calculus supplement 
can be very useful — MicroCalc or True BASIC or Exploring Calculus or A 4PP (in 
the public domain). Specific to graphics are Surface Plotter and Master Grapher and 
Gyrographics (animated). The best software for linear algebra is MATLAB. 

Powerful packages are increasing in convenience and decreasing in cost. They are 
capable of symbolic computation — which opens up a third avenue of computing in 
calculus. 


SYMBOLIC COMPUTATION 

In symbolic computation, answers can be formulas as well as numbers and graphs. 
The derivative of y — x 2 is seen as “ 2x The derivative of sin t is “cos t.” The slope 
of b x is known to the program. The computer does more than substitute numbers 
into formulas — it operates directly on the formulas. We need to think where this fits 
with learning calculus. 

In a way, symbolic computing is close to what we ourselves do. Maybe too close — 
there is some danger that symbolic manipulation is all we do. With a higher-level 
language and enough power, a computer can print the derivative of sin(x 2 ). So why 
learn the chain rule? Because mathematics goes deeper than “algebra with formulas.” 
We deal with ideas , 

/ want to say dearly : Mathematics is not formulas , or computations, or even proofs , 
but ideas . The symbols and pictures are the language. The book and the professor 
and the computer can join in teaching it. The computer should be non-threatening 
(like this book and your professor) — you can work at your own pace. Your part is 
to learn by doing. 

EXAMPLE 8 A computer algebra system quickly finds 100 factorial. This is 1001 = 
(100)(99)(98) ... (1). The number has 158 digits (not written out here). The last 24 
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digits are zeros* For 10! = 3628800 there are seven digits and two zeros* Between 10 
and 100, and beyond, are simple questions that need ideas: 

1, How many digits (approximately) are in the number Nil 

2. How many zeros (exactly) are at the end of N\1 

For Question 1, the computer shows more than N digits when N — 100* It will never 
show more than N 1 digits, because none of the N terms can have more than N digits* 
A much tighter bound would be 2N, but is it true? Does N\ always have fewer than 
2 N digits ? 

For Question 2, the zeros in 10! can be explained. One comes from 10, the other 
from 5 times 2* (10 is also 5 times 2,) Can you explain the 24 zeros in 100!? An idea 
from the card game blackjack applies here too: Count the fives * 

Hard question: How many zeros at the end of 200!? 

The outstanding package for full-scale symbolic computation is Mathematica. It 
was used to draw graphs for this book, including y = sin n on the back cover* The 
complete command was List Plot [Table [Sin[n], {n, 10000}]]* This system has rewards 
and also drawbacks, including the price* Its original purpose, like MathCAD and 
MACSYMA and REDUCE , was not to teach calculus — but it can* The computer 
algebra system MAPLE is good* 

As I write in 1990 , DERIVE is becoming well established for the PC* For the 
Macintosh, Calculus T/L is a “sleeper” that deserves to be widely known. It builds 
on MAPLE and is much more accessible for calculus* An important alternative is 
Theorist , These are menu-driven (therefore easier at the start) and not expensive. 

I strongly recommend that students share terminals and work together* Two at a 
terminal and 3-5 in a working group seems to be optimal* Mathematics can be 
learned by talking and writing — it is a human activity* Our goal is not to test but to 
teach and learn* 

Writing in Calculus May I emphasize the importance of writing? We totally miss it, 
when the answer is just a number. A one-page report is harder on instructors as well 
as students — but much more valuable* A word processor keeps it neat* You can't 
write sentences without being forced to organize ideas — and part of yourself goes 
into it* 

I will propose a writing exercise with options* If you have computer-based graph- 
ing, follow through on Examples 1-4 above and report* Without a computer, pick a 
paragraph from this book that should be clearer and make it clearer * Rewrite it with 
examples. Identify the key idea at the start, explain it, and come back to express it 
differently at the end* Ideas are like surfaces — they can be seen many ways* 

Every reader will understand that in software there is no last word. New packages 
keep coming (Analyzer and EPIC among them)* The biggest challenges at this 
moment are three-dimensional graphics and calculus workbooks* In 3D, the problem 
is the position of the eye — since the screen is only 2D* In workbooks, the problem is 
to get past symbol manipulation and reach ideas* Every teacher, including this one, 
knows how hard that is and hopes to help. 

GRAPHING CALCULATORS 

The most valuable feature for calculus — computer-based graphing — is available on 
hand calculators* With trace and zoom their graphs are quite readable. By creating 
the graphs you subconsciously learn about functions. These are genuinely personal 
computers, and the following pages aim to support and encourage their use* 
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Programs for a hand-held machine tend to be simple and short. We don't count 
the zeros in 100 factorial (probably we could). A calculator finds crossing points and 
maximum points to good accuracy. Most of all it allows you to explore calculus by 
yourself. You set the viewing window and define the function. Then you see it. 

There is a choice of calculators — which one to buy ? For this book there was also 
a choice — which one to describe ? To provide you with listings for useful programs, 
we had to choose. Fortunately the logic is so clear that you can translate the instruc- 
tions into any language — for a computer as well as a calculator. The programs given 
here are the “greatest common denominator ’ of computing in calculus. 

The range of choices starts with the Casio fx 7000G — the first and simplest, with 
very limited memory but a good screen. The Casio 7500, 8000, and 8500 have increas- 
ing memory and extra features. The Sharp FL-52QO (or 9000 in Canada and Europe) 
is comparable to the Casio 8000. These machines have algebraic entry — the normal 
order as in y = x+3> They are inexpensive and good. More expensive and much 
more powerful are the Hewlett-Packard calculators — the HP-2SS and Ff/MS&Y. 
They have large memories and extensive menus (and symbolic algebra}. They use 
reverse Polish notation— numbers first in the stack, then commands. They require 
extra time and effort, and other books do justice to their amazing capabilities, ft is 
estimated that those calculators could get 95 on a typical calculus exam. 

While this book was being written, Texas Instruments produced a new graphing 
calculator: the 77-81. It is closer to the Casio and Sharp (emphasis on graphing, easy 
to learn, no symbolic algebra, moderate price). With earlier machines as a starting 
point, many improvements were added. There is some risk in a choice that is available 
only Af before this textbook is published, and we hope that the experts we asked are 
right. Anyway, our programs are for the 77-81. It is impressive. 

These few pages arc no substitute for the manual that comes with a calculator. A 
valuable supplement is a guide directed especially at calculus— my absolute favorites 
are Calculus Activities for Graphic Calculators by Dennis Pence (PWS-Kent, 1990 for 
the Casio and Sharp and FfP-285, 1991 for the T/-81). A series of Calculator Enhance- 
ments. using HP' s, is being published by Harcourt Brace Jovanovich, What follows 
is an introduction to one part of a calculus laboratory. Later in the book, we supply 
77-81 programs close to the mathematics and the exercises that they are prepared 
for. 

A few words to start: To select from a menu, press the item number and ENTER. 
Edit a command line using DEL(ete) and INS(ert). Every line ends with 
ENTER. For calculus select radians on the MODE screen. For powers use * , For 
special powers choose v 2 , x" 1 , v -x. Multiplication has priority, so (-)3 + 2x2 
produces 1. Use keys for S I N, IF, IS,... When you press letters, I multiplies S. 

If a program says 3 -> C, type 3 STO C ENTER. Storage locations are A to Z 
or Greek 0. 

Functions A graphing calculator helps you (forces you?) to understand the concept 
of a function. It also helps you to understand specific functions — especially when 
changing the viewing window. 

To evaluate v = x 2 — lx just once, use the home screen. To define y(x) for repeated 
use, move to the function edit screen: Press MODE, choose Function, and press 
Y = . Then type in the formula. Important tip : for X on the TI-8 1 , the key X | T is faster 
than two steps Alpha X . The Y = edit screen is the same place where the formula 
is needed for graphing. 
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Example Yl = X2-2X ENTER on the Y ~ screen. 4 STO X ENTER on the home 
screen. Y i ENT ER on the Y-VARS screen. The screen shows 8, which is Y(4). The 
formula remains when the calculator is off. 

Graphing You specify the X range and Y range. (We should say X domain but we 
don’t.) The screen is a grid of 96 x 64 little rectangles called “pixels." The first column 
of pixels represents Xmi n and the last column is Xmax. Press RANGE to reset. 
With X r e s = 1 the function is evaluated 96 times as it is graphed. X s c L and Y s c L 
give the spaces between ticks on the axes. 

The Z 0 0 M menu is a fast way to set ranges. ZOOM Standard gives the default 
-10 ^ x ^ 10, -10 < 10. ZOOM Trig gives -2 jt<jc <2ji, -3 ^ y ^ 3. 

The keystroke GRAPH shows the graphing screen with the current functions. 

Example Set the ranges (-)2 ^ X ^ 3 and (-)150<Y<50. Press Y= and store 
Y *| = X {in MATH) 3 - 28X 2 + 15X + 36 ENTER. Press GRAPH. You won’t see 
much of the graph! Press RANGE and reset (— )10 30, ( — )4000 ^ 2300. 

Press GRAPH. See a cubic polynomial. 

“Smart Graph’’ recalls the graph instantly without redrawing it, if no settings have 
changed. The DRAW menu is for points, lines, and shaded regions. This is perfect for 
our piecewise linear functions — just connect the breakpoints with lines. In Section 3.6 
the lines show an iteration by its “cobweb.” 

Programming This book contains programs that you can type in once and save. 
We chose Autoscaling, Newton’s Method, Secant Method, Cobweb Iteration, and 
Numerical Integration. You will create others — to do calculations or to add features 
that are not available as single keystrokes. The calculator is like a computer, with a 
fairly small set of instructions. One difference : Memory is too precious to store com- 
ments with the code. You have to sec the logic by rereading the program. 

To enter the world of programming, press PRGM. Each PRGM submenu lists all 
programs by name — a digit, a letter, or 9 (37 names). The program title has up to 
eight characters. Select the EDIT submenu and press G for the edit screen. Type the 
title GRAPHS and press ENTER. Practice on this one: 

:"X2 + X" STO (Y-VARS) Yi ENTER 
: " X — 1 ” STO (Y-VARS) Y2 ENTER 
: (PRGM) (I/O) DispGraph 

The menus to call are in parentheses. Leave the edit screen with QUIT (not 
CLEAR — that erases the line with the cursor). Set the default window by ZOOM 
S t anda rd. 

To execute, press PRGM (EXEC) G ENTER, The program draws the graphs. It 
leaves Y i and Y 2 on the Y = screen. To erase the program from the home screen, 
press (PRGM)(ERASE)G. Practice again by creating P r cj*n2 : FUN C. Type 
: yf X S T 0 Y and : (PRGM) (I/0)Disp Y . Move to the home screen, store 
X by 4 STO X ENTER, and execute by (PRGM) (EXEC) 2 ENTER. Also try 
X = -1. When it fails to imagine i, select 1 :Goto Error. 

Piecewise functions and input (to a running program). The definition of a piecewise 
function includes the domain of each piece , Logical tests like "IF X ^ 7 n determine 
which domain the input value X falls into. An I F statement only affects the following 
line— which is executed when T E S T = 1 (meaning true) and skipped when T E ST = 0 
(meaning false). I F commands are in the P R G M ( C T L ) submenu; TEST calls the 
menu of inequalities. 
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An input value X = 4 need not be stored in advance. Program P stops while 
running to request input. Execute with P ENTER after selecting the PRGM (EXEC) 
menu. Answer ? with 4 and ENTER. After completion, rerun by pressing ENTER 
again. The function is y — 14 — x if x < 1, y — x if x^l. 


PrgmP: PIECES 
: Di sp 
: I n pu t X 
: 14-X- Y 
: If 7^ X 
: X -► Y 
:Di sp Y 


PGRM (I/O) Ask for input 
PGRM (I/O) Screen ? ENTER X 
First formula for all X 
PRGM (CTL) TEST 
Overwrite if T E S T = t 
Display Y(X) 


Overwriting is faster than checking both ends A ^ X ^ B for each piece. Even faster: 
a whole formula (14 - X)(X < 7) 4- (X)(7 ^ X) can go on a single line using 1 and 0 
from the tests. Compute-store-display Y{X) as above, or define Y 1 on the edit screen. 


Exercise Define a third piece Y=8 + X \( X < 3. Rewrite P using Y i = , A product 
of tests ( 3 ^ X ) ( X < 7 ) evaluates to 1 if ail true and to 0 if any false. 

TRACE and ZOOM The best feature ts graphing. But a w r hole graph can be like a 
whole book — too much at once. You want to focus on one part. A computer or 
calculator will trace along the graph, stop at a point, and zoom in. 

There is also ZOOM OUT, to widen the ranges and see more. Our eyes work the 
same way — they put together information on different scales. Looking around the 
room uses an amazingly large part of the human brain. With a big enough computer 
we can try to imitate the eyes — this is a key problem in artificial intelligence. With 
a small computer and a zoom feature, we can use our eyes to understand functions. 

Press TRACE to locate a point on the graph. A blinking cursor appears. Move 
left or right— the cursor stays on the graph. Its coordinates appear at the bottom of 
the screen. When x changes by a pixel, the calculator evaluates y(x), To solve y(.v) = 0, 
read off x at the point when y is nearest to zero. To minimize or maximize y{x), read 
off the smallest and largest y. In all these problems, zoom in for more accuracy. 

To blow up a figure we can choose new ranges. The fast way is to use a ZOOM 
command . For a preset range, use ZOOM Standard or ZOOM Trig. To shrink 
or stretch byXFactorYFact (default values 4), use Z 00M I n or Z 00 M 0 u t. 
Choose the center point and press ENTER. The new graph appears. Change those 
scaling factors with ZOOM Set Factors. Best of all, create your own viewing 
window. Press ZOOM Box. 

To draw the box, move the cursor to one corner. Press ENTER and this point is 
a small square. The same keys move a second (blinking) square to the opposite 
corner — the box grows as you move. Press ENTER, and the box is the new r viewing 
window'. The graphs show the same function with a change of scale. Section 3.4 will 
discuss the mathematics — here we concentrate on the graphics. 


EXAMPLE 9 Place : Y i = X s i n Cl / X > in the Y = edit screen. Press Z 00M T r i g 
for a first graph. Set X F a c t = 1 and Y F a c t = 2.5 , Press ZOOM In with center at 
(0, 0). To see a larger picture, use X F a c t = 10 and Y F a c t = 1 . Then Zoom Out 
again. As X gets large, the function X sin(l .'Y) approaches . 

Now return to ZOOM Trig, Zoom In with the factors set to 4 (default). Zoom 
again by pressing ENTER. With the center and the factors fixed, this is faster than 
drawing a zoom box. 
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EXAMPLE 10 Repeat for the more erratic function Y = sin (1/X). After ZOOMTrig, 
create a box to see this function near X — .01. The Y range is now . 

Scaling is crucial. For a new function it can be tedious. A formula for y(x) does 
not easily reveal the range of y’s, when A < x ^ B is given. The following program is 
often more convenient than zooms. It samples the function L= 19 times across the 
x-range (every 5 pixels). The inputs Xmin, Xmax , Y x are previously stored on other 
screens. After sampling, the program sets the y-range from C — Ymin to D — Ymax 
and draws the graph. 

Notice the loop with counter X. The loop ends with the command IS> ( K , L ) , 
which increases X by 1 and skips a line if the new X exceeds L. Otherwise the 
command G o t o 1 restarts the loop. The screen shows the short form on the left. 

Example: Y 1 = x 3 + IOjc 3 - 7x + 42 with range Xmin = -12 and Xmax = 10. 
Set tick spacing X s c L = 4 and Y s c L = 250. Execute with PRGM (EXEC) A 
ENTER. For this program we also list menu locations and comments. 


rgmA : AUTOSCL 

Menu (Submenu) Comment 

ALl-Off 

Y-VARS (OFF) Turn off functions 

Xm i n -* A 

VARS (RNG) Store Xmin using S T 0 

1 9 -» L 

Store number of evaluations (19) 

(Xmax-A) /L-» H 

Spacing between evaluations 

A -» X 

Start at x = A 

Y 1 -» C 

Y-VARS (Y) Evaluate the function 

C-> D 

Start C and D with this value 

1 ->K 

Initialize counter K = 1 

Lb L 1 

PRGM (CTL) Mark loop start 

A + KH -* X 

Calculate next x 

Yl -» Y 

Evaluate function at x 

IF Y < C 

PGRM (CTL) New minimum? 

Y -* C 

Update C 

IF D < Y 

PRGM (CTL) New maximum? 

Y -* D 

Update D 

IS> (K, L) 

PRGM (CTL) Add 1 to K„ skip Goto if > L 

Goto 1 

PRGM (CTL) Loop return to L b L 1 

Yi-On 

Y-VARS (ON) Tum on Yi 

C -+ Ymi n 

STO VARS (RNG) Set Ymi n = C 

D -> Ymax 

STOVARS (RNG) Set Y ma x = I> 

D i spGraph 

PRGM (I/O) Generate graph 



Contents 


chapter i Introduction to Calculus 

1.1 Velocity and Distance 1 

1.2 Calculus Without Limits 8 

1.3 The Velocity at an Instant 16 

1.4 Circular Motion 22 

1.5 A Review of Trigonometry 29 

1.6 A Thousand Points of Light 34 

1.7 Computing in Calculus 36 

chapter 2 Derivatives 

2.1 The Derivative of a Function 44 

2.2 Powers and_ Polynomials 50 

2.3 The Slope and the Tangent Line 58 

2.4 Derivative of the Sin e and Cosine 64 

2.5 The Product and Qu otient and Power Rules 71 

2.6 Limits 78 

2.7 Continuous Functions 85 


chapter 3 Applications of the Derivative 

3.1 Linear Approximation 91 

3.2 Maximum and Minimum Problems 96 

3.3 Second Derivatives: Minimum vs. Maximum 105 

3.4 Graphs 112 

3.5 Ellipses, Parabolas, and Hyperbolas 121 

3.6 Iterations x n+1 = F(x n ) 130 

3.7 Newton’s Method and Chaos 137 

3.8 The Mean Value Theorem and FHopital’s Rule 146 


CHAPTER 2 


Derivatives 


2.1 The Derivative of a Function 

This chapter begins with the definition of the derivative. Two examples were in 
Chapter 1. When the distance is f 2 , the velocity is 2 f. When /(f) = sin f we found 
u(f) = cos t. The velocity is now called the derivative of /(f). As we move to a more 
formal definition and new examples, we use new symbols /' and df/dt for the 
derivative. 


2A At time f, the derivative f'(t) or df jdt or i;(f) is 


/'(f) = Km 

Ar-*0 


/(t + At) -/(t) 
At 


( 1 ) 


The ratio on the right is the average velocity over a short time At. The derivative, on 
the left side, is its limit as the step Af (delta f) approaches zero. 

Go slowly and look at each piece. The distance at time f + Af is /(f + Af). The 
distance at time f is /(f). Subtraction gives the change in distance , between those 
times. We often write A / for this difference: A / =/(f + Af) —/(f). The average velocity 
is the ratio Af/At — change in distance divided by change in time. 

The limit of the average velocity is the derivative, if this limit exists: 


df 

dt 


Ar-*0 At 


(2) 


This is the neat notation that Leibniz invented: Af/At approaches df/dt. Behind the 
innocent word “limit” is a process that this course will help you understand. 

Note that Af is not A times /! It is the change in /. Similarly Af is not A times f. 
It is the time step, positive or negative and eventually small. To have a one-letter 
symbol we replace At by h. 

The right sides of (1) and (2) contain average speeds. On the graph of /(f), the 
distance up is divided by the distance across. That gives the average slope Af /At. 

The left sides of (1) and (2) are instantaneous speeds df/dt. They give the slope at 
the instant f. This is the derivative df /dt (when Af and Af shrink to zero). Look again 
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at the calculation for /(f) = t 2 : 


A/_ /(t + At)-/(t) 
At At 


f 2 + 2t At + (At) 2 - t 2 
At 


= 2t + At. 


( 3 ) 


Important point: Those steps are taken before At goes to zero. If we set At = 0 too 
soon, we learn nothing. The ratio A//At becomes 0/0 (which is meaningless). The 
numbers A / and At must approach zero together, not separately. Here their ratio is 
2t + At, the average speed. 

To repeat: Success came by writing out (t + At) 2 and subtracting t 2 and dividing 
by At. Then and only then can we approach At = 0. The limit is the derivative 2t. 

There are several new things in formulas (1) and (2). Some are easy but important, 
others are more profound. The idea of a function we will come back to, and the 
definition of a limit. But the notations can be discussed right away. They are used 
constantly and you also need to know how to read them aloud: 

/(f) = “/ of t” = the value of the function / at time f 

At = “delta t” = the time step forward or backward from f 

f(t + At) = “/ of t plus delta t” = the value of / at time t + At 

A /= “delta /” = the change /(t + At) -/(f) 

Af/At - “delta / over delta t” = the average velocity 

/'(f) = “/ prime of t” = the value of the derivative at time t 

df /dt = “df d t” = the same as /' (the instantaneous velocity) 

lim = “limit as delta t goes to zero” = the process that starts with 
A! ~° numbers Af/At and produces the number df/dt. 

From those last words you see what lies behind the notation df/dt. The symbol At 
indicates a nonzero (usually short) length of time. The symbol dt indicates an 
infinitesimal (even shorter) length of time. Some mathematicians work separately 
with df and dt, and df/dt is their ratio. For us df/dt is a single notation (don’t 
cancel d and don’t cancel A). The derivative df /dt is the limit of A / /At. When that 
notation df /dt is awkward, use f or v. 


Remark The notation hides one thing we should mention. The time step can be 
negative just as easily as positive. We can compute the average Af/At over a time 
interval before the time t, instead of after. This ratio also approaches df/dt. 

The notation also hides another thing: The derivative might not exist. The averages 
Af/At might not approach a limit (it has to be the same limit going forward and 
backward from time f). In that case /'(t) is not defined. At that instant there is no 
clear reading on the speedometer. This will happen in Example 2. 


EXAMPLE 1 (Constant velocity V — 2) The distance / is F times t. The distance at 
time f + At is V times t + At. The difference A f is V times At: 


A/ 

At 


VAt df 

— — = V so the limit is — 
At dt 


V. 


The derivative of Vt is V. The derivative of 2 1 is 2. The averages Af/At are always 
V= 2, in this exceptional case of a constant velocity. 
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EXAMPLE 2 Constant velocity 2 up to time t — 3, then stop. 

For small times we still have f(t) = 2t. But after the stopping time, the distance is 
fixed at f(t) = 6. The graph is flat beyond time 3. Then f(t + At) =/(f) and A/= 0 
and the derivative of a constant function is zero : 


t > 3: 


/'(f)= lim 

At->0 


f(t + At) — /(t) _ 1: _ 0 


At 


lim — = 0. 
Al-0 At 


( 4 ) 


In this example the derivative is not defined at the instant when t = 3. The velocity 
falls suddenly from 2 to zero. The ratio A//Af depends, at that special moment, on 
whether At is positive or negative. The average velocity after time t = 3 is zero. The 
average velocity before that time is 2. When the graph of / has a corner, the graph 
of v has a jump. It is a step function. 

One new part of that example is the notation {df/dt or /' instead of t;). Please look 
also at the third figure. It shows how the function takes t (on the left) to /(t). Especially 
it shows At and A/. At the start, A//At is 2. After the stop at t = 3, all t’s go to the 
same f(t) = 6. So A/= 0 and df \dt - 0. 


time distance 



Fig. 2.1 The derivative is 2 then 0. It does not exist at t = 3. 


THE DERIVATIVE OF Ml 

Here is a completely different slope, for the “demand function” fit) = 1/t. The demand 
is 1 It when the price is t. A high price t means a low demand 1/t. Increasing the price 
reduces the demand. The calculus question is: How quickly does 1/t change when t 
changes' ? The “marginal demand” is the slope of the demand curve. 

The big thing is to find the derivative of 1/t once and for all. It is —1/t 2 . 

1 1 1 t — (t + At) — At 

EXAMPLE 3 fit) = - has A f= — — . This equals / — . - = , , - . 

J t t + Att t(t + At) t(t + At) 

A f -1 , df -1 

Divide by At and let At -» 0: — = . , - - approaches — = —5- . 

3 At t(f + At) dt t 2 

Line 1 is algebra, line 2 is calculus. The first step in line 1 subtracts fit) from 
f{t + At). The difference is l/(t + At) minus 1/t. The common denominator is t times 
t + At — this makes the algebra possible. We can’t set At = 0 in line 2, until we have 
divided by At. 

The average is A//At= -l/t(t + At). Now set At = 0. The derivative is -1/t 2 . 
Section 2.4 will discuss the first of many cases when substituting At = 0 is not possible, 
and the idea of a limit has to be made clearer. 
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Fig. 2.2 Average slope is — &, true slope is — Increase in t produces decrease in /. 

Check the algebra at t = 2 and t + At = 3. The demand 1/t drops from 1/2 to 1/3. 
The difference is A/= - 1/6, which agrees with -l/(2)(3) in line 1. As the steps A/ 
and At get smaller, their ratio approaches — 1/(2)(2) = -1/4. 

This derivative is negative. The function 1/t is decreasing, and A / is below zero. The 
graph is going downward in Figure 2.2, and its slope is negative: 

An increasing f(t) has positive slope. A decreasing /(t) has negative slope. 

The slope —1/t 2 is very negative for small t. A price increase severely cuts demand. 

The next figure makes a small but important point. There is nothing sacred about t. 
Other letters can be used — especially x. A quantity can depend on position instead 
of time. The height changes as we go west. The area of a square changes as the side 
changes. Those are not affected by the passage of time, and there is no reason to use 
t. You will often see y =/(x), with x across and y up — connected by a function /. 

Similarly, / is not the only possibility. Not every function is named /! That letter 
is useful because it stands for the word function — but we are perfectly entitled to 
write y(x) or y(t) instead of /(x) or f(t). The distance up is a function of the distance 
across. This relationship “y of x” is all-important to mathematics. 

The slope is also a function. Calculus is about two functions, y(x) and dyjdx. 

Question If we add 1 to y(x), what happens to the slope? Answer Nothing. 

Question If we add 1 to the slope, what happens to the height? Answer . 

The symbols t and x represent independent variables — they take any value they 
want to (in the domain). Once they are set, f(t) and y(x) are determined. Thus / and 
y represent dependent variables — they depend on t and x. A change At produces a 



l 2 

Fig. 2.3 The derivative of 1/r is — 1/t 2 . The slope of 1/x is — 1/x 2 . 
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change A/. A change Ax produces Ay. The independent variable goes inside the 
parentheses in f(t) and y(x). It is not the letter that matters, it is the idea: 

independent variable t or x 

dependent variable / or g or y or z or u 

derivative df/dt or dfjdx or dyjdx or 

The derivative dyjdx comes from [change in y] divided by [change in x]. The time 
step becomes a space step, forward or backward. The slope is the rate at which y 
changes with x. The derivative of a function is its “ rate of change .” 

I mention that physics books use x(t) for distance. Darn it. 

To emphasize the definition of a derivative, here it is again with y and x: 

Ay y(x + Ax)-y(x) distance up dy Ay 

— = = — — = lim — = y (x). 

Ax Ax distance across dx Ax 

The notation y'(x) pins down the point x where the slope is computed. In dyjdx that 
extra precision is omitted. This book will try for a reasonable compromise between 
logical perfection and ordinary simplicity. The notation dy/dx(x) is not good; y'(x) is 
better; when x is understood it need not be written in parentheses. 

You are allowed to say that the function is y = x 2 and the derivative is y' = 2x — 
even if the strict notation requires y(x) = x 2 and y'(x) = 2x. You can even say that 
the function is x 2 and its derivative is 2x and its second derivative is 2 — provided 
everybody knows what you mean. 

Here is an example. It is a little early and optional but terrific. You get excellent 
practice with letters and symbols, and out come new derivatives. 


EXAMPLE 4 If u(x) has slope du/dx , what is the slope of /(x) = (u(x)) 2 ? 

From the derivative of x 2 this will give the derivative of x 4 . In that case u = x 2 and 
/ = x 4 . First point: The derivative of u 2 is not (du/dx) 2 . We do not square the derivative 
2x. To find the “square rule” we start as we have to — with A/=/(x + Ax) —f(x): 


A / = (w(x + Ax)) 2 - (u(x)) 2 = [w(x + Ax) + w(x)] [u(x + Ax) - «(x)]. 


This algebra puts A / in a convenient form. We factored a 2 - b 2 into \_a + b] times 
[a - b]. Notice that we don’t have (Au) 2 . We have A/, the change in u 2 . Now divide 
by Ax and take the limit: 


4 £ 

Ax 


= [u(x + Ax) + h(x)] 


u(x + Ax) - u(x) 
Ax 


approaches 2u(x) ^ . 

dx 


( 5 ) 


This is the square rule : The derivative of («(x)) 2 is 2m(x) times dujdx. From the 
derivatives of x 2 and 1/x and sin x (all known) the examples give new derivatives. 


EXAMPLE 5 (u = x 2 ) The derivative of x 4 is 2 u dujdx = 2(x 2 )(2x) = 4x 3 . 

EXAMPLE 6 (u = 1/x) The derivative of 1/x 2 is 2 u dujdx = (2/x)( — 1/x 2 ) = -2/x 3 . 

EXAMPLE 7 (u = sin x, dujdx = cos x) The derivative of u 2 = sin 2 x is 2 sin x cos x. 

Mathematics is really about ideas. The notation is created to express those ideas. 
Newton and Leibniz invented calculus independently, and Newton’s friends spent a 
lot of time proving that he was first. He was, but it was Leibniz who thought of 
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writing dyjdx — which caught on. It is the perfect way to suggest the limit of Ay/Ax. 
Newton was one of the great scientists of all time, and calculus was one of the great 
inventions of all time — but the notation must help. You now can write and speak 
about the derivative. What is needed is a longer list of functions and derivatives. 


2.1 EXERCISES 


Read-through questions 

The derivative is the a of A / /At as At approaches b . 
Here A/ equals c . The step At can be positive or d . 
The derivative is written v or e or 1 . If f(x) = 2x + 3 
and Ax = 4 then Af = g . If Ax = — 1 then A/ = h . 
If Ax = 0 then A /=_!_. The slope is not 0/0 but 
df/dx = | . 

The derivative does not exist where f(t) has a k and 
v(t) has a l . For f(t) = 1/t the derivative is m , The 
slope of y = 4/x is dy/dx = n . A decreasing function has 
a o derivative. The p variable is t or x and the 
q variable is / or y. The slope of y 2 (is) (is not) (dy/dx) 2 . 
The slope of (w(x)) 2 is r by the square rule. The slope of 
(2x + 3) 2 is s . 


9 Find Ay/ Ax for y(x) = x + x 2 . Then find dy/dx. 

10 Find Ay /Ax and dy/dx for y(x) = 1 + 2x + 3x 2 . 

11 When /(t) = 4/t, simplify the difference /(t + At) — /(t), 
divide by At, and set At — 0. The result is f'(t). 

12 Find the derivative of 1/t 2 from A/(t) = l/(t + At) 2 — 1/t 2 . 
Write Af as a fraction with the denominator t 2 (t + Af) 2 . 
Divide the numerator by At to find Af/ At. Set At = 0. 

13 Suppose /(f) = It to t = 1. Afterwards /(t) = 7 + 9(f — 1). 

(a) Find df/dt at t = \ and f = f . 

(b) Why doesn’t f(t) have a derivative at t= 1? 

14 Find the derivative of the derivative (the second derivative) 
of y = 3x 2 . What is the third derivative? 


1 Which of the following numbers (as is) gives df /dt at time 
t? If in doubt test on f(t) = t 2 . 


(a) 


f(t + At) — /(t) 


(c) lim 


At 

f(t — At) —f(t) 


(b) lim IMMzM 

h~* o 2 h 

(d) lim + 


2 Suppose f(x) = x 2 . Compute each ratio and set h — 0: 


(a) 


(c) 


f(x + h) -f(x) 


f(x + h) -f(x - h) 
2 h 


(b) 

(d) 


f(x + 5h) —f (x) 
5 h 

f(x + 1) -/(x) 


3 For f(x) = 3x and g(x) = 1 + 3x, find f(4 + h) and g( 4 4- h) 
and f(4) and g'( 4). Sketch the graphs of / and g — why do 
they have the same slope? 

4 Find three functions with the same slope as f(x) = x 2 . 

5 For f(x) = 1/x, sketch the graphs of f(x) + 1 and /(x + 1). 
Which one has the derivative —1/x 2 ? 

6 Choose c so that the line y = x is tangent to the parabola 
y = x 2 + c. They have the same slope where they touch. 

7 Sketch the curve y(x) = 1 — x 2 and compute its slope at 
x = 3. 


8 If f(t) = 1/t, what is the average velocity between t = j and 
f = 2? What is the average between t = \ and t = 1? What 
is the average (to one decimal place) between t = j and 
t = 101/200? 


15 Find numbers A and B so that the straight line y = x fits 
smoothly with the curve 7= A + JBx + x 2 at x = 1. Smoothly 
means that y=Y and dy/dx = dY/dx at x = 1. 

16 Find numbers A and B so that the horizontal line y = 4 
fits smoothly with the curve y = A + Bx + x 2 at the point 
x = 2. 

17 True (with reason) or false (with example): 

(a) If /(t) < 0 then df/dt < 0. 

(b) The derivative of (f(t)) 2 is 2 df/dt. 

(c) The derivative of 2 f(t) is 2 df /dt. 

(d) The derivative is the limit of Af divided by the limit 
of At. 

18 For f(x) = 1/x the centered difference f(x + h) —f(x — h) is 
l/(x -f h) — l/(x — h). Subtract by using the common denomi- 
nator (x + h)(x — h). Then divide by 2 h and set h = 0. Why 
divide by 2 h to obtain the correct derivative? 

19 Suppose y = mx + b for negative x and y = Mx + B for 

x ^ 0. The graphs meet if . The two slopes are 

. The slope at x = 0 is (what is possible?). 

20 The slope of y = 1/x at x= 1/4 is y' = —1/x 2 = —16. At 
h = 1/12, which of these ratios is closest to —16? 

y(x + h) - y(x) y(x) - y(x - h) y(x + h) - y(x - h) 
h h 2 h 

21 Find the average slope of y = x 2 between x = x x and 
x = x 2 . What does this average approach as x 2 approaches Xi ? 

22 Redraw Figure 2.1 when f(t) = 3 — 2t for t ^ 2 and 
f(t) = — 1 for t ^ 2. Include df/dt. 
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23 Redraw Figure 2.3 for the function y(x) — 1 — (1/x). 
Include dy/dx. 

24 The limit of 0/At as At 0 is not 0/0. Explain. 


25 Guess the limits by an informal working rule. Set At = 0.1 
and — 0. 1 and imagine At becoming smaller: 


(a) 

(c) 


1 + At 

2 + At 

At 4- (At) 2 
At - (At) 2 



(d) 


t T At 
t- At 


*26 Suppose /(x)/x -► 7 as x -► 0. Deduce that /(0) = 0 and 
/'( 0) = 7. Give an example other than f(x) = lx. 


27 What is lim — + ^ - - --- if it exists? What if x - 

x->0 X 


1 ? 


Problems 28-31 use the square rule: d(u 2 )/dx = 2 u(du/dx). 

28 Take u = x and find the derivative of x 2 (a new way). 

29 Take u = x 4 and find the derivative of x 8 (using 
dujdx = 4x 3 ). 

30 If u = 1 then u 2 = 1. Then d\/dx is 2 times dl/dx. How is 
this possible? 

31 Take u — y/x. The derivative of w 2 = x is 1 = 2 u(du/dx). So 
what is du/dx , the derivative of ^/x? 

32 The left figure shows f(t) = t 2 . Indicate distances f(t + At) 
and At and A/. Draw lines that have slope A// At and /'(t). 


33 The right figure shows f(x) and Ax. Find A//Ax and /'( 2). 

34 Draw /(x) and Ax so that A/ /Ax = 0 but /'(x) # 0. 

35 If /= n 2 then d / /dx — 2u dujdx. If g =/ 2 then 
dg/dx = If df /dx. Together those give g = w 4 and dg/dx — 


36 True or false , assuming /(0) = 0: 

(a) If /(x) ^ x for all x, then d/ /dx ^ 1. 

(b) If df/dx ^ 1 for all x, then /(x) ^ x. 

37 The graphs show A/ and A / /h for /(x) = x 2 . Why is 2x + h 
the equation for A / /hi If h is cut in half, draw in the new 
graphs. 




38 Draw the corresponding graphs for f(x) = jx. 

39 Draw 1/x and 1 /(x + h) and A f/h — either by hand with 
h = j or by computer to show h -» 0. 

40 For y = e x , show on computer graphs that dy/dx = y. 

41 Explain the derivative in your own words . 
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This section has two main goals. One is to find the derivatives of f(x) = x 3 and x 4 
and x 5 (and more generally f(x ) = x”). The power or exponent n is at first a positive 
integer. Later we allow x 11 and x 2,2 and every x". 

The other goal is different. While computing these derivatives, we look ahead to 
their applications. In using calculus, we meet equations with derivatives in them — 
“differential equations.” It is too early to solve those equations. But it is not too early 
to see the purpose of what we are doing. Our examples come from economics and 
biology. 
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With n = 2, the derivative of x 2 is 2x. With n - ~ 1, the slope of x -1 is -lx -2 . 
Those are two pieces in a beautiful pattern, which it will be a pleasure to discover. 
We begin with x 3 and its derivative 3x 2 , before jumping to x". 

EXAMPLE 1 If /(x) = x 3 then A /= (x 4- hf - x 3 = (x 3 + 3x 2 h 4- 3 xh 2 4- h 3 ) - x 3 . 
Step 1: Cancel x 3 . Step 2: Divide by h. Step 3: h goes to zero. 

= 3x 2 4 - 3 xh 4 - h 2 approaches = 3x 2 . 

h ax 

That is straightforward, and you see the crucial step. The power (x + h ) 3 yields four 
separate terms x 3 4 - 3x 2 h + 3xh 2 4- h 3 . (Notice 1, 3, 3, 1.) After x 3 is subtracted, we 
can divide by h. At the limit ( h = 0) we have 3x 2 . 

For f(x) = x n the plan is the same. A step of size h leads to /(x + /i) = (x + (if. 
One reason for algebra is to calculate powers like (x + h) n , and if you have forgotten 
the binomial formula we can recapture its main point. Start with n = 4: 

(x 4- h){x 4- h){x 4 - h)(x + h) = x 4 4- ??? 4- h\ (1) 

Multiplying the four x’s gives x 4 . Multiplying the four /z’ s gives h 4 . These are the easy 
terms, but not the crucial ones. The subtraction (x 4- h) 4 - x 4 will remove x 4 , and the 
limiting step h -► 0 will wipe out /z 4 (even after division by h). The products that matter 
are those with exactly one h. In Example 1 with (x + /z) 3 , this key term was 3 x 2 /z. 
Division by h left 3x 2 . 

With only one /z, there are n places it can come from. Equation (1) has four K s in 
parentheses, and four ways to produce x 3 /z. Therefore the key term is 4 x 3 h. (Division 
by h leaves 4x 3 .) In general there are n parentheses and n ways to produce x w-1 /z, so 
the binomial formula contains nx" _1 /z: 

(x + h) n = x n + nx tt ~ 1 h + ••• + h n . (2) 


2B For n = 1, 2, 3, 4, the derivative of x" is nx n l . 


Subtract x" from (2). Divide by h. The key term is nx n l . The rest disappears as h -> 0: 

A f (x + /zr-x" nx n ~ 1 /z + ■ • • + h n df 
Ax h h S ° dx~ HX ' 

The terms replaced by the dots involve h 2 and /z 3 and higher powers. After dividing 
by /z, they still have at least one factor h . All those terms vanish as h approaches zero. 

EXAMPLE 2 (x 4 - h ) 4 = x 4 4- 4 x 3 /z + 6x 2 h 2 + 4 xh 3 4- h 4 . This is n = 4 in detail. 

Subtract x 4 , divide by /z, let h -► 0. The derivative is 4x 3 . The coefficients 1, 4, 6, 4, 1 
are in Pascal’s triangle below. For (x + h) 5 the next row is 1, 5, 10, J_. 

Remark The missing terms in the binomial formula (replaced by the dots) contain 
all the products x n ~ j h j . An x or an h comes from each parenthesis. The binomial 
coefficient “n choose j” is the number of ways to choose j /z’s out of n parentheses. It 
involves n factorial , which is n(n —!)••• (1). Thus 5! = 5*4*3 # 2*1 = 120. 
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These are numbers that gamblers know and love: 

1 Pascal's 

1 1 triangle 

12 1 

1331 n = 3 

14641 n= 4 

In the last row, the coefficient of x 3 ft is 4! / 1! 3! = 4 • 3 • 2 • 1/1 • 3 • 2 • 1 = 4. For 
the x 2 h 2 term, with j = 2, there are 4 • 3 • 2 • 1/2 • 1 • 2 • 1 = 6 ways to choose two ft’s. 
Notice that 1 + 4 + 6 4- 4 + 1 equals 16, which is 2 4 . Each row of Pascal’s triangle 
adds to a power of 2. 

Choosing 6 numbers out of 49 in a lottery, the odds are 49 • 48 • 47 • 46 • 45 • 44/6! 
to 1. That number is N = “49 choose 6” = 13,983,816. It is the coefficient of x 43 ft 6 
in (x + ft) 49 . If a times N tickets are bought, the expected number of winners is X. The 
chance of no winner is e~ x . The chance of one winner is Ae~ x . See Section 8.4. 

Florida’s lottery in September 1990 (these rules) had six winners out of 109,163,978 
tickets. 

DERIVATIVES OF POLYNOMIALS 

Now we have an infinite list of functions and their derivatives: 

x x 2 x 3 x 4 x 5 • ■ • 1 2x 3x 2 4x 3 5x 4 • • • 

The derivative of x n is n times the next lower power x" -1 . That rule extends beyond 
these integers 1, 2, 3, 4, 5 to all powers : 


/= 1/x 

has 

/' = -l/x 2 : 

Example 3 of Section 2.1 

(»= -i) 

/= 1/x 2 

has 

/' = - 2/x 3 : 

Example 6 of Section 2.1 

(« = - 2) 

f=y/x 

has 

/' = i*- 1/2 : 

true but not yet checked 

(« = i) 


Remember that x 2 means 1/x 2 and x 1/2 means l/ v /x. Negative powers lead to 
decreasing functions, approaching zero as x gets large. Their slopes have minus signs. 

Question What are the derivatives of x 10 and x 2,2 and x~ 1/2 ? 

Answer 10x 9 and 2.2x 1,2 and -^x~ 3/2 . Maybe (x + ft) 2,2 is a little unusual. 
Pascal’s triangle can’t deal with this fractional power, but the formula stays firm: 
After x 2 2 comes 2.2 x 1,2 ft. The complete binomial formula is in Section 10.5. 

That list is a good start, but plenty of functions are left. What comes next is really 
simple. A tremendous number of new functions are “linear combinations’’ like 

/(x) = 6x 3 or 6x 3 + ^x 2 or 6x 3 - lx 2 . 

What are their derivatives? The answers are known for x 3 and x 2 , and we want to 
multiply by 6 or divide by 2 or add or subtract. Do the same to the derivatives : 

f'(x) = 18x 2 or 18x 2 + x or 18x 2 - x. 


2C The derivative of c times /(x) is c times /'(x). 
2D The derivative of / (x) + g(x) is f'(x) + g'(x). 


“n choose j" : 




The number c can be any constant. We can add (or subtract) any functions. The rules 
allow any combination of / and g: The derivative of9f(x) — lg(x) is 9/'(x) — 7g'(x). 
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The reasoning is direct. When /(x) is multiplied by c, so is f(x + h). The difference 
A if is also multiplied by c. All averages A / /h contain c, so their limit is c/'. The only 
incomplete step is the last one (the limit). We still have to say what “limit” means. 

Rule 2D is similar. Adding /+ g means adding A/+ A g. Now divide by h. In the 
limit as h -> 0 we reach /' + g f — because a limit of sums is a sum of limits. Any 
example is easy and so is the proof — it is the definition of limit that needs care 
(Section 2.6). 

You can now find the derivative of every polynomial. A “polynomial” is a combina- 
tion of 1 , x, x 2 , ... , x" — for example 9 + 2x - x 5 . That particular polynomial has slope 
2 - 5x 4 . Note that the derivative of 9 is zero! A constant just raises or lowers the 
graph, without changing its slope. It alters the mileage before starting the car. 

The disappearance of constants is one of the nice things in differential calculus. 
The reappearance of those constants is one of the headaches in integral calculus. 
When you find v from /, the starting mileage doesn’t matter. The constant in / has 
no effect on v. (A/ is measured by a trip meter ; At comes from a stopwatch.) To find 
distance from velocity, you need to know the mileage at the start. 

A LOOK AT DIFFERENTIAL EQUATIONS (FIND y FROM dy/dx) 

We know that y = x 3 has the derivative dy/dx = 3x 2 . Starting with the function, we 
found its slope. Now reverse that process. Start with the slope and find the function. 
This is what science does all the time — and it seems only reasonable to say so. 
Begin with dy/dx = 3x 2 . The slope is given, the function y is not given. 

Question Can you go backward to reach y = x 3 ? 

Answer Almost but not quite. You are only entitled to say that y = x 3 + C. The 
constant C is the starting value of y (when x = 0). Then the differential equation 
dy/dx = 3x 2 is solved. 

Every time you find a derivative, you can go backward to solve a differential 
equation. The function y - x 2 + x has the slope dy/dx = 2x + 1. In reverse, the slope 
2x + 1 produces x 2 + x — and all the other functions x 2 + x + C, shifted up and down. 
After going from distance / to velocity v, we return to /+ C. But there is a lot more 
to differential equations. Here are two crucial points: 

1. We reach dy/dx by way of Ay/Ax, but we have no system to go backward. With 
dy/dx = (sin x)/x we are lost. What function has this derivative? 

2. Many equations have the same solution y = x 3 . Economics has dy/dx = 3 y/x. 
Geometry has dy/dx = 3 y 2/3 . These equations involve y as well as dy/dx. Func- 
tion and slope are mixed together! This is typical of differential equations. 

To summarize: Chapters 2-4 compute and use derivatives. Chapter 5 goes in reverse. 
Integral calculus discovers the function from its slope. Given dy/dx we find y(x). Then 
Chapter 6 solves the differential equation dy/dt = y, function mixed with slope. 
Calculus moves from derivatives to integrals to differential equations. 

This discussion of the purpose of calculus should mention a specific example. 
Differential equations are applied to an epidemic (like AIDS). In most epidemics the 
number of cases grows exponentially. The peak is quickly reached by e, and the 
epidemic dies down. Amazingly, exponential growth is not happening witlp AIDS — 
the best fit to the data through 1988 is a cubic polynomial (Los Alamos Sciehce , 1989): 

The number of cases fits a cubic within 2%: y = 174.6(t - 1981.2) 3 + 340. 
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This is dramatically different from other epidemics. Instead of dy/dt = y we have 
dy/dt = 3 y/t. Before this book is printed, we may know what has been preventing e‘ 
(fortunately). Eventually the curve will turn away from a cubic — I hope that 
mathematical models will lead to knowledge that saves lives. 

Added in proof: In 1989 the curve for the U.S. dropped from t 3 to t 2 . 

MARGINAL COST AND ELASTICITY IN ECONOMICS 


First point about economics: The marginal cost and marginal income are crucially 
important. The average cost of making automobiles may be $10,000. But it is the 
$8000 cost of the next car that decides whether Ford makes it. ” The average describes 
the past, the marginal predicts the future .” For bank deposits or work hours or wheat, 
which come in smaller units, the amounts are continuous variables. Then the word 
“marginal” says one thing: Take the derivative .| 

The average pay over all the hours we ever worked may be low. We wouldn’t work 
another hour for that! This average is rising, but the pay for each additional hour 
rises faster — possibly it jumps. When $10/hour increases to $15/hour after a 40-hour 
week, a 50-hour week pays $550. The average income is $ ll/hour. The marginal 
income is $ 15/hour — the overtime rate. 

Concentrate next on cost. Let y(x) be the cost of producing x tons of steel. The 
cost of x + Ax tons is y(x + Ax). The extra cost is the difference Ay. Divide by Ax, 
the number of extra tons. The ratio Ay/Ax is the average cost per extra ton. When 
Ax is an ounce instead of a ton, we are near the marginal cost dy/dx. 

Example: When the cost is x 2 , the average cost is x 2 /x = x. The marginal cost is 
2x. Figure 2.4 has increasing slope — an example of “diminishing returns to scale.” 




Fig. 2.4 Marginal exceeds average. Constant elasticity E = +1. Perfectly elastic to perfectly 
inelastic (T curve). 


This raises another point about economics. The units are arbitrary. In yen per 
kilogram the numbers look different. The way to correct for arbitrary units is to work 
with percentage change or relative change. An increase of Ax tons is a relative increase 
of Ax/x. A cost increase Ay is a relative increase of A y/y. Those are dimensionless, the 
same in tons/tons or dollars/dollars or yen/yen. 

A third example is the demand y at price x. Now dy/dx is negative. But again the 
units are arbitrary. The demand is in liters or gallons, the price is in dollars or pesos. 


fThese paragraphs show how calculus applies to economics. You do not have to be an 
economist to understand them. Certainly the author is not, probably the instructor is not, 
possibly the student is not. We can all use dy/dx. 
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Relative changes are better. When the price goes up by 10%, the demand may drop 
by 5%. If that ratio stays the same for small increases, the elasticity of demand is j. 

Actually this number should be - \. The price rose, the demand dropped. In our 
definition, the elasticity will be — In conversation between economists the minus 
sign is left out (I hope not forgotten). 


DEFINITION The elasticity of the demand function y(x) is 


m 


^TT 

Ax-0 Ax/x 


dyjdx 

y/x 


( 3 ) 


Elasticity is “marginal” divided by “average.” £(x) is also relative change in y divided 
by relative change in x. Sometimes £(x) is the same at all prices — this important case 
is discussed below. 


EXAMPLE 1 Suppose the demand is y = c/x when the price is x. The derivative 
dy/dx = — c/x 2 comes from calculus. The division y/x = c/x 2 is only algebra. The ratio 
is E= — 1: 

For the demand y = c/x, the elasticity is (- c/x 2 )/(c/x 2 ) = - 1 . 

All demand curves are compared with this one. The demand is inelastic when |£| < 1. 
It is elastic when |£|> 1. The demand 20/ ^/x is inelastic (£ = -j), while x -3 is 
elastic (£= - 3). The power y = ex”, whose derivative we know, is the function with 
constant elasticity n : 

if y = cx" then dy/dx = cnx" -1 and £ = cnx" ~ l /(cx n /x) = n. 

It is because y = cx" sets the standard that we could come so early to economics. 

In the special case when y = c/x, consumers spend the same at all prices. Price x 
times quantity y remains constant at xy = c. 


EXAMPLE 2 The supply curve has £>0 — supply increases with price. Now the 
baseline case is y = cx. The slope is c and the average is y/x = c. The elasticity is 
E = c/c— 1. 

Compare £ = 1 with £ = 0 and £ = oo. A constant supply is “perfectly inelastic.” 
The power n is zero and the slope is zero: y = c. No more is available when the 
harvest is over. Whatever the price, the farmer cannot suddenly grow more wheat. 
Lack of elasticity makes farm economics difficult. 

The other extreme £ = oo is “perfectly elastic.” The supply is unlimited at a fixed 
price x. Once this seemed true of water and timber. In reality the steep curve 
x = constant is leveling off to a flat curve y = constant. Fixed price is changing to 
fixed supply, £ = oo is becoming £ = 0, and the supply of water follows a “gamma 
curve” shaped like T. 


EXAMPLE 3 Demand is an increasing function of income — more income, more 
demand. The income elasticity is £(/) = (dy/dl)/(y/l). A luxury has £ > 1 (elastic). 
Doubling your income more than doubles the demand for caviar. A necessity has 
£ < 1 (inelastic). The demand for bread does not double. Please recognize how the 
central ideas of calculus provide a language for the central ideas of economics. 

Important note on supply = demand This is the basic equation of microeconomics. 
Where the supply curve meets the demand curve, the economy finds the equilibrium 
price. Supply = demand assumes perfect competition. With many suppliers, no one can 
raise the price. If someone tries, the customers go elsewhere. 
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The opposite case is a monopoly^ no competition. Instead of many small producers 
of wheat, there is one producer of electricity. An airport is a monopolist (and maybe 
the National Football League). If the price is raised, some demand remains. 

Price fixing occurs when several producers act like a monopoly — which antitrust 
laws try to prevent. The price is not set by supply = demand. The calculus problem 
is different — to maximize profit . Section 3.2 locates the maximum where the marginal 
profit (the slope!) is zero. 

Question on income elasticity From an income of $10,000 you save $500. The 
income elasticity of savings is E = 2. Out of the next dollar what fraction do you 
save? 

Answer The savings is y = cx 2 because E = 2. The number c must give 500 = 
c(10,000) 2 , so c is 5 • 10~ 6 . Then the slope dy/dx is 2 cx = 10 • 10 -6 • 10 4 = to. This is 
the marginal savings, ten cents on the dollar. Average savings is 5%, marginal savings 
is 10%, and E — 2. 


2.2 EXERCISES 


Read-through questions 

The derivative of /= x 4 is /' = o . That comes from 
expanding (x -f /z) 4 into the five terms b . Subtracting x 4 
and dividing by h leaves the four terms c . This is A / //z, 
and its limit is d . 


12 Find the mistake : x 2 is x + x 4- ••• + x (with x terms). Its 
derivative is 1 + 1 + ••• +1 (also x terms). So the derivative 
of x 2 seems to be x. 

13 What are the derivatives of 3x 1/3 and — 3x" 1/3 and 
(3x 1/3 ) -1 ? 


The derivative of /= x" is /' = e Now (x 4 - h) n comes 
from the f theorem. The terms to look for are x n ~ l h, 
containing only one 9 . There are h of those terms, 

so (x + hf = x” 4 - i -i — . After subtracting i and 
dividing by /z, the limit of A f /h is k . The coefficient of 
x n ~ j h j , not needed here, is “zz choose /” = l where nl 
means m . 

The derivative of x ~ 2 is n . The derivative of x 1/2 is 
o . The derivative of 3x + (1/x) is P . which uses the 
following rules: The derivative of 3/(x) is q and the deriv- 
ative of /(x) 4 - g(x) is r . Integral calculus recovers s 
from dy/dx. If dy/dx = x 4 then y(x) = t 

1 Starting with / = x 6 , write down /' and then /". (This is 

“/ double prime,” the derivative of /'.) After deriva- 

tives of x 6 you reach a constant. What constant? 

2 Find a function that has x 6 as its derivative. 


Find the derivatives of the functions in 3-10. Even if n is nega- 
tive or a fraction, the derivative of x" is nx n ~ l . 


3 x 2 + 7x + 5 
5 1 4- x 4- x 2 4- x 3 4- x 4 
7 x" + x“ n 

9 l+x+ r 2 + v i+ h x4 


4 1 + (7/x) + (5/x 2 ) 
6 (x 2 + l) 2 
8 x n /n\ 

10 ^x il2 + - s x S12 


14 The slope of x + (1/x) is zero when x = . What 

does the graph do at that point? 

15 Draw a graph of y = x 3 — x. Where is the slope zero? 

16 If df/dx is negative, is /(x) always negative? Is /(x) nega- 
tive for large x? If you think otherwise, give examples. 

17 A rock thrown upward with velocity 16 ft/sec reaches 
height /= 16f — 16t 2 at time t. 

(a) Find its average speed A / /At from t — 0 to t = 7 . 

(b) Find its average speed A / /At from t = \ to t — 1. 

(c) What is df /dt at t = 7 ? 

18 When / is in feet and t is in seconds, what are the units 

of /' and its derivative /"? In /— 16f — 16 1 2 , the first 16 is 
ft/sec but the second 16 is . 

19 Graph y = x 3 -I- x 2 — x from x = — 2 to x — 2 and estimate 
where it is decreasing. Check the transition points by solving 
dy/dx = 0 . 

20 At a point where dy/dx = 0, what is special about the 
graph of y(x)? Test case: y — x 2 . 

21 Find the slope of y = by algebra (then h -> 0): 

Ay = ,/x + h-y/x = + v ' / x + fr + ,/x 

h h h /x + ft + Vx' 


11 Name two functions with df/dx = 1/x 2 . 


22 Imitate Problem 21 to find the slope of y = l/ v /x. 
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23 Complete Pascal’s triangle for n - 5 and n - 6. Why do spent on the car? Compare dy/dx (marginal) with y/x 
the numbers across each row add to 2"? (average). 


24 Complete (x + h) 5 = x 5 4- . What are the bino- 

mial coefficients ^ J and and 

25 Compute (x -1- h ) 3 — (x — /i) 3 , divide by 2 h, and set h = 0. 
Why divide by 2 h to find this slope ? 



40 Name a product whose price elasticity is 

(a) high (b) low (c) negative (?) 

41 The demand y = c/x has dy/dx = — y/x. Show that Ay/ Ax 
is not —y/x. (Use numbers or algebra.) Finite steps miss the 
special feature of infinitesimal steps. 


26 Solve the differential equation y" = x to find y(x). 

27 For /(x) = x 2 + x 3 , write out /(x + Ax) and Af /Ax. What 
is the limit at Ax = 0 and what rule about sums is confirmed? 

28 The derivative of (w(x)) 2 is from Section 2. 1. Test 

this rule on u = x". 

29 What are the derivatives of x 7 + 1 and (x + l) 7 ? Shift the 
graph of x 7 . 

30 If df /dx is u(x), what functions have these derivatives? 

(a) 4v(x) (b) u(x) + 1 

(c) v(x + 1) (d) u(x) + i/(x). 

31 What function /(x) has fourth derivative equal to 1? 

32 What function /(x) has nth. derivative equal to 1? 

33 Suppose df /dx = 1 + x + x 2 + x 3 . Find /(x). 

34 Suppose df/dx = x~ 2 — x~ 3 . Find /(x). 

35 /(x) can be its own derivative. In the infinite polynomial 

/= 1 + x + jx 2 + £x 3 + , what numbers multiply x 4 

and x 5 if df /dx equals /? 

36 Write down a differential equation dy/dx — that 

is solved by y — x 2 . Make the right side involve y (not just 2x). 

37 True or false: (a) The derivative of x 71 is nx n . 

(b) The derivative of ax n /bx n is a/b. 

(c) If df /dx = x 4 and dg/dx — x 4 then /(x) = g(x). 

(d) (f(x) — f(a))/(x — a) approaches f\a) as x -► a. 

(e) The slope of y — (x — l) 3 is y’ = 3(x — l) 2 . 

Problems 38-44 are about calculus in economics. 

38 When the cost is y — y 0 + cx, find £(x) = (dy/dx) /(y/x). It 

approaches for large x. 

39 From an income of x = $10,000 you spend y = $1200 on 
your car. If E~j, what fraction of your next dollar will be 


42 The demand y = x" has E = . The revenue xy 

(price times demand) has elasticity E = . 

43 y = 2x + 3 grows with marginal cost 2 from the fixed cost 
3. Draw the graph of E(x). 

44 From an income / we save S(I). The marginal propensity 

to save is . Elasticity is not needed because S and / 

have the same . Applied to the whole economy this 

is (microeconomics) (macroeconomics). 

45 2 X is doubled when t increases by . t 3 is doubled 

when t increases to t. The doubling time for AIDS 

is proportional to t. 

46 Biology also leads to dy/y = n dx/x, for the relative growth 
of the head (dy/y) and the body (dx/x). Is n > 1 or n < 1 for a 
child? 

47 What functions have df /dx = x 9 and df /dx — x"? Why 
does n = — 1 give trouble? 

48 The slope of y = x 3 comes from this identity: 

(x T h) 3 x 3 

= (x + h) 2 + (x + h)x + x 2 . 

n 

(a) Check the algebra. Find dy/dx as h -►0. 

(b) Write a similar identity for y = x 4 . 

49 (Computer graphing) Find all the points where y = 
x 4 + 2x 3 — lx 2 + 3 = 0 and where dy/dx — 0. 

50 The graphs of yi(x) = x 4 + x 3 and y 2 (x) = lx — 5 touch at 

the point where y 3 (x) = = 0. Plot y 3 (x) to see what is 

special. What does the graph of y(x) do at a point where 

y = y' = 0? 

51 In the Massachusetts lottery you choose 6 numbers out 
of 36. What is your chance to win? 

52 In what circumstances would it pay to buy a lottery ticket 
for every possible combination, so one of the tickets would 
win? 
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2.3 The Slope and the Tangent Line 

Chapter 1 started with straight line graphs. The velocity was constant (at least piece- 
wise). The distance function was linear. Now we are facing polynomials like x 3 — 2 
or x 4 - x 2 + 3, with other functions to come soon. Their graphs are definitely curved. 
Most functions are not close to linear — except if you focus all your attention near a 
single point. That is what we will do. 

Over a very short range a curve looks straight . Look through a microscope, or zoom 
in with a computer, and there is no doubt. The graph of distance versus time becomes 
nearly linear. Its slope is the velocity at that moment. We want to find the line that 
the graph stays closest to — the “ tangent line ” — before it curves away. 

The tangent line is easy to describe. We are at a particular point on the graph of 
y=f(x). At that point x equals a and y equals f(a) and the slope equals /'(a). 
The tangent line goes through that point x = a, y=f{a) with that slope m = f'(a). 
Figure 2.5 shows the line more clearly than any equation, but we have to turn the 
geometry into algebra. We need the equation of the line. 

EXAMPLE 1 Suppose y — x 4 - x 2 + 3. At the point x = a = 1, the height is y =f(a) = 3. 
The slope is dy/dx = 4x 3 - 2x. At x = 1 the slope is 4 — 2 = 2. That is /'(a): 

The numbers x = 1, y = 3, dy/dx = 2 determine the tangent line. 

The equation of the tangent line is y- 3 = 2(x — 1), and this section explains why. 





Fig. 2.5 The tangent line has the same slope 2 as the curve (especially after zoom). 


THE EQUATION OF A LINE 

A straight line is determined by two conditions. We know the line if we know two 
of its points. (We still have to write down the equation.) Also, if we know one point 
and the slope , the line is set. That is the situation for the tangent line, which has a 
known slope at a known point: 

1. The equation of a line has the form y — mx + b 

2 . The number m is the slope of the line, because dy/dx = m 

3. The number b adjusts the line to go through the required point. 

I will take those one at a time — first y = mx + b, then m, then b. 

1. The graph of y = mx + b is not curved. How do we know? For the specific example 
y = 2x 4- 1, take two points whose coordinates x, y satisfy the equation: 

x = 0, y=l and x = 4, y = 9 both satisfy y = 2x+l. 
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Those points (0, 1) and (4, 9) lie on the graph. The point halfway between has x-2 
and y = 5. That point also satisfies y = 2x + 1. The halfway point is on the graph. If 
we subdivide again, the midpoint between (0, 1) and (2, 5) is (1, 3). This also has 
y = 2x + 1. The graph contains all halfway points and must be straight. 

2. What is the correct slope m for the tangent line? In our example it is m = f'(a) = 2. 
The curve and its tangent line have the same slope at the crucial point : dy/dx = 2. 

Allow me to say in another way why the line y = mx + b has slope m. At x = 0 its 
height is y = b. At x = 1 its height is y = m + b. The graph has gone one unit across 
(0 to 1) and m units up (b to m + b). The whole idea is 

distance up m 

SOpe_ distance across - I' (1) 

Each unit across means m units up, to 2m + b or 3m + b. A straight line keeps a 
constant slope, whereas the slope of y = x 4 - x 2 + 3 equals 2 only at x = 1 . 

3. Finally we decide on b . The tangent line y = 2x + b must go through x = 1, y = 3. 
Therefore b = 1. With letters instead of numbers, y = mx + b leads to f(a) = ma + b. 
So we know b : 


2E The equation of the tangent line has b=f(a ) - ma: 

y = mx+f(a)-~ ma or y -/(a) = m(x - a). (2) 


That last form is the best. You see immediately what happens at x = a. The factor 
x — a is zero. Therefore y =f(a) as required. This is the point-slope form of the equa- 
tion, and we use it constantly: 


y — 3 — 2(x — 1) 


or 


y~ 3 distance up 

= = slope 2. 

x - 1 distance across 


EXAMPLE 2 The curve y = x 3 - 2 goes through y — 6 when x = 2. At that point 
dy/dx = 3x 2 = 12. The point-slope equation of the tangent line uses 2 and 6 and 12: 

y- 6 = 12(x - 2), which is also y = 12x 18. 

There is another important line. It is perpendicular to the tangent line and perpen- 
dicular to the curve. This is the normal line in Figure 2.6. Its new feature is its slope. 
When the tangent line has slope m, the normal line has slope - 1/m. (Rule: Slopes of 



Fig. 2.6 


Tangent line y - y 0 = m(x - x 0 ). Normal line y - y 0 = - — (x - x 0 ). Leaving a roller- 
coaster and catching up to a car. m 
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perpendicular lines multiply to give — 1.) Example 2 has m = 12, so the normal line 
has slope -1/12: 

tangent line : y — 6 = 12(x — 2) normal line : y — 6 = — jj(x — 2). 

Light rays travel in the normal direction. So do brush fires — they move perpendicular 
to the fire line. Use the point-slope form! The tangent is y= 12x - 18, the normal is 
not y = ~ tjx - 18. 

EXAMPLE 3 You are on a roller-coaster whose track follows y = x 2 + 4. You see a 
friend at (0, 0) and want to get there quickly. Where do you step off? 

Solution Your path will be the tangent line (at high speed). The problem is to choose 
x = a so the tangent line passes through x — 0, y = 0. When you step off at x = a, 

the height is y — a 2 + 4 and the slope is 2a 

the equation of the tangent line is y - (a 2 + 4) = 2a(x — a) 

this line goes through (0, 0) if - (a 2 + 4) = - la 2 or a = + 2. 

The same problem is solved by spacecraft controllers and baseball pitchers. Releasing 
a ball at the right time to hit a target 60 feet away is an amazing display of calculus. 
Quarterbacks with a moving target should read Chapter 4 on related rates. 

Here is a better example than a roller-coaster. Stopping at a red light wastes gas. 
It is smarter to slow down early, and then accelerate. When a car is waiting in front 
of you, the timing needs calculus: 

EXAMPLE 4 How much must you slow down when a red light is 72 meters away? 
In 4 seconds it will be green. The waiting car will accelerate at 3 meters/sec 2 . You 
cannot pass the car. 

Strategy Slow down immediately to the speed V at which you will just catch that 

car. (If you wait and brake later, your speed will have to go below V.) At the catch- 

up time T, the cars have the same speed and same distance. Two conditions , so the 
distance functions in Figure 2.6d are tangent. 

Solution At time T, the other car's speed is 3(7" — 4). That shows the delay of 4 
seconds. Speeds are equal when 3(T— 4) = V or T= + 4. Now require equal dis- 
tances. Your distance is V times T. The other car’s distance is 72 + \at 2 \ 

72 + i*3(T-4) 2 = VT becomes 72 + i-$K 2 = K($K+4). 

The solution is V — 12 meters/second. This is 43 km/hr or 27 miles per hour. 

Without the other car, you only slow down to V= 72/4= 18 meters/second. As 
the light turns green, you go through at 65 km/hr or 40 miles per hour. Try it. 

THE SECANT LINE CONNECTING TWO POINTS ON A CURVE 

Instead of the tangent line through one point, consider the secant line through two 
points. For the tangent line the points came together. Now spread them apart. The 
point-slope form of a linear equation is replaced by the two-point form. 

The equation of the curve is still y =/(x). The first point remains at x = a, y =f(a). 
The other point is at x = c\ y —f{c). The secant line goes between them, and we want 
its equation. This time we don't start with the slope — but m is easy to find. 
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EXAMPLE 5 The curve y = x 3 — 2 goes through x = 2, y = 6. It also goes through 
x = 3, y = 25. The slope between those points is 

change in y 25-6 

m : — - — 19. 

change in x 3 — 2 

The point-slope form (at the first point) is y — 6 = 19(x — 2). This line automatically 
goes through the second point (3, 25). Check: 25-6 equals 19(3 - 2). The secant 
has the right slope 19 to reach the second point. It is the average slope Ay/Ax. 


A look ahead The second point is going to approach the first point. The secant 
slope Ay/Ax will approach the tangent slope dy/dx. We discover the derivative (in 
the limit). That is the main point now — but not forever. 

Soon you will be fast at derivatives. The exact dy/dx will be much easier than 
Ay/Ax. The situation is turned around as soon as you know that x 9 has slope 
9x 8 . Near x = 1, the distance up is about 9 times the distance across. To find 
Ay = 1.001 9 - l 9 , just multiply Ax = .001 by 9. The quick approximation is .009, the 
calculator gives Ay = .009036. It is easier to follow the tangent line than the curve. 


Come back to the secant line, and change numbers to letters. What line connects 
x = a, y =/(a) to x = c, y =/(c)? A mathematician puts formulas ahead of numbers, 
and reasoning ahead of formulas, and ideas ahead of reasoning: 


distance up f(c)-f(a) 


(1) The slope is m = - . 

distance across c- a 

(2) The height is y =f(a) at x = a 

(3) The height is y =/(c) at x = c (automatic with correct slope). 


2F The two-point form uses the slope between the points: 

secant line: y —/(a) ** (x ~ a)- 


( 3 ) 


At x = a the right side is zero. So y =/(a) on the left side. At x = c the right side has 
two factors c — a. They cancel to leave y =/(c). With equation (2) for the tangent line 
and equation (3) for the secant line, we are ready for the moment of truth. 

THE SECANT LINE APPROACHES THE TANGENT LINE 


What comes now is pretty basic. It matches what we did with velocities: 


average velocity = 


A distance 
A time 


/(t + At)-/(t) 
At 


The limit is df/dt. We now do exactly the same thing with slopes. The secant tine 
turns into the tangent line as c approaches a: 


slope of secant line: 


Af_ f(c)-f(a) 
Ax c — a 


df A f 

slope of tangent line: — = limit of — . 

ax Ax 
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There stands the fundamental idea of differential calculus! You have to imagine more 
secant lines than I can draw in Figure 2.7, as c comes close to a. Everybody recognizes 
c-a as Ax. Do you recognize f(c)-f(a) as f(x + Ax) -/(x)? It is A/, the change 
in height. All lines go through x = a, y =f(a). Their limit is the tangent line. 



secant 

secant 

secant 

tangent 


Fig. 2.7 Secants approach tangent as their 
slopes A//Ax approach df/dx. 


secant y-/(a) = (x - a) 

c — a 

tangent y -f(a) =f'(a){x - a) 


Intuitively, the limit is pretty clear. The two points come together, and the tangent 
line touches the curve at one point. (It could touch again at faraway points.) Mathe- 
matically this limit can be tricky — it takes us from algebra to calculus. Algebra stays 
away from 0/0, but calculus gets as close as it can. 

The new limit for df jdx looks different, but it is the same as before: 


/'(a) = lim 

c~*a 


m_ ~/(a) 

c — a 


(4) 


EXAMPLE 6 Find the secant lines and tangent line for y = /(x) = sin x at x = 0. 

The starting point is x = 0, y = sin 0. This is the origin (0, 0). The ratio of distance up 
to distance across is (sin c)/c: 

sin c , 

secant equation y = x tangent equation y = lx. 

c 

As c approaches zero, the secant line becomes the tangent line. The limit of (sin c)/c 
is not 0/0, which is meaningless, but 1, which is dy/dx. 

EXAMPLE 7 The gold you own will be worth ft million dollars in t years. When 
does the rate of increase drop to 10% of the current value, so you should sell the 
gold and buy a bond? At t = 25, how far does that put you ahead of ft = 5? 

Solution The rate of increase is the derivative of ft, which is 1/2 yft. That is 10% 
of the current value yft when 1/2 ft = yft/ 10. Therefore 2t = 10 or t = 5. At that time 
you sell the gold, leave the curve, and go onto the tangent line: 

y - yfs = ( t - 5) becomes y - f5 = 2^/5 at t = 25. 

With straight interest on the bond, not compounded, you have reached 
y = 2>yf> = 6.7 million dollars. The gold is worth a measly five million. 


2.3 EXE RCISES 

of the c . The point-slope form of the tangent equation 

is y —f{a) = d - • 


Read-through questions 

A straight line is determined by a points, or one point 
and the b . The slope of the tangent line equals the slope 


The tangent line to y = x 3 + x at x = 1 has slope 8 . Its 
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equation is 1 . It crosses the y axis at 9 and the 

x axis at h . The normal line at this point (1, 2) has 
slop e 1 . Its equation is y — 2 — i . The secant line 

from (1,2) to (2, k ) has slope 1 . Its equation is 

v — 2 = ™ . 

The point (c, /(c)) is on the line y — f(a) = m(x — a) pro- 
vided m = n . As c approaches a, the slope m approaches 
° . The secant line approaches the p line. 

1 (a) Find the slope of y = 12/ x. 

(b) Find the equation of the tangent line at (2, 6). 

(c) Find the equation of the normal line at (2, 6). 

(d) Find the equation of the secant line to (4, 3). 

2 For y = x 2 + x find equations for 

(a) the tangent line and normal line at (1, 2); 

(b) the secant line to x = 1 4- h, y = (1 4- h) 2 + (1 + h). 

3 A line goes through (1, —1) and (4, 8). Write its equation 
in point-slope form. Then write it as y = mx 4- b. 

4 The tangent line to y = x 3 4- 6x at the origin is 

y = . Does it cross the curve again? 

5 The tangent line to y = x 3 — 3x 2 4- x at the origin is 

y = . It is also the secant line to the point . 

6 Find the tangent line to x = y 2 at x = 4, y = 2. 

7 For y = x 2 the secant line from (a, a 2 ) to (c,c 2 ) has the 

equation . Do the division by c — a to find the tan- 

gent line as c approaches a. 

8 Construct a function that has the same slope at x — 1 and 
x = 2. Then find two points where y = x 4 — 2x 2 has the same 
tangent line (draw the graph). 

9 Find a curve that is tangent to y = 2x — 3 at x = 5. Find 
the normal line to that curve at (5, 7). 

10 For y = 1/x the secant line from ( a , 1/a) to (c, 1/c) has the 

equation . Simplify its slope and find the limit as c 

approaches a. 

11 What are the equations of the tangent line and normal 
line to y = sin x at x = 7r/2? 

12 If c and a both approach an in-between value x-b, then 

the secant slope (/(c) — f(a))/(c — a) approaches . 

13 At x = a on the graph of y — 1/x, compute 

(a) the equation of the tangent line 

(b) the points where that line crosses the axes. 

The triangle between the tangent line and the axes always has 
area . 

14 Suppose g(x)=f(x) + l. The tangent lines to / and g at 

x = 4 are . True or false : The distance between those 

lines is 7. 


15 Choose c so that y — 4x is tangent to y = x 2 + c. Match 
heights as well as slopes. 

16 Choose c so that y — 5x — 7 is tangent to y = x 2 4- cx. 

17 For y = x 3 4- 4x 2 — 3x + 1, find all points where the tan- 
gent is horizontal. 

18 y = 4x can’t be tangent to y = cx 2 . Try to match heights 
and slopes, or draw the curves. 

19 Determine c so that the straight line joining (0, 3) and 
(5, —2) is tangent to the curve y = c/(x 4- 1). 

20 Choose h, c, d so that the two parabolas y = x 2 4- bx + c 
and y — dx — x 2 are tangent to each other at x = 1, y — 0. 

21 The graph of /(x) = x 3 goes through (1, 1). 

(a) Another point is x = c = 1 + h, y =/(c) = . 

(b) The change in / is A/= . 

(c) The slope of the secant is m = . 

(d) As h goes to zero, m approaches , 

22 Construct a function y —f(x) whose tangent line at x = 1 
is the same as the secant that meets the curve again at x = 3. 

23 Draw two curves bending away from each other. Mark 
the points P and Q where the curves are closest. At those 

points, the tangent lines are and the normal lines 

are . 

*24 If the parabolas y = x 2 4- 1 and y = x — x 2 come closest at 
(i a , a 2 4- 1) and (c, c — c 2 ), set up two equations for a and c. 

25 A light ray comes down the line x = a. It hits the parabolic 
reflector y = x 2 at P = ( a , a 2 ). 

(a) Find the tangent line at P. Locate the point Q where 
that line crosses the y axis. 

(b) Check that P and Q are the same distance from the 
focus at F = (0, £). 

(c) Show from (b) that the figure has equal angles. 

(d) What law of physics makes every ray reflect off the 
parabola to the focus at F? 


vertical ray 




26 In a bad reflector y = 2/x, a ray down one special line 
x — a is reflected horizontally. What is a? 
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27 For the parabola 4 py = x 2 , where is the slope equal to 1? 

At that point a vertical ray will reflect horizontally. So the 
focus is at (0, ). 

28 Why are these statements wrong? Make them right. 

(a) If y = 2x is the tangent line at (I, 2 ) then y - — jx is 
the normal line. 

(b) As c approaches a , the secant slope (/(c) - f(a))/(c - a) 
approaches (f{a) — f(a))/(a — a). 

(c) The line through ( 2 , 3) with slope 4 is y — 2 = 4(x — 3). 

29 A ball goes around a circle: x — cos t, y — sin t. At £ = 37r/4 
the bail flies off on the tangent line. Find the equation of that 
line and the point where the ball hits the ground ( y = 0). 

30 If the tangent line to y —f(x) at x = a is the same as the 
tangent line t oy- g(x) at x = 6, find two equations that must 
be satisfied by a and b. 

31 Draw a circle of radius 1 resting in the parabola y = x 2 . 

At the touching point (a, a 2 ), the equation of the normal line 
is . That line has x = 0 when y — . The dis- 
tance to (a, a 2 ) equals the radius 1 when a = . This 

locates the touching point. 

32 Follow Problem 31 for the flatter parabola y — jx 2 and 
explain where the circle rests. 

33 You are applying for a $1000 scholarship and your time 
is worth $10 a hour. If the chance of success is 1 — (1/x) from 
x hours of writing, when should you stop? 

34 Suppose |/(c) — f(a )\ ^ |c — a\ for every pair of points a 
and c. Prove that \df/dx\ ^ 1. 

35 From which point x = a does the tangent line to y = 1/x 2 
hit the x axis at x — 3? 


36 If u(x)/v(x) = 7 find u'(x)/v f (x). Also find (u(x)/v(x))'. 

37 Find /(c) = 1.001 10 in two ways — by calculator and by 
/(c) —f(a) « f'(a)(c — a). Choose a = 1 and /(x) = x 10 . 

38 At a distance Ax from x = 1, how far is the curve y = 1/x 
above its tangent line? 

39 At a distance Ax from x = 2, how far is the curve y — x 3 
above its tangent line? 

40 Based on Problem 38 or 39, the distance between curve 
and tangent line grows like what power (Ax) p ? 

41 The tangent line to /(x) = x 2 — 1 at x 0 = 2 crosses the 

x axis at x 1 = . The tangent line at x 1 crosses the 

x axis at x 2 = , Draw the curve and the two 

lines, which are the beginning of Newton's method to solve 
/(x) = 0. 

42 (Puzzle) The equation y = mx + b requires two numbers, 
the point-slope form y — f(a) - f{a)(x — a) requires three , and 
the two-point form requires four, a, f(a\ c, /(c). How can 
this be? 

43 Find the time T at the tangent point in Example 4, when 
you catch the car in front. 

44 If the waiting car only accelerates at 2 meters/sec 2 , what 
speed V must you slow down to? 

45 A thief 40 meters away runs toward you at 8 meters 
per second. What is the smallest acceleration so that v = at 
keeps you in front? 

46 With 8 meters to go in a relay race, you slow down badly 
(/= — 8 + 6f-?t 2 ). How fast should the next runner start 
(choose v in /= vt ) so you can just pass the baton? 


2.4 The Derivative of the Sine and Cosine 


This section does two things. One is to compute the derivatives of sin x and cos x. 
The other is to explain why these functions are so important. They describe oscillation , 
which will be expressed in words and equations. You will see a “ differential equation." 
It involves the derivative of an unknown function y(x). 

The differential equation will say that the second derivative — the derivative of the 
derivative — is equal and opposite to y. In symbols this is y" = — y. Distance in one 
direction leads to acceleration in the other direction. That makes y and y’ and y” all 
oscillate. The solutions to y" = — y are sin x and cos x and all their combinations. 

We begin with the slope. The derivative of y = sin x is / = cos x. There is no reason 
for that to be a mystery, but I still find it beautiful. Chapter 1 followed a ball around 
a circle; the shadow went up and down. Its height was sin t and its velocity was cos t. 


2.4 The Derivative of the Sine and Cosine 


65 


We now find that derivative by the standard method of limits , when y(x) = sin x: 


*1 - limit of ^-lin. 
dx Ax ft- o 


sin (x + h) — sin x 
h 


(1) 


The sine is harder to work with than x 2 or x 3 . Where we had (x + h) 2 or (x + h) 3 , we 

now have sin(x + h). This calls for one of the basic “addition formulas” from trigo- 

nometry, reviewed in Section 1.5: 

sin (x 4* h) = sin x cos h 4- cos x sin h (2) 

cos (x + h) — cos x cos h - sin x sin h. (3) 


Equation (2) puts Ay = sin (x + h) — sin x in a new form: 

Ay _ sin x cos h + cos x sin h — sin x . f cos h - 1 


Ax 


/ cos h ~ l\ /sin h\ 

. sm , __ +cos * _ . 


(4) 


The ratio splits into two simpler pieces on the right. Algebra and trigonometry got 
us this far, and now comes the calculus problem. What happens as h -► 0? It is no 
longer easy to divide by h. (I will not even mention the unspeakable crime of writing 
(sin h)/h = sin.) There are two critically important limits — the first is zero and the 
second is one: 


cos/i-1 

lim 

o h 


= 0 


and 


lim 

# i -»0 


sin h 

~Y~ 


(5) 


The careful reader will object that limits have not been defined! You may further 
object to computing these limits separately, before combining them into equation (4). 
Nevertheless — following the principle of ideas now , rigor later — I would like to pro- 
ceed. It is entirely true that the limit of (4) comes from the two limits in (5): 


dy 

— = (sin x)(first limit) + (cos x)(second limit) = 0 + cos x. 
dx 


(6) 


The secant slope Ay/ Ax has approached the tangent slope dy/dx. 


2G The derivative of y — sin x is dy/dx - cos x. 


1 . 00001 . _ 


.995 


.995 cos h 


We cannot pass over the crucial step — the two limits in (5). They contain the real 
ideas. Both ratios become 0/0 if we just substitute h = 0. Remember that the cosine of 
a zero angle is 1, and the sine of a zero angle is 0. Figure 2.8a shows a small angle h 
(as near to zero as we could reasonably draw). The edge of length sin h is close to 
zero, and the edge of length cos h is near 1. Figure 2.8b shows how the ratio of sin h 
to h (both headed for zero) gives the slope of the sine curve at the start. 

When two functions approach zero, their ratio might do anything. We might have 

h 2 . ft. \/ft 

- — >0 or 7-+I or >oo. 

ft ft ft 


No clue comes from 0/0. What matters is whether the top or bottom goes to zero 
sin h more quwkly- Roughly speaking, we want to show that (cos ft - l)/ft is like ft 2 /ft and 

1 / 3 h (sin ft)/ft is like ft/ft. 

100 ' • • Time out The graph of sin x is in Figure 2.9 (in black). The graph of sin (x + Ax) 

fig- 2.8 sits just beside it (in red). The height difference is A f when the shift distance is Ax. 
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sin h 



A f _ sin (x + h) - sin x 
h h 


sin ( x + h) 

Fig. 2.9 sin(x + h) with /i = 10° = 7c/18 radians. Af/Ax is close to cos x. 



Now divide by that small number Ax (or h). The second figure shows A/ /Ax. It is 
close to cos x. (Look how it starts — it is not quite cos x.) Mathematics will prove 
that the limit is cos x exactly, when Ax -> 0. Curiously, the reasoning concentrates 
on only one point (x = 0). The slope at that point is cos 0 = 1. 

We now prove this: sin Ax divided by Ax goes to 1. The sine curve starts with 
slope 1. By the addition formula for sin(x + to), this answer at one point will lead to 
the slope cos x at all points. 

Question Why does the graph of /(x + Ax) shift left from /(x) when Ax > 0? 
Answer When x = 0, the shifted graph is already showing /(Ax). In Figure 2.9a, the 
red graph is shifted left from the black graph. The red graph shows sin h when the 
black graph shows sin 0. 


THE LIMIT OF (sin h)/h IS 1 

There are several ways to find this limit. The direct approach is to let a computer 
draw a graph. Figure 2.10a is very convincing. The function (sin h)/h approaches 1 at 
the key point h = 0. So does (tan h)/h. In practice, the only danger is that you might 
get a message like “undefined function” and no graph. (The machine may refuse to 
divide by zero at h = 0. Probably you can get around that.) Because of the importance 
of this limit, I want to give a mathematical proof that it equals 1. 




Fig. 2.10 (sin h)/h squeezed between cos x and 1; (tan h)/h decreases to 1. 


Figure 2.10b indicates, but still only graphically, that sin h stays below h. (The first 
graph shows that too; (sin h)/h is below 1.) We also see that tan/z stays above h. 
Remember that the tangent is the ratio of sine to cosine. Dividing by the cosine is 
enough to push the tangent above h. The crucial inequalities (to be proved when h 
is small and positive) are 


sin h < h and tan h > h. 


(?) 
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Since tan h = (sin h)/(cos h), those are the same as 


sin h 
~~h~ 


<1 


, sin h 

and — — > cos n. 

h 


( 8 ) 


What happens as h goes to zero? The ratio (sin h)/h is squeezed between cos h and 1. 
But cos h is approaching 1! The squeeze as h -► 0 leaves only one possibility for 
(sin h)/h, which is caught in between: The ratio (sin h)/h approaches 1. 

Figure 2.10 shows that “squeeze play.” If two functions approach the same limit , so 
does any function caught in between. This is proved at the end of Section 2.6. 

For negative values of h , which are absolutely allowed, the result is the same. To 
the left of zero, h reverses sign and sin h reverses sign. The ratio (sin h)/h is unchanged. 
(The sine is an odd function: sin (— h) = — sin h.) The ratio is an even function, sym- 
metric around zero and approaching 1 from both sides. 

The proof depends on sin h < h < tan h , which is displayed by the graph but not 
explained. We go back to right triangles. 




Fig. 2.11 Line shorter than arc: 2 sin h < 2h. Areas give h < tan h. 


Figure 2.11a shows why sin h<h. The straight line PQ has length 2 sin h. The 
circular arc must be longer, because the shortest distance between two points is a 
straight line.f The arc PQ has length 2 h. (Important: When the radius is 1, the arc 
length equals the angle. The full circumference is 2n and the full angle is also 27t.) 
The straight distance 2 sin h is less than the circular distance 2 h, so sin h < h. 

Figure 2.11b shows why h < tan h. This time we look at areas. The triangular area 
is ?(base)(height) = i(l)(tan h). Inside that triangle is the shaded sector of the circle. 
Its area is h/2n times the area of the whole circle (because the angle is that fraction 
of the whole angle). The circle has area nr 2 = n, so multiplication by h/2n gives \h 
for the area of the sector. Comparing with the triangle around it, ^ tan h > jh. 

The inequalities sin h<h< tan h are now proved. The squeeze in equation (8) 
produces (sin h)/h -* 1. Q.E.D. Problem 13 shows how to prove sin h < h from areas. 

Note All angles x and h are being measured in radians. In degrees, cos x is not the 
derivative of sin x. A degree is much less than a radian, and dy/dx is reduced by the 
factor 27T/360. 

THE LIMIT OF (cos h - i)/h IS 0 

This second limit is different. We will show that 1 - cos h shrinks to zero more quickly 
than h. Cosines are connected to sines by (sin h) 2 + (cos h) 2 = 1 . We start from the 


tlf we try to prove that, we will be here all night. Accept it as true. 
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known fact sin h < h and work it into a form involving cosines: 

(1 - cos h){ 1 + cos h) = 1 — (cos h) 2 = (sin h) 2 < h 2 . 

Note that everything is positive. Divide through by h and also by 1 + cos h: 

n 1 - cos h h 

0 < < . 

h 1 + cos h 


( 9 ) 

( 10 ) 


Our ratio is caught in the middle. The right side goes to zero because h -> 0. This is 
another “ squeeze ” — there is no escape. Our ratio goes to zero. 

For cos h - 1 or for negative h , the signs change but minus zero is still zero. This 
confirms equation (6). The slope of sin x is cos x. 


Remark Equation (10) also shows that 1 - cos h is approximately \h 2 . The 2 comes 
from 1 + cos h. This is a basic purpose of calculus — to find simple approximations 
like \h 2 . A “tangent parabola” 1 - \h 2 is close to the top of the cosine curve. 


THE DERIVATIVE OF THE COSINE 


This will be easy. The quick way to differentiate cos x is to shift the sine curve by 
7i/2.That yields the cosine curve (solid line in Figure 2. 12b). The derivative also shifts 
by n/2 (dotted line). The derivative of cos x is — sin x. 

Notice how the dotted line (the slope) goes below zero when the solid line turns 
downward. The slope equals zero when the solid line is level. Increasing functions 
have positive slopes. Decreasing functions have negative slopes. That is important, and 
we return to it. 

There is more information in dy/dx than “function rising” or “function falling.” 
The slope tells how quickly the function goes up or down. It gives the rate of change. 
The slope of y = cos x can be computed in the normal way, as the limit of Ay/ Ax: 


Ay 

Ax 


cos (x + h) — cos x 


= COS X 


cos h — 1 
h 


- sin x 


sin h 


dy 

^ = (cos x)(0) 


(sin x)( 1 ) : 


sin x. 


(ID 


The first line came from formula (3) for cos (x + h). The second line took limits, 
reaching 0 and 1 as before. This confirms the graphical proof that the slope of cos x 
is -sin x. 



v = cos a is decreasing 

\ 


v - - sin v is negative 


v = - sin t is negative 

Fig. 2.12 y(.v) increases where v' is positive. y(.\) bends up where y" is positive. 



THE SECOND DERIVATIVES OF THE SINE AND COSINE 

We now introduce the derivative of the derivative. That is the second derivative of the 
original function. It tells how fast the slope is changing, not how fast y itself is 
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changing. The second derivative is the “rate of change of the velocity.” A straight line 
has constant slope (constant velocity), so its second derivative is zero: 

f{t) = 5t has df/dt= 5 and d 2 f/dt 2 = 0. 

The parabola y = x 2 has slope 2x (linear) which has slope 2 (constant). Similarly 

f (0 = i a ? 2 has df jdt — at and d 2 f jdt 2 — a . 

There stands the notation d 2 f/dt 2 (or d 2 y/dx 2 ) for the second derivative. A short 
form is /" or y". (This is pronounced / double prime or y double prime). Example: 
The second derivative of y = x 3 is y" = 6x. 

In the distance-velocity problem, /" is acceleration. It tells how fast v is changing, 
while v tells how fast / is changing. Where df/dt was distance/time, the second 
derivative is distance/(time) 2 . The acceleration due to gravity is about 32 ft/sec 2 or 
9.8 m/sec 2 , which means that v increases by 32 ft/sec in one second. It does not mean 
that the distance increases by 32 feet! 

The graph of y = sin t increases at the start. Its derivative cos t is positive. However 
the second derivative is - sin t. The curve is bending down while going up. The arch 
is “ concave down ” because y" = - sin t is negative. 

At t = n the curve reaches zero and goes negative. The second derivative becomes 
positive. Now the curve bends upward. The lower arch is “ concave up .” 

y” > 0 means that y' increases so y bends upward (concave up) 

y" < 0 means that / decreases so y bends down (concave down). 

Chapter 3 studies these things properly — here we get an advance look for sin t. 

The remarkable fact about the sine and cosine is that y" = — y. That is unusual 
and special: acceleration = - distance. The greater the distance, the greater the force 
pulling back: 

y = sin t has dy/dt = + cos t and d 2 y/dt 2 = - sin t = - y. 
y = cost has dy/dt =— sin t and d 2 yjdt 2 = — cos t = - y. 


Question Does d 2 y/dt 2 < 0 mean that the distance y(t) is decreasing? 

Answer No. Absolutely not! It means that dy/dt is decreasing, not necessarily y. 
At the start of the sine curve, y is still increasing but y" < 0. 

Sines and cosines give simple harmonic motion — up and down, forward and back, 
out and in, tension and compression. Stretch a spring, and the restoring force pulls 
it back. Push a swing up, and gravity brings it down. These motions are controlled 
by a differential equation : 


d 2 y 

It 1 


-y- 


( 12 ) 


All solutions are combinations of the sine and cosine: y = A sin t + B cos t. 

This is not a course on differential equations. But you have to see the purpose of 
calculus. It models events by equations. It models oscillation by equation (12). Your 
heart fills and empties. Balls bounce. Current alternates. The economy goes up and 
down: 

high prices -* high production low prices -> ■ 

We can’t live without oscillations (or differential equations). 
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2.4 EXERCISES 


Read-through questions 


11 Find by calculator or calculus: 


The derivative of y = sin x is y' = a . The second deriva- 
tive (the b of the derivative) is y" = c . The fourth 
derivative is v"" = d . Thus y = sinx satisfies the 
differential equations y" = e and y"" = f . So does 
y = cos x, whose second derivative is g . 


(a) lim 
0 


sin 3h 
sin 2 h 


(b) lim 

h-* 0 


1 - cos 2 h 
1 — cos h ‘ 


12 Compute the slope at x = 0 directly from limits: 
(a) y = tan x (b) y = sin (— x) 


All these derivatives come from one basic limit: (sin h)/h 
approaches h . The sine of .01 radians is very close 
to i . So is the i of .01. The cosine of .01 is 
not .99, because 1 — cos h is much k than h. The ratio 
(1 — cos h)jh 2 approaches i . Therefore cos h is close to 
1 — \h 2 and cos .01 « rn . We can replace h by x. 

The differential equation y" — — y leads to n . When y 
is positive, y" is o . Therefore y' is p . Eventually y 
goes below zero and y" becomes q . Then y' is r . 
Examples of oscillation in real life are s and * . 


1 Which of these ratios approach 1 as h -> 0? 


(a) 


sin h 


(b) 


sin 2 Jt 


(c) 


sin h 
sin 2 h 


(d) 


sin (— h) 


2 (Calculator) Find (sinh)/h at fc = 0.5 and 0.1 and .01. 
Where does (sin h)/h go above .99? 

3 Find the limits as h -> 0 of 


(a) 


sin 2 /i 


(b) 


sin 5h 
5h 


(c) 


sin 5h 


(d) 


sin h 
~5h~ 


4 Where does tan h = 1.01 hi Where does tan h-W. 

5 y = sinx has period 2 tc, which means that sinx = 

. The limit of (sin(27i + h) — sin 2n)/h is 1 because 

. This gives dy/dx at x = , 

6 Draw cos(x + Ax) next to cos x. Mark the height differ- 
ence Ay. Then draw Ay/Ax as in Figure 2.9. 

7 The key to trigonometry is cos 2 0 = 1 — sin 2 0. Set 
sin 0«0 to find cos 2 0«l — 0 2 . The square root is 

cos 0 « 1 — £0 2 . Reason: Squaring gives cos 2 0 « 

and the correction term is very small near 0 = 0. 

8 (Calculator) Compare cos 0 with 1 — £0 2 for 

(a) 0 = 0. 1 (b) 0 = 0.5 (c) 0 = 30° (d) 0 = 3°. 

9 Trigonometry gives cos 0 = 1—2 sin 2 £0. The approxima- 
tion sin ^0 « leads directly to cos 0 « 1 — j0 2 . 

10 Find the limits as h -> 0: 


(a) 


1 — cos h 
h 2 


(c) 


1 —cos 2 ft 
sin 2 h 


(b) 


1 - cos 2 h 
h 2 


(d) 


1 — cos 2h 
h 


13 The unmarked points in Figure 2.11 are P and S. Find the 
height PS and the area of triangle OPR. Prove by areas that 
sin h<h. 


14 The slopes of cos x and 1 — \x 2 are — sin x and 

The slopes of sin x and are cos x and 1 — ^x 2 . 


15 Chapter 10 gives an infinite series for sin x: 


x x J 

sm x = - - 77 T— j- 




From the derivative find the series for cos x. Then take its 
derivative to get back to —sin x. 

16 A centered difference for /(x) = sin x is 

f(x + h) — /(x — h) sin (x + h) — sin (x — h) 0 
2h = 21 h 


Use the addition formula (2). Then let h -» 0. 

17 Repeat Problem 16 to find the slope of cos x. Use formula 
(3) to simplify cos (x -I- h) — cos (x - h). 

18 Find the tangent line to y = sin x at 

(a) x = 0 (b) x = n (c) x = n/4 

19 Where does y = sin x + cos x have zero slope? 

20 Find the derivative of sin(x + 1) in two ways: 

(a) Expand to sin x cos 1 + cos x sin 1. Compute dy/dx. 

(b) Divide Ay = sin (x + 1 + Ax) — sin (x + 1) by Ax. Write 
X instead of x + 1. Let Ax go to zero. 

21 Show that (tan h)/h is squeezed between 1 and 1/cos h. As 

h -* 0 the limit is . 

22 For y = sin 2x, the ratio A y/h is 

sin 2(x + h) — sin 2x sin 2x(cos 2h — 1) + cos 2x sin 2h 
h = h ’ 

Explain why the limit dy/dx is 2 cos 2x. 

23 Draw the graph of y = sin jx. State its slope at x = 0, n/2, 
n, and 2n. Does ? sin x have the same slopes? 

24 Draw the graph of y = sin x + N /3 cos x. Its maximum 

value is y = at x = . The slope at that point 

is , 

25 By combining sinx and cosx, find a combination that 

starts at x = 0 from y = 2 with slope 1. This combination also 
solves y" = 
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26 True or false, with reason: 

(a) The derivative of sin 2 x is cos 2 x 

(b) The derivative of cos(— x) is sin x 

(c) A positive function has a negative second derivative. 

(d) If / is increasing then y" is positive. 

27 Find solutions to dy/dx = sin 3x and dy/dx = cos 3x. 

28 If y = sin 5x then / = 5 cos 5x and y" — — 25 sin 5x. So 

this function satisfies the differential equation y” = . 


29 If h is measured in degrees, find lim fc _ 0 (sin h)/h. You could 
set your calculator in degree mode. 

30 Write down a ratio that approaches dy/dx at x = n. For 
y = sin x and Ax = .01 compute that ratio. 

31 By the square rule, the derivative of (u(x)) 2 is 2 u du/dx. 
Take the derivative of each term in sin 2 x + cos 2 x = 1. 

32 Give an example of oscillation that does not come from 
physics. Is it simple harmonic motion (one frequency only)? 

33 Explain the second derivative in your own words. 


2.5 The Product and Quotient and Power Rules 


What are the derivatives of x + sin x and x sin x and 1/sin x and x/sin x and sin"x? 
Those are made up from the familiar pieces x and sin x, but we need new rules. 
Fortunately they are rules that apply to every function, so they can be established 
once and for all. If we know the separate derivatives of two functions u and v , then 
the derivatives of u + v and uv and l/v and u/v and u n are immediately available. 

This is a straightforward section, with those five rules to learn. It is also an impor- 
tant section, containing most of the working tools of differential calculus. But I am 
afraid that five rules and thirteen examples (which we need — the eyes glaze over with 
formulas alone) make a long list. At least the easiest rule comes first. When we add 
functions , we add their derivatives. 

Sum Rule 

The derivative of the sum u(x ) + t;(x) is -f- (u + v) = ~ ^ . (1) 

dx dx dx 


EXAMPLE 1 The derivative of x + sin x is 1 + cos x. That is tremendously simple, 
but it is fundamental. The interpretation for distances may be more confusing (and 
more interesting) than the rule itself: 

Suppose a train moves with velocity 1. The distance at time t is t. On the train 
a professor paces back and forth (in simple harmonic motion). His distance from 
his seat is sin t. Then the total distance from his starting point is t + sin t, and 
his velocity (train speed plus walking speed) is 1 + cos t. 

If you add distances, you add velocities. Actually that example is ridiculous, because 
the professor’s maximum speed equals the train speed (= 1). He is running like mad, 
not pacing. Occasionally he is standing still with respect to the ground. 

The sum rule is a special case of a bigger rule called “ linearity .” It applies when 
we add or subtract functions and multiply them by constants — as in 3x - 4 sin x. By 
linearity the derivative is 3 - 4 cos x. The rule works for all functions u(x) and i?(x). 
A linear combination is y(x) = au(x) 4- bv(x), where a and b are any real numbers. 
Then Ay /Ax is 


au(x + Ax) + bv(x + Ax) - au(x) - bv(x) u(x + Ax) - u(x) , v(x + Ax) — u(x) 
= a i h b 


Ax 


Ax 


Ax 
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The limit on the left is dy/dx. The limit on the right is a du/dx + b dv/dx. We are 
allowed to take limits separately and add. The result is what we hope for: 

Rule of Linearity 

The derivative of au(x) + bv(x) is (au + bv) = a ^ + b — . (2) 

dx dx dx 

The product rule comes next. It can’t be so simple — products are not linear. The 
sum rule is what you would have done anyway, but products give something new. 
The derivative of u times v is not du/dx times dv/dx. Example: The derivative of x 5 
is 5x 4 . Don’t multiply the derivatives of x 3 and x 2 . (3x 2 times 2x is not 5x 4 .) 
For a product of two functions , the derivative has two terms. 


Product Rule (the key to this section) 

The derivative of u(x)v(x) is (uv) = u ~~ + v — . 

dx dx dx 


( 3 ) 


EXAMPLE 2 u = x 3 times v = x 2 is uv = x 5 . The product rule leads to 5x 4 : 

x 3 T~ + x 2 ~T “ * 3 (2x) + x 2 (3x 2 ) = 2x 4 + 3x 4 = 5x 4 . 
dx dx 


EXAMPLE 3 In the slope of x sin x, I don’t write dx/dx = 1 but it’s there: 

d 


dx 


(x sin x) = x cos x + sin x. 


EXAMPLE 4 If u = sin x and v = sin x then uv = sin 2 x. We get two equal terms: 
d d 

sin x — (sin x) + sin x — (sin x) = 2 sin x cos x. 
dx dx 

This confirms the “square rule” 2u du/dx , when u is the same as v. Similarly the slope 
of cos 2 x is - 2 cos x sin x (minus sign from the slope of the cosine). 

Question Those answers for sin 2 x and cos 2 x have opposite signs, so the derivative 
of sin 2 x + cos 2 x is zero (sum rule). How do you see that more quickly? 

EXAMPLE 5 The derivative of uv w is uvW + uv'w + u'v w — one derivative at a time. 
The derivative of xxx is xx 4- xx + xx. 


sum u + v Am + Ay 


m Am v Ay ^ v 


v{x) 


u(x + h) 



HH BiaiiBM 

product 
«( x)vQc) 1 





Ay 


vix) 


u{x) Am Am 

Fig. 2.13 Change in length = Am 4- Av. Change in area = u Av 4- v Au + Am Av. 
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After those examples we prove the product rule. Figure 2.13 explains it best. The 
area of the big rectangle is uv. The important changes in area are the two strips u Av 
and v Aw. The corner area Aw Av is much smaller. When we divide by Ax, the strips 
give w Av/Ax and v Aw/Ax. The corner gives Aw Av/Ax , which approaches zero. 

Notice how the sum rule is in one dimension and the product rule is in two 
dimensions. The rule for uvw would be in three dimensions. 

The extra area comes from the whole top strip plus the side strip. By algebra, 

w(x 4- h)v(x + h) — w(x)i;(x) = w(x + h)\y(x + h) — t;(x)] + y(x)[w(x + h) — w(x)]. (4) 

This increase is u(x + h)Av + i;(x)Aw — top plus side. Now divide by h (or Ax) and let 
h-> 0. The left side of equation (4) becomes the derivative of u(x)t;(x). The right side 
becomes w(x) times dv/dx— we can multiply the two limits — plus u(x) times du/dx. 
That proves the product rule — definitely useful. 

We could go immediately to the quotient rule for w(x)/u(x). But start with u— 1. 
The derivative of 1/x is -1/x 2 (known). What is the derivative of l/v(x)? 

Reciprocal Rule 


The derivative of — is — dv/dx 
v(x) v 


( 5 ) 


The proof starts with (v)(l/v)= 1. The derivative of 1 is 0. Apply the product rule: 


d (\ 


1 dv 


v— - ) + - — = 0 so that 


d \ 


dx\v I v dx 


dx\v 


- dv/dx 


( 6 ) 


It is worth checking the units — in the reciprocal rule and others. A test of dimen- 
sions is automatic in science and engineering, and a good idea in mathematics. The 
test ignores constants and plus or minus signs, but it prevents bad errors. If v is in 
dollars and x is in hours, dv/dx is in dollars per hour. Then dimensions agree: 

d ( l\ (1/dollars) , , -dv/dx dollars/hour 
“ dalso -?-* (dollars) 2 ' 

From this test, the derivative of l/v cannot be 1 /(dv/dx). A similar test shows that 
Einstein’s formula e = me 2 is dimensionally possible. The theory of relativity might 
be correct! Both sides have the dimension of (mass)(distance) 2 /(time) 2 , when mass 
is converted to energy.! 


EXAMPLE 6 The derivatives of x 1 9 x 2 , x "are -lx 2 , — 2x 3 , —nx " l . 
Those come from the reciprocal rule with t; = x and x 2 and any x": 


d , d ( 1 

dx (X * dx ( x" 


nx 


(x n 


— nx 


The beautiful thing is that this answer -nx " 1 fits into the same pattern as x". 
Multiply by the exponent and reduce it by one. 


For negative and positive exponents the derivative of x" is nx" 1 . 


( 7 ) 


fBut only Einstein knew that the constant is 1. 
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Reciprocal 


Quotient 


1 


v + Av 

u + Au 
v + Av 


1 

V 

u 

V 


-Av 
v(v + Av) 

v Au — u Av 
v(v 4- At;) 



Fig. 2.14 Reciprocal rule from (— Av)/v 2 . Quotient rule from (vAu — u Av)/v 2 . 


Au 

u 


EXAMPLE 7 


_ , . .1 ,1 + sinx . -cosx 

The derivatives of and — — are 5 — and —7-5 — . 

cos x sin x cos 2 x sin' 4 x 


Those come directly from the reciprocal rule. In trigonometry, 1/cos x is the secant 
of the angle x, and 1/sin x is the cosecant of x. Now we have their derivatives: 


d , x sin x 1 sin x 

— (sec x) = — =— = = sec x tan x. 

dx cos 2 x cos x cos x 


( 8 ) 


d , v cos x 1 cos x 

— (CSC x) = = : ; = — CSC X COt X. 

dx sin 2 x sin x sin x 


( 9 ) 


Those formulas are often seen in calculus. If you have a good memory they are worth 
storing. Like most mathematicians, I have to check them every time before using 
them (maybe once a year). It is really the rules that are basic, not the formulas. 

The next rule applies to the quotient u(x)/d(x). That is u times 1 /d. Combining the 
product rule and reciprocal rule gives something new and important: 

Quotient Rule 

, ... . u(x) . 1 du dvldx v dujdx — u dvldx 

The derivative of is - — - u = 5 — . 

d(x) v dx v tr 

You must memorize that last formula. The v 2 is familiar. The rest is new, but not very 
new. If i? = 1 the result is dujdx (of course). For u = 1 we have the reciprocal 
rule. Figure 2.14b shows the difference (u + Au)/(v + An) — (u/v). The denominator 
v(v + Ad) is responsible for v 2 . 


EXAMPLE 8 (only practice) If u/v = x 5 /x 3 (which is x 2 ) the quotient rule gives 2x: 
d /x 5 \ _ x 3 (5x*) — x 5 (3x 2 ) _ 5x 7 — 3x 7 _ ^ 

dx V*V x* ? 


EXAMPLE 9 (important) For u = sin x and v = cos x, the quotient is sin x/cos x = 
tan x. The derivative of tan x is sec 2 x. Use the quotient rule and cos 2 x + sin 2 x = 1: 

d ( sin x ^ _ cos x(cos x) - sin x(- sin x) _ 1 _ _ 2 „ ^ 

— I ] 2 2 see X. fit) 

dx\cosxJ COS X COS X 

Again to memorize: (tan x)' = sec 2 x. At x = 0, this slope is 1. The graphs of sinx 
and x and tan x all start with this slope (then they separate). At x = n/2 the sine 
curve is flat (cos x = 0) and the tangent curve is vertical (sec 2 x = 00 ). 

The slope generally blows up faster than the function. We divide by cos x, once 
for the tangent and twice for its slope. The slope of 1/x is — 1/x 2 . The slope is more 
sensitive than the function, because of the square in the denominator. 

d /sin x\ _ x cos x — sin x 
dx\ x J x 2 


EXAMPLE 10 
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That one I hesitate to touch at x = 0. Formally it becomes 0/0. In reality it is more 
like 0 3 /0 2 , and the true derivative is zero. Figure 2.10 showed graphically that (sin x)/x 
is flat at the center point. The function is even (symmetric across the y axis) so its 
derivative can only be zero. 

This section is full of rules, and I hope you will allow one more. It goes beyond x" 
to (u(x))\ A power of x changes to a power of u(x) — as in (sin x) 6 or (tan x) 7 or 
(x 2 + l) 8 . The derivative contains nu" -1 (copying nx" -1 ), but there is an extra factor 
du/dx. Watch that factor in 6(sin x) 5 cos x and 7(tan x) 6 sec 2 x and 8(x 2 + l) 7 (2x): 

Power Rule 

The derivative of ^u(x)]” is n[u(x)]" 1 (12) 


For n = 1 this reduces to du/dx = du/dx. For n = 2 we get the square rule 2m du/dx. 
Next comes u 3 . The best approach is to use mathematical induction, which goes from 
each n to the next power n + 1 by the product rule: 


—— (u n + 1 ) = — — (u n u) — u n h 

dx ( ’ dx K ’ dx 




du 

,„ + 1Ks . 


That is exactly equation (12) for the power n + 1. We get all positive powers this way, 
going up from n = 1 — then the negative powers come from the reciprocal rule. 

Figure 2.15 shows the power rule for n = 1, 2, 3. The cube makes the point 
best. The three thin slabs are u by u by Am. The change in volume is essentially 
3m 2 Am. From multiplying out (u + Au) 3 , the exact change in volume is 
3m 2 Am + 3m(Am) 2 + (Am) 3 — which also accounts for three narrow boxes and a midget 
cube in the corner. This is the binomial formula in a picture. 


u 


A u 

Fig. 2.15 



(Am ) 3 
1 cube 

m(Am ) 2 
3 bricks 


u 2 Au 
3 slabs 


Length change = Au; area change « 2 u Au; volume change « 3u 2 Au. 


EXAMPLE 11 (sin x) n = n(sin x)" 1 cos x. The extra factor cos x is du/dx. 

Our last step finally escapes from a very undesirable restriction — that n must be 
a whole number. We want to allow fractional powers n = p/q , and keep the same 
formula. The derivative of x" is still nx" -1 . 

To deal with square roots I can write (y/. x ) 2 = x. Its derivative is 2 v /x( v /x)' = 1. 
Therefore (>/x)' is 1/2^/x, which fits the formula when n = Now try n — p/q\ 
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Fractional powers Write u = x p/q as u q = x p . Take derivatives, assuming they exist: 


qu q ~ 1 


du 

dx 

du 

dx 

du 

dx 


px p ~ l 

px~ x 

qu~ l 

nx n ~ x 


(power rule on both sides) 
(cancel x p with u q ) 

(replace plq by n and u by x n ) 


EXAMPLE 12 The slope of x 1/3 is %x~ 2/3 . The slope is infinite at x = 0 and zero at 
x=oo. But the curve in Figure 2.16 keeps climbing. It doesn’t stay below an 
“asymptote.” 




Fig. 2.16 Infinite slope of x" versus zero slope: the difference between 0 < n < 1 and n > 1. 

EXAMPLE 13 The slope of x 4/3 is |x 1/3 . The slope is zero at x = 0 and infinite at 
x = oo. The graph climbs faster than a line and slower than a parabola (f is between 
1 and 2). Its slope follows the cube root curve (times f ). 

WE STOP NOW! I am sorry there were so many rules. A computer can memorize 
them all, but it doesn’t know what they mean and you do. Together with the chain 
rule that dominates Chapter 4, they achieve virtually all the derivatives ever computed 
by mankind. We list them in one place for convenience. 


Rule of Linearity 

(au + bv)' = aw' + bv' 

Product Rule 

(uv)' — uif 4- vu 9 

Reciprocal Rule 

(1/p)' = - p'/p 2 

Quotient Rule 

(u/p)' = (p«'-up')/p 2 

Power Rule 

(w")' = nu" _1 w' 


The power rule applies when n is negative , or a fraction, or any real number. The 

derivative of x* is nx n ~ \ according to Chapter 6. The derivative of (sin x)” is . 

And the derivatives of all six trigonometric functions are now established: 

(sin x)' = cos x (tan x)' = sec 2 x (sec x)' = sec x tan x 

(cos x)' = — sin x (cot x)' = - esc 2 x (esc x)' = — esc x cot x. 
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2.5 EXERCISES 


Read-through questions 

The derivatives of sinx cosx and 1/cos x and sinx/cosx 
and tan 3 x come from the a rule, b rule, c rule, 
and d rule. The product of sin x times cos x has 
(uv)' = uv' + e = f . The derivative of l/v is 9 . 
so the slope of sec x is h . The derivative of u/v is j . 
so the slope of tan x is I . The derivative of tan 3 x is 
k . The slope of x" is I and the slope of (u(x))" is 
m . With n = — 1 the derivative of (cosx) -1 is n . 
which agrees with the rule for sec x. 

Even simpler is the rule of o . which applies to 
au(x) + bv(x). The derivative is P . The slope of 3 sin x 4- 
4 cos x is q . The derivative of (3 sin x + 4 cos x) 2 is 
_j_. The derivative of « is 4 sin 3 x cos x. 


Find the derivatives of the functions in 1-26. 


1 (x + l)(x - 1) 

„ 1 1 

3 h : 

1 + x 1+sinx 

5 (x — l)(x — 2)(x — 3) 

7 x 2 cos x + 2x sin x 

„ x 3 4- 1 cos x 

9 - + - 

x + 1 sinx 

11 x 1/2 sin 2 x + (sin x) 1/2 

13 x 4 cos x + x cos 4 x 

15 ^x 2 sin x — x cos x -h sin x 

17 sec 2 x — tan 2 x 


19 


4 

(x — 5) 2/3 


4 

+ (5 — x) 2/3 


21 (sin x cos x) 3 + sin 2x 
23 m(x)i;(x)w(x)z(x) 


2 (x 2 + l)(x 2 - 1) 
, 1 1 


1 + x 2 ' 1 — sin x 

6 (x — l) 2 (x — 2) 2 

8 x 1/2 (x + sin x) 

x 2 + 1 sin x 
10 — — -4- 

x — 1 COS X 


12 x 3/2 sin 3 x + (sin x) 3/2 

14 V^(v^+l)(V^ + 2) 

16 (x — 6) 10 + sin 10 x 

18 esc 2 x — cot 2 x 

^ sin x - cos x 

20 

sin x + cos x 

22 x cos x esc x 

24 [«(x)] 2 [»(x)] 2 


25 26 x sin x + cos x 

tan x cot x 

27 A growing box has length r, width 1/(1 + t), and height 
cos t. 

(a) What is the rate of change of the volume? 

(b) What is the rate of change of the surface area? 

28 With two applications of the product rule show that the 

derivative of uvw is uvw' + uv'w + u'vw. When a box with sides 
m, v , w grows by Am , At;, Aw, three slabs are added with volume 
uv Aw and and . 

29 Find the velocity if the distance is f(t) = 

5 1 2 for t ^ 10, 500 + 100^-10 for t > 10. 


t 3/2 t 

30 A cylinder has radius r = ^ and height h = . 

J l+t 3/2 & 1 -Ft 

(a) What is the rate of change of its volume? 

(b) What is the rate of change of its surface area (including 
top and base)? 

31 The height of a model rocket is f(t) = t 3 /( 1 + 0- 

(a) What is the velocity v(t)l 

(b) What is the acceleration dv/dtl 

32 Apply the product rule to «(x)m 2 (x) to find the power rule 
for w 3 (x). 

33 Find the second derivative of the product u(x)t;(x). Find 
the third derivative. Test your formulas onu = v = x. 

34 Find functions y(x) whose derivatives are 

(a) x 3 (b) 1/x 3 (c) (1 — x) 3/2 (d) cos 2 x sinx. 

35 Find the distances f(t), starting from /( 0) = 0, to match 
these velocities: 

(a) v(t) = cos t sin t (b) v(t) = tan t sec 2 1 

( C ) i>(t)=yr+t 

36 Apply the quotient rule to (u(x)) 3 /(w(x)) 2 and —v'/v 2 . 

The latter gives the second derivative of 

37 Draw a figure like 2.13 to explain the square rule. 

38 Give an example where u(x)/v(x ) is increasing but du/dx = 
dv/dx = 1. 

39 True or false , with a good reason: 

(a) The derivative of x 2 ” is 2«x 2n_1 . 

(b) By linearity the derivative of a(x)u(x) + b(x)v(x) is 
fl(x) du/dx + b(x) dv/dx. 

(c) The derivative of |x| 3 is 3|x| 2 . 

(d) tan 2 x and sec 2 x have the same derivative. 

(e) (uv)' = mV is true when u(x) = 1. 

40 The cost of u shares of stock at v dollars per share is uv 
dollars. Check dimensions of d(uv)/dt and u dv/dt and v du/dt. 

41 If u(x)/t;(x) is a ratio of polynomials of degree n , what are 
the degrees for its derivative? 

42 For y = 5x + 3, is (dy/dx) 2 the same as d 2 y/dx 2< l 

43 If you change from f(t) = tcost to its tangent line at 
t = 7c/ 2, find the two-part function df /dt. 

44 Explain in your own words why the derivative of u(x)t;(x) 
has two terms. 

45 A plane starts its descent from height y — h at x = — L 
to land at (0, 0). Choose a, b, c, d so its landing path 
y = ax 3 + bx 2 + cx + d is smooth. With dx/dt — V= constant, 
find dy/dt and d 2 y/dt 2 at x = 0 and x = — L. (To keep 
d 2 y/dt 2 small, a coast-to-coast plane starts down L>100 
miles from the airport.) 
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2.6 Limits 


You have seen enough limits to be ready for a definition. It is true that we have 
survived this far without one, and we could continue. But this seems a reasonable 
time to define limits more carefully. The goal is to achieve rigor without rigor mortis. 

First you should know that limits of Ay/ Ax are by no means the only limits in 
mathematics. Here are five completely different examples. They involve n -► oo, not 
Ax -> 0: 

1. a n = (n - 3 )/(n + 3) (for large n, ignore the 3’s and find a n -> 1) 

2. a n = t 4- 4 (start with any a x and always a n -> 8) 

3. a n = probability of living to year n (unfortunately a n -* 0) 

4. a n — fraction of zeros among the first n digits of n ( a n -* ^?) 

5. a x = .4, a 2 = .49, a 3 = .493, .... iVo matter what the remaining decimals are , t/ie 
a’s converge to a limit . Possibly a n -► .493000 ..., but not likely. 

The problem is to say what the limit symbol -► really means . 

A good starting point is to ask about convergence to zero. When does a sequence 
of positive numbers approach zero? What does it mean to write a n -► 0? The numbers 
must become “small,” but that is too vague. We will propose four 
definitions of convergence to zero , and I hope the right one will be clear. 

1. All the numbers a n are below 10“ 10 . That may be enough for practical purposes, 
but it certainly doesn’t make the a n approach zero. 

2. The sequence is getting closer to zero — each a„ +1 is smaller than the preceding 
a n . This test is met by 1.1, 1.01, 1.001, ... which converges to 1 instead ofO. 

3. For any small number you think of , at least one of the af s is smaller. That pushes 

something toward zero, but not necessarily the whole sequence. The condition would 
be satisfied by 1, j, 1, 1, J, ... , which does not approach zero. 

4. For any small number you think of, the af s eventually go below that number and 
stay below. This is the correct definition. 

I want to repeat that. To test for convergence to zero, start with a small number — 
say 10“ 10 . The a^’s must go below that number. They may come back up and go 
below again — the first million terms make absolutely no difference. Neither do the 
next billion, but eventually all terms must go below 10“ 10 . After waiting longer 
(possibly a lot longer), all terms drop below 10 “ 20 . The tail end of the sequence 
decides everything. 

Question 1 Does the sequence 10" 3 , 10~ 2 , 10“ 6 , 10" 5 , 10“ 9 , 10 -8 , ... approach 0? 
Answer Yes. These up and down numbers eventually stay below any s. 




a n < e if n > 6 


• • • 


a p a 3 , • • • above e 



non-convergence 


Fig. 2.17 Convergence means: Only a finite number of a’s are outside any strip around L. 
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Question 2 Does 10 4 , 10 6 , 10 4 , 10 8 , 10 4 , 10 10 , ... approach zero? 

Answer No. This sequence goes below 10' 4 but does not stay below. 

There is a recognized symbol for “an arbitrarily small positive number.” By 
worldwide agreement, it is the Greek letter e (epsilon). Convergence to zero means 
that the sequence eventually goes below s and stays there. The smaller the e, the tougher 
the test and the longer we wait. Think of e as the tolerance, and keep reducing it. 

To emphasize that s comes from outside, Socrates can choose it. Whatever e he 
proposes, the a’s must eventually be smaller. After some a N , all the a’s are below the 
tolerance e. Here is the exact statement: 

for any e there is an N such that a n <e if n> N. 

Once you see that idea, the rest is easy. Figure 2.17 has N = 3 and then N = 6. 

EXAMPLE 1 The sequence f, f, ... starts upward but goes to zero. Notice that 
1, 4, 9, ... , 100, ... are squares, and 2, 4, 8, ... , 1024, ... are powers of 2. Eventually 2" 
grows faster than n 2 , as in a 10 = 100/1024. The ratio goes below any e. 

EXAMPLE 2 1, 0, j, 0, 0, ... approaches zero. These a’s do not decrease steadily 
(the mathematical word for steadily is “monotonically”) but still their limit is zero. 
The choice e= 1/1000 produces the right response: Beyond a 20 oi terms are below 
1/1000. So N = 2001 for that e. 

The sequence 1, j, 3, ... is much slower — but it also converges to zero. 

Next we allow the numbers a n to be negative as well as positive. They can converge 
upward toward zero, or they can come in from both sides. The test still requires the 
a„ to go inside any strip near zero (and stay there). But now the strip starts at -e. 

The distance from zero is the absolute value \a n \. Therefore a n ~* 0 means \a n \ -> 0. 
The previous test can be applied to |aj: 

for any e there is an N such that |a„| <eif n> N. 

EXAMPLE 3 1, — i,i, — i,... converges to zero because 1 , 3 , $ , i , . . . converges to zero. 

It is a short step to limits other than zero. The limit is L if the numbers a„-L 
converge to zero. Our final test applies to the absolute value \a„ — L\: 

for any e there is an N such that |a„ — L\ < e if n > N. 

This is the definition of convergence! Only a finite number of a’s are outside any strip 
around L (Figure 2.18). We write a n -* L or lim a„ = L or lim,,-,*, a„ = L. 




+ 


1 1 11111 



1 11111 



Fig. 2.18 a n -+ 0 in Example 3; a n -> 1 in Example 4; a n -> oo in Example 5 (but a n+1 -a n -+ 0). 
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EXAMPLE 4 The numbers §, f, J, ... converge to L=l. After subtracting 1 the 
differences i, i, ... converge to zero. Those difference are \a n — L\. 

EXAMPLE 5 The sequence 1, 1 + i, 1 + i + 3 , 1 + i + i ... fails to converge. 

The distance between terms is getting smaller. But those numbers a l9 a 2 , a 3 , a 4 , ... go 
past any proposed limit L. The second term is 1^. The fourth term adds on i + 5, 
so a 4 goes past 2. The eighth term has four new fractions i + £ + 4 + i, totaling 
more than i + £ + i + i = i. Therefore a 8 exceeds 2\. Eight more terms will add more 
than 8 times so a 16 is beyond 3. The lines in Figure 2.18c are infinitely long, not 
stopping at any L. 

In the language of Chapter 10, the harmonic series 1 + j + i + • • • does not converge. 
The sum is infinite, because the “partial sums” a n go beyond every limit L (a 5000 is 
past L = 9). We will come back to infinite series, but this example makes a subtle 
point: The steps between the a n can go to zero while still a n -> 00. 

Thus the condition a n+1 — a n -+ 0 is not sufficient for convergence. However this 
condition is necessary. If we do have convergence, then a n+ 1 — a n -* 0. That is a good 
exercise in the logic of convergence, emphasizing the difference between “sufficient” 
and “necessary.” We discuss this logic below, after proving that [statement A] implies 
[statement B~\\ 

If [a n converges to L] then [a w + 1 — a n converges to zero]. (1) 

Proof Because the a n converge, there is a number N beyond which \a n — L\<s and 
also \a n + 1 - L\ < s. Since a n + 1 - a n is the sum of a n + 1 - L and L- a„, its absolute 
value cannot exceed s + e = 2s. Therefore a n+i - a n approaches zero. 

Objection by Socrates : We only got below 2e and he asked for s. Our reply : If he 
particularly wants \a n + l ~ a n \< 1/10, we start with s = 1/20. Then 2e = 1/10. But this 
juggling is not necessary. To stay below 2s is just as convincing as to stay below e. 

THE LOGIC OF “IF” AND “ONLY IF” 

The following page is inserted to help with the language of mathematics. In ordinary 
language we might say “I will come if you call.” Or we might say “I will come only 
if you call.” That is different! A mathematician might even say “I will come if and 
only if you call.” Our goal is to think through the logic, because it is important and 
not so familiar.t 

Statement A above implies statement B. Statement A is a n -> L; statement B is 
<*n + i~ a n ^>0. Mathematics has at least five ways of writing down A=>B , and I 
though you might like to see them together. It seems excessive to have so many 
expressions for the same idea, but authors get desperate for a little variety. Here are 
the five ways that come to mind: 

A=> B 
A implies B 
if A then B 

A is a sufficient condition for B 
B is true if A is true 


tLogical thinking is much more important than s and 3 . 
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EXAMPLES /f [positive numbers are decreasing] then [they converge to a limit], 
//■[sequences a n and b n converge] then [the sequence a n + b n converges]. 

Iflf(x) is the integral of u(x)] then [t;(x) is the derivative of /(x)]. 

Those are all true, but not proved. A is the hypothesis, B is the conclusion. 

Now we go in the other direction. (It is called the “converse,” not the inverse.) We 
exchange A and B. Of course stating the converse does not make it true! B might 
imply A, or it might not. In the first two examples the converse was false — the a n 
can converge without decreasing, and a n + b n can converge when the separate 
sequences do not. The converse of the third statement is true — and there are five 
more ways to state it: 

A<= B 

A is implied by B 
if B then A 

A is a necessary condition for B 
B is true only if A is true 

Those words “necessary” and “sufficient” are not always easy to master. The same 
is true of the deceptively short phrase “if and only if.” The two statements A=> B and 
A <= B are completely different and they both require proof. That means two separate 
proofs. But they can be stated together for convenience (when both are true): 

AoB 

A implies B and B implies A 
A is equivalent to B 

A is a necessary and sufficient condition for B 
A is true if and only if B is true 

EXAMPLES [a„ -► L] o [ 2a n -+2L ] o [a„+l-L+l] o [«„-L-> 0]. 

RULES FOR LIMITS 

Calculus needs a definition of limits, to define dy/dx. That derivative contains two 
limits: Ax -> 0 and Ay/Ax -► dy/dx. Calculus also needs rules for limits, to prove the 
sum rule and product rule for derivatives. We started on the definition, and now we 
start on the rules. 

Given two convergent sequences, a n ^L and b n -+ M, other sequences also converge: 

Addition: a„ + b n -> L + M Subtraction: a n - -> L — M 

Multiplication: a n b n ^LM Division: a n /b n ^L/M (provided M ^ 0) 

We check the multiplication rule, which uses a convenient identity: 

a n b n ~ LM — (a n - L)(b n - M) + M(a n - L) + L(b n - M). (2) 

Suppose \a n — L| < e beyond some point N, and \b n — M\ < e beyond some other point 
N'. Then beyond the larger of N and N', the right side of (2) is small. It is less than 
s -s + Me + Le. This proves that (2) gives a n b n -* LM. 

An important special case is ca n -► cL. (The sequence of fe’s is c, c, c, c, ....) Thus a 
constant can be brought “outside” the limit, to give lim ca n = c lim a n . 
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THE LIMIT OF f(x) AS x -*cr 

The final step is to replace sequences by functions. Instead of a x , a 2 , ... there is a 
continuum of values /(x). The limit is taken as x approaches a specified point a 
(instead of n -* oo). Example: As x approaches a = 0, the function /(x) = 4 - x 2 
approaches L = 4. As x approaches a - 2, the function 5x approaches L = 10. Those 
statements are fairly obvious, but we have to say what they mean. Somehow it must 
be this: 

if x is close to a then f(x) is close to L. 

If x - a is small, then f(x) - L should be small. As before, the word small does not 
say everything. We really mean “arbitrarily small,” or “below any e.” The difference 
f(x) — L must become as small as anyone wants, when x gets near a. In that case 
lim x -, e /(x) = L. Or we write /(x) -*• L as x -* a. 

The statement is awkward because it involves two limits. The limit x -» a is forcing 
/(x) -* L. (Previously n -* oo forced a n -* L.) But it is wrong to expect the same e in 
both limits. We do not and cannot require that |x — a\ < e produces |/(x) — L\ < e. 
It may be necessary to push x extremely close to a (closer than e). We must guarantee 
that if x is close enough to a, then |/(x) - L\<e. 

We have come to the “ epsilon-delta definition” of limits. First, Socrates chooses 6. 
He has to be shown that /(x) is within e of L, for every x near a. Then somebody 
else (maybe Plato) replies with a number <5. That gives the meaning of “near a.” 
Plato’s goal is to get /(x) within e of L, by keeping x within <5 of a: 

if 0 < |x — a\ < 5 then |/(x) — L| < e. (3) 

The input tolerance is <5 (delta), the output tolerance is e. When Plato can find a 3 
for every e, Socrates concedes that the limit is L. 

EXAMPLE Prove that lim 5x = 10. In this case a = 2 and L = 10. 

x ~*2 

Socrates asks for |5x - 10| < e. Plato responds by requiring |x - 2| < 3. What 3 should 
he choose? In this case |5x — 10| is exactly 5 times |x — 2|. So Plato picks <5 below e/5 
(a smaller 3 is always OK). Whenever |x - 2| < e/5, multiplication by 5 shows that 
|5x - 10| < e. 

Remark 1 In Figure 2.19, Socrates chooses the height of the box. It extends above 
and below L, by the small number e. Second, Plato chooses the width. He must make 
the box narrow enough for the graph to go out the sides. Then |/(x) - L| < e. 



k f(x) — » L as x a 


1 limit L is not f(a) ‘ 

m 

1 f(x) = step function 

i 

a 

L+e ■ 

L—e j 

i H 

L- 

: h 

• no limit L 

; as x —> a 

a 

i 

i 


1 — i — i 


1 — 1 — 1 ► X 

1 — i — i 


a-8 a + 8 a a 


Fig. 2.19 S chooses height 2e, then P chooses width 25. Graph must go out the sides. 
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When /(x) has a jump, the box can’t hold it. A step function has no limit as x 
approaches the jump, because the graph goes through the top or bottom — no matter 
how thin the box. 

Remark 2 The second figure has fix) -> L, because in taking limits we ignore the 
final point x = a. The value f(a) can be anything, with no effect on L. The first figure 
has more: /(a) equals L. Then a special name applies — / is continuous. The left figure 
shows a continuous function, the other figures do not. 

We soon come back to continuous functions. 

Remark 3 In the example with /= 5x and 3 = e/5, the number 5 was the slope. That 
choice barely kept the graph in the box — it goes out the corners. A little narrower, 
say 5 = e/10, and the graph goes safely out the sides. A reasonable choice is 
to divide e by 2|/'(a)|. (We double the slope for safety.) I want to say why this 3 
works — even if the e-3 test is seldom used in practice. 

The ratio of fix) - L to x - a is distance up over distance across. This is A//Ax, 
close to the slope /'(a). When the distance across is 3, the distance up or down is 
near <5|/'(a)|. That equals e/2 for our “reasonable choice” of 3 — so we are safely 
below e. This choice solves most exercises. But Example 7 shows that a limit might 
exist even when the slope is infinite. 

EXAMPLE 7 lim y/x — 1 =0 (a one-sided limit). 

Notice the plus sign in the symbol x -*• 1 + . The number x approaches a = 1 only from 
above. An ordinary limit x -► 1 requires us to accept x on both sides of 1 (the exact 
value x = 1 is not considered). Since negative numbers are not allowed by the square 
root, we have a one-sided limit. It is L = 0. 

Suppose e is 1/10. Then the response could be 3 = 1/100. A number below 1/100 
has a square root below 1/10. In this case the box must be made extremely 
narrow, 3 much smaller than e, because the square root starts with infinite slope. 

Those examples show the point of the e-3 definition. (Given e, look for 3. This 
came from Cauchy in France, not Socrates in Greece.) We also see its bad feature: 
The test is not convenient. Mathematicians do not go around proposing g’s and 
replying with (5’s. We may live a strange life, but not that strange. 

It is easier to establish once and for all that 5x approaches its obvious limit 5a. 
The same is true for other familiar functions: x”-*a n and sin x -*• sin a and 
(1 — x) -1 -» (1 — a) -1 — except at a= 1. The correct limit L comes by substituting 
x- a into the function. This is exactly the property of a “ continuous function .” Before 
the section on continuous functions, we prove the Squeeze Theorem using e and 3. 

2W Squeeze Theorem Suppose fix) < g(x) < h(x) for x near a. If f(x)-*L 

and h(x) -> L as x -*• a, then the limit of gfx) is also £. 


Proof g(x) is squeezed between /(x) and h(x). After subtracting L, g(x) - L is between 
f{x)- L and h(x) - L. Therefore 

|g(x)-L|<e if |/(x)-L|<e and |/i(x) - L| < g. 

For any s, the last two inequalities hold in some region 0 < |x - a\ < 3. So the first 
one also holds. This proves that g(x) -* L. Values at x = a are not involved — until 
we get to continuous functions. 
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2.6 EXERCISES 


Read-through questions 

The limit of a n = (sin n)/n is a . The limit of a n = n 4 /2 n is 
b . The limit of a„ = (— 1)" is c . The meaning of a„ -> 0 
is: Only d of the numbers \a n \ can be e . The meaning 

of a„ -> L is: For every f there is an g such that 

h if n > i The sequence 1, 1 + i, 1 + j . . . is not 

1 because eventually those sums go past k , 


*5 If the sequence a x , a 2 , a 5 , ... approaches zero, prove that 
we can put those numbers in any order and the new sequence 
still approaches zero. 

*6 Suppose f(x)->L and g(x) -» M as x -* a. Prove from the 
definitions that f(x) T g(x) -► L + M as x a. 

Find the limits 7-24 if they exist. An e-S test is not required. 


The limit of f(x) = sin x as x -> a is l The limit of 

f(x) - x/|x| as x -> — 2 is m , but the limit as x -► 0 does 

not n . This function only has o -sided limits. The 

meaning of lim x -> fl /(x) = L is: For every e there is a 3 such 

that | f{x) — L\<e whenever p . 


7 lim 


t + 3 




9 lim / 1 > + I ! ) - /W (careful) 

x~*o h 


8 lim 


t 2 + 3 


2 t — 2 

/(l + fc)-/(l) 


10 lim 

h^o h 


Two rules for limits, when a n ^> L and b n -► M, are 
a n + b n -> q and a n b n -> r The corresponding rules 

for functions, when /(x)->L and g(x)->M as x-+a, are 
s and t . In all limits, \a n — L\ or |/(x) — L\ must 
eventually go below and u any positive v 

A => B means that A is a w condition for B. Then B is 
true x A is true. A <=> B means that A is a y condition 
for B . Then B is true * A is true. 

1 What is a 4 and what is the limit L? After which N is 
| a n — L\ < yo? (Calculator allowed) 

(a) — 1, + t, — i, ... (b) i + i, j + { + ... 

(c) 4, I, l ... a n = nf 2 n (d) 1.1, 1.11, 1.111, ... 

(e) a„ = '■!/” (0 a « = \/n 2 + n-n 

(g) l + Ul+i) 2 ,(l+i) 3 , ... 

2 Show by example that these statements are false: 

(a) If a n -> L and b n L then a n /b n 1 

(b) -> L if and only if -► L 2 

(c) If a n < 0 and a n ~* L then L < 0 

(d) If infinitely many a „' s are inside every strip around 
zero then a n -> 0. 

3 Which of these statements are equivalent to B=> A? 

(a) If A is true so is B 

(b) A is true if and only if B is true 

(c) B is a sufficient condition for A 

(d) A is a necessary condition for B. 

4 Decide whether A => B or B => A or neither or both: 

(a) A = [a n ~* 1] J5 = [-a„-+- 1] 

(b) Am[a n -* 0] B = k-a.-i -*0] 

(c) .4 = [u„ sS /i] B = [a„ = n] 

(d) /4 = [a n -»• 0] B = [sin a„ 0] 

(e) A = [a„ -> 0] B = [l/u„ fails to converge] 

(f) A = [u„ < n] B = [a„/n converges] 


sin 2 h cos 2 h 
11 lim -7= 

/i 2 


12 lim 


2x tan x 


sin x 


13 lim — - (one-sided) 

x-^0 + X 


| x I 

14 lim (one-sided) 

x^o- x 


15 lim 


sm x 


17 lim 


X -*• 1 x 

X 2 + 25 


5 x — 5 


.. 1 + X — 1 


16 lim /(c) - /(a) 


18 lim 


c-^a c — a 
x 2 — 25 


5 x — 5 


19 lim 

x->0 X 


(test x = .01) 20 lim 




x ^ 2 - / 6 + x 


21 lim [/(x) -/(a)](?) 


22 lim (sec x — tan x) 

x-"7r,/2 


23 lim — 


x^o sin x/2 


sin(x— 1) 

24 lim — ^ 

x 1 X —1 


25 Choose S so that |/(x)| < njo if 0 < x < S. 

f(x)= lOx f (x) = y/x f(x) = sin 2x f(x) = x sin x 

26 Which does the definition of a limit require? 

(1) \f(x) — L\ < e => 0<|x — a\<3 

(2) |/(x) — L\ < e <= 0<|x — a\<S 

(3) |/ (x) — L| < e o 0 < |x — n| < <5 

27 The definition of k ‘/(x)-^L as x oo” is this: For any 

e there is an X such that < e if x > X. Give an 

example in which f(x) 4 as x -> oc-. 

28 Give a correct definition of kt /(x) -► 0 as x ->■ — oo." 

29 The limit of f(x) = ( sin x)/x as x -> x is . For 

e = .01 find a point X beyond which |/(x)| < £. 

30 The limit of f(x) = 2x/(l + x) as x -+ x is L=2. For 
e = .01 find a point X beyond which |/(x) — 2| < e. 


31 The limit of f(x) = sin x as x x does not exist. Explain 
why not. 
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32 

33 

34 


(Calculator) Estimate the limit 


imit of ^1 + 


as x -» oo. 


For the polynomial f(x) = 2x — 5x 2 + lx 3 find 
(b) lim /(*) 


(a) lim f(x) 

X~> 1 

(c) lim 

JC-^OO X 


(d) lim 

x~~* — oo x 


For /(x) = 6x 3 + lOOOx find 


(a) lim 

X-»00 X 

(c) lim tw 

x->oo X 


(b) lim aa 

x->°o x 


(d) lim 


/w 

x 3 + 1 


Important rule As x -► oo the ratio of polynomials f(x)/g{x) 
has the same limit as the ratio of their leading terms. f(x) = 
x 3 — x + 2 has leading term x 3 and g(x) = 5x 6 + x -f 1 has 
leading term 5x 6 . Therefore f(x)/g(x) behaves like x 3 /5x 6 -► 0, 
g(x)/f{x) behaves like 5x 6 /* 3 -> oo, (f(x)) 2 /g(x) behaves like 
x 6 /5x 6 -> 1/5. 


35 Find the limit as x oo if it exists: 

3x 2 + 2x + 1 x 4 x 2 + 1000 . 1 

3 + 2x + x 2 x 3 + x 2 x 3 — 1000 * Sm x * 


38 If a n -> L prove that there is a number N with this prop- 
erty: If n > N and m> N then \a n — a m \ < 2e. This is Cauchy’s 
test for convergence. 

39 No matter what decimals come later, a x = .4, a 2 = .49, 
a 3 = .493, ... approaches a limit L. How do we know (when 
we can’t know L)? Cauchy's test is passed: the a' s get closer 
to each other. 

(a) From a A onwards we have \a n — a m \< , 

(b) After which a N is \a m — a n \ < 10 -7 ? 

40 Choose decimals in Problem 39 so the limit is L = .494. 
Choose decimals so that your professor can’t find L. 

41 If every decimal in . abcde is picked at random from 
0, 1, ..., 9, what is the “average” limit L? 

42 If every decimal is 0 or 1 (at random), what is the average 
limit L? 

43 Suppose a n = ja n - l +4 and start from a x = 10. Find a 2 
and a 3 and a connection between a n — 8 and a n - 1 — 8. Deduce 
that a n 8. 

44 “For every d there is an e such that |/(x)| < e if |x| < d .” 
That test is twisted around. Find e when /(x) = cos x, which 
does not converge to zero. 


36 If a particular S achieves |/(x) — L| <e, why is it OK to 
choose a smaller <5? 

37 The sum of 1 +r + r 2 4- ••• +r" -1 is a n = { 1 -r")/(l -r). 
What is the limit of a n as n -* oo? For which r does the limit 
exist? 


45 Prove the Squeeze Theorem for sequences, using e: If 
a n L and c n -» L and a n ^b n ^ c„ for n> N, then b n -> L. 

46 Explain in 110 words the difference between “we will get 
there if you hurry” and “we will get there only if you hurry” 
and “we will get there if and only if you hurry.” 
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This will be a brief section. It was originally included with limits, but the combination 
was too long. We are still concerned with the limit of f(x) as x -► a, but a new number 
is involved. That number is f(a ), the value off at x = a. For a “limit,” x approached 
a but never reached it — so f(a) was ignored. For a “continuous function,” this final 
number f(a) must be right. 

May I summarize the usual (good) situation as x approaches a? 

1. The number f{a) exists (/ is defined at a) 

2. The limit of /(x) exists (it was called L) 

3. The limit L equals f(a) (f(a) is the right value) 

In such a case, /(x) is continuous at x = a. These requirements are often written in a 
single line: /(x) -+f{a) as x a. By way of contrast, start with four functions that are 
not continuous at x = 0. 
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Fig. 2.20 Four types of discontinuity (others are possible) at x = 0. 


In Figure 2.20, the first function would be continuous if it had /( 0) = 0. But it has 
f(0)= 1. After changing /( 0) to the right value, the problem is gone. The discontinuity 
is removable. Examples 2, 3, 4 are more important and more serious. There is no 
“correct” value for /( 0): 

2. /(x) = step function (jump from 0 to 1 at x = 0) 

3. /(x) = 1/x 2 (infinite limit as x -* 0) 

4. f(x) — sin (1/x) (infinite oscillation as x -» 0). 

The graphs show how the limit fails to exist. The step function has a jump discontinu- 
ity. It has one-sided limits, from the left and right. It does not have an ordinary (two- 
sided) limit. The limit from the left (x -* 0”) is 0. The limit from the right (x -* 0 + ) 
is 1. Another step function is x/|x|, which jumps from —1 to 1. 

In the graph of 1/x 2 , the only reasonable limit is L= + oo. I cannot go on record 
as saying that this limit exists. Officially, it doesn’t — but we often write it anyway: 
1/x 2 -» oo as x -» 0. This means that 1/x 2 goes (and stays) above every L as x -» 0. 
In the same unofficial way we write one-sided limits for /(x)= 1/x: 

From the left, lim - = - oo. From the right, lim - = + oo. (1) 

x-»0“ X jc->0 + X 

Remark 1/x has a “pole” at x = 0. So has 1/x 2 (a double pole). The function 
l/(x 2 - x) has poles at x = 0 and x = 1. In each case the denominator goes to zero 
and the function goes to + oo or - oo. Similarly 1/sin x has a pole at every multiple 
of 7t (where sin x is zero). Except for 1/x 2 these poles are “ simple ” — the functions are 
completely smooth at x = 0 when we multiply them by x: 

(x)(-J=l and (x)( 2 *_ ) = ~~t an d (x)f-r— ) are continuous at x = 0. 

V x ) V x x / x 1 V sin x j 

1/x 2 has a double pole, since it needs multiplication by x 2 (not just x). A ratio of 
polynomials P(x)/Q(x) has poles where 6 = 0, provided any common factors like 
(x + l)/(x + 1) are removed first. 

Jumps and poles are the most basic discontinuities, but others can occur. The 
fourth graph shows that sin (1/x) has no limit as x -*• 0. This function does not blow 
up; the sine never exceeds 1. At x = 5 and \ and totto it equals sin 3 and sin 4 and 
sin 1000. Those numbers are positive and negative and (?). As x gets small and 1/x 
gets large, the sine oscillates faster and faster. Its graph won’t stay in a small box of 
height e, no matter how narrow the box. 

CONTINUOUS FUNCTIONS 

DEFINITION / is “ continuous at x = a” if /(a) is defined and /(x) -» /(a) as x -> a. 
If / is continuous at every point where it is defined, it is a continuous function. 


2.7 Continuous Functions 
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Objection The definition makes /(x) = l/x a continuous function! It is not defined 
at x = 0, so its continuity can’t fail. The logic requires us to accept this, but we don’t 
have to like it. Certainly there is no /( 0) that would make l/x continuous at x = 0. 

It is amazing but true that the definition of “continuous function” is still debated 
(Mathematics Teacher, May 1989). You see the reason — we speak about a discontinu- 
ity of l/x, and at the same time call it a continuous function. The definition misses 
the difference between l/x and (sinx)/x. The function /(x) = (sinx)/x can be made 
continuous at all x. Just set /( 0) = 1. 

We call a function “continuable" if its definition can be extended to all x in a way 
that makes it continuous. Thus (sin x)/x and -Jx are continuable. The functions l/x 
and tan x are not continuable. This suggestion may not end the debate, but I hope 
it is helpful. 

EXAMPLE 1 sin x and cos x and all polynomials P(x) are continuous functions. 

EXAMPLE 2 The absolute value |x| is continuous. Its slope jumps (not continuable). 

EXAMPLE 3 Any rational function P(x)/Q(x) is continuous except where Q = 0. 

EXAMPLE 4 The function that jumps between 1 at fractions and 0 at non-fractions 
is discontinuous everywhere. There is a fraction between every pair of non-fractions 
and vice versa. (Somehow there are many more non-fractions.) 

EXAMPLE 5 The function 0* 2 is zero for every x, except that 0° is not defined. So 
define it as zero and this function is continuous. But see the next paragraph where 
0° has to be 1. 

We could fill the book with proofs of continuity, but usually the situation is clear. 
“A function is continuous if you can draw its graph without lifting up your pen.” 
At a jump, or an infinite limit, or an infinite oscillation, there is no way across the 
discontinuity except to start again on the other side. The function x" is continuous 
for n > 0. It is not continuable for n < 0. The function x° equals 1 for every x, except 
that 0° is not defined. This time continuity requires 0° = 1. 

The interesting examples are the close ones — we have seen two of them: 


EXAMPLE 6 — and 

x 


1 - COS X 
X 


are both continuable at x = 0. 


Those were crucial for the slope of sin x. The first approaches 1 and the second 
approaches 0. Strictly speaking we must give these functions the correct values 
(1 and 0) at the limiting point x = 0 — which of course we do. 

It is important to know what happens when the denominators change to x 2 . 


EXAMPLE 7 ^ blows up but i_££L^ has the limit \ atx = 0. 

X 2 r X 2 2 

Since (sin x)/x approaches 1, dividing by another x gives a function like l/x. There 
is a simple pole. It is an example of 0/0, in which the zero from x 2 is reached more 
quickly than the zero from sin x. The “race to zero ” produces almost all interesting 
problems about limits. 
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For 1 — cos x and x 2 the race is almost even. Their ratio is 1 to 2: 

1 - cos x 1 — cos 2 x sin 2 x 1 1 

9 = — TV-* r = — , > r as x -> 0. 

x x (1+cosx) x 1+cosx 1 + 1 

This answer \ will be found again (more easily) by ‘THopital’s rule.” Here I emphasize 
not the answer but the problem. A central question of differential calculus is to know 
how fast the limit is approached. The speed of approach is exactly the information in 
the derivative. 

These three examples are all continuous at x = 0. The race is controlled by the 
slope — because f(x)—f( 0) is nearly /'( 0) times x: 

derivative of sin x is 1 sin x decreases like x 

derivative of sin 2 x is 0 <-> sin 2 x decreases faster than x 
derivative of x 1/3 is oo <-> x 1/3 decreases more slowly than x. 

DIFFERENTIABLE FUNCTIONS 

The absolute value |x| is continuous at x = 0 but has no derivative. The same is true 
for x 1/3 . Asking for a derivative is more than asking for continuity. The reason is 
fundamental, and carries us back to the key definitions: 

Continuous at x: f(x + Ax) — f(x) -> 0 as Ax 0 

fix + Ax) - fix) 

Derivative at x: ► /'(x) as Ax — ► 0. 

Ax 

In the first case, A/ goes to zero (maybe slowly). In the second case, A / goes to zero 
as fast as Ax (because A// Ax has a limit). That requirement is stronger: 


21 At a point where /(x) has a derivative, the function must be continuous. 
But f(x) can be continuous with no derivative. 


Proof The limit of A / = (Ax)(A/ /Ax) is (0 )(df jdx) = 0. So /(x + Ax) ~f(x) -> 0. 

The continuous function x 1/3 has no derivative at x = 0, because ^x~ 2/3 blows up. 
The absolute value |x| has no derivative because its slope jumps. The remarkable 
function j cos 3x + * cos 9x + ••• is continuous at all points and has a derivative at 
no points. You can draw its graph without lifting your pen (but not easily — it turns 
at every point). To most people, it belongs with space-filling curves and unmeasurable 
areas — in a box of curiosities. Fractals used to go into the same box! They are 
beautiful shapes, with boundaries that have no tangents. The theory of fractals is 
very alive, for good mathematical reasons, and we touch on it in Section 3.7. 

I hope you have a clear idea of these basic definitions of calculus: 

1 Limit (n — ► x or x -+ a) 2 Continuity (at x = a) 3 Derivative (at x = a). 

Those go back to e and 3 , but it is seldom necessary to follow them so far. In the 
same way that economics describes many transactions, or history describes many 
events, a function comes from many values /(x). A few points may be special, like 
market crashes or wars or discontinuities. At other points df/dx is the best guide to 
the function. 




2.7 Continuous Functions 


This chapter ends with two essential facts about a continuous function on a closed 
interval. The interval is a < x < b, written simply as [a, b].f At the endpoints a and 
b we require f(x) to approach f(a) and f(b). 

Extreme Value Property A continuous function on the finite interval [a, b] has a 
maximum value M and a minimum value m. There are points x max and x min in [a, b] 
where it reaches those values: 

fix ma J = M > fix) f(x min ) = m for all x in [a, b]. 

Intermediate Value Property If the number F is between f(a) and /(&), there is a 
point c between a and b where /(c) = F. Thus if F is between the minimum m and 
the maximum M, there is a point c between x min and x max where /(c) = F. 

Examples show why we require closed intervals and continuous functions. For 
0<x^l the function /(x) = x never reaches its minimum (zero). If we close the 
interval by defining /(0) = 3 (discontinuous) the minimum is still not reached. Because 
of the jump, the intermediate value F = 2 is also not reached. The idea of continuity 
was inescapable, after Cauchy defined the idea of a limit. 


2.7 EXERCISES 


Read-through questions 

Continuity requires the Q of /(x) to exist as x -> a and 
to agree with b . The reason that x/|x| is not continuous 
at x = 0 is £ . This function does have d limits. The 
reason that 1/cos x is discontinuous at e is * . The 

reason that cos(l/x) is discontinuous at x = 0 is 9 

The function /(x) = h has a simple pole at x = 3, where 
f 2 has a i pole. 

The power x" is continuous at all x provided n is i . It 
has no derivative at x = 0 when n is k , /(*) = sin (— x)/x 
approaches * as x -> 0, so this is a function pro- 

vided we define /( 0) = n . A “continuous function” must 
be continuous at all Q . A “continuable function” can be 
extended to every point x so that p . 

If / has a derivative at x = a then / is necessarily q at 
x = a. The derivative controls the speed at which /(x) 
approaches r . On a closed interval [a, h], a continuous 
/ has the s value property and the t value property. 
It reaches its £ M and its v m, and it takes on every 
value w 


In Problems 1-20, find the numbers c that make /(x) into 
(A) a continuous function and (B) a differentiable function. 
In one case /(x) -»• f(a) at every point, in the other case Af/Ax 
has a limit at every point. 


1 /w= 


f sin x x < 1 

( C X ^ 1 


2 fix) = 



X^7l 

x = n 


3 fix) = 

5 /(x) = 

7 f(x) = 

9 f(x) = 

11 fix ) = 


cx x < 0 
2cx x ^ 0 

c + x x<0 
c 2 + x 2 x ^ 0 

2x x <c 
x + 1 x^c 

(sin x)/x 2 x ^ 0 


c x #4 
1/x 3 x = 4 


|x 2 + c 


X#1 


13 f(x)={ x-1 

2 x = 1 


15 f(x) = 


(tan x)/x x # 0 
c x = 0 


x = 0 


4 /(*) = 

6 f{x) = 

8 /(*) = 

10 /(x) = 

12 f(x) = 


CX X < 1 

2cx x ^ 1 

x 3 x^c 
— 8 x = c 

x c x # 0 
0 x = 0 

X + C X <c 


1 


x>c 


x^0 


sec x x ^ 0 
x 2 — 1 


x^c 


14 fix) = { x-c 

2c x = c 


16 /(x) = 


X 2 x<c 

2x x > c 


f(c + cosx)/x x#0 

17 /(x)=^ 18/(x) = |x + c| 

0 x = 0 


+The interval [a, b] is closed (endpoints included). The interval (a, b) is 0 />en (a and b left out). 
The infinite interval [0, oo) contains all x ^ 0. 
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19 fix) = 


f (sin x — x)/x c 

i o 


x#0 
x = 0 


20/(x) = |x 2 + c 2 | 


Construct your own /(x) with these discontinuities at x = 1. 

21 Removable discontinuity 

22 Infinite oscillation 


(b) If /(x)< 7 for all x, then / reaches its maximum. 

(c) If /( 1) = 1 and /( 2) = -2, then somewhere /(x) - 0. 

(d) If /(1)=1 and /( 2) = — 2 and / is continuous on 
[1, 2], then somewhere on that interval /(x) = 0. 

36 The functions cos x and 2x are continuous. Show from 

the property that cos x = 2x at some point between 

0 and 1. 


23 Limit for x -► 1 + , no limit for x -> 1 “ 

24 A double pole 

25 lim /(x) = 4+ lim /(x) 

x-+l“ JC— ► 1 + 

26 lim /(x) = oo but lim (x — l)/(x) = 0 

jc-+l x-> 1 

27 lim (x — I )/(x) - 5 

x~* 1 


37 Show by example that these statements are false: 

(a) If a function reaches its maximum and minimum then 
the function is continuous. 

(b) If /(x) reaches its maximum and minimum and all 
values between /( 0) and /( 1), it is continuous at x = 0. 

(c) (mostly for instructors) If /(x) has the intermediate 
value property between all points a and b, it must be 
continuous. 


28 The statement “3x -> 7 as x -> 1” is false. Choose an e for 
which no S can be found. The statement “3x -» 3 as x -> 1” is 
true. For e = i choose a suitable S. 

29 How many derivatives /", ... are continuable 
functions? 

(a) / = x 3/2 (b) / = x 3/2 sin x (c) /= (sin x) 5/2 

30 Find one-sided limits at points where there is no two- 
sided limit. Give a 3-part formula for function (c). 

(a) g (b) sin |x| (c) ^ |x 2 - 1| 

31 Let /(1)=1 and /(- 1)=1 and /(x) = (x 2 -x)/(x 2 - 1) 
otherwise. Decide whether / is continuous at 

(a) x=l (b) x = 0 (c) x = — 1. 

*32 Let /(x) = x 2 sin 1/x for x#0 and /(0) = 0. If the limits 
exist, find 

(a) lim /(x) (b) df/dx atx = 0 (c) lim /'(x). 

x-»0 x -*0 

33 If /(0) = 0 and /'(0) = 3, rank these functions from 
smallest to largest as x decreases to zero: 

fix), X, xfix), fix) + lx, 2(/(x) — x), (/(x)) 2 . 

34 Create a discontinuous function fix) for which / 2 (x) is 
continuous. 

35 True or false , with an example to illustrate: 

(a) If /(x) is continuous at all x, it has a maximum 
value M. 


38 Explain with words and a graph why /(x) = x sin (1/x) is 
continuous but has no derivative at x = 0. Set/(0) - 0. 

39 Which of these functions are continuable , and why? 


/iM = 


sin x x < 0 
cos x x > 1 


fii*) = 


fj sin 1/x x < 0 
) cos 1/x x > 1 


/ 3 (x) = when sin x ^ 0 / 4 (x) = x° -f 0 x2 

40 Explain the difference between a continuous function and 
a continuable function. Are continuous functions always con- 
tinuable? 

*41 /(x) is any continuous function with /(0) =/(l). 

(a) Draw a typical /(x). Mark where /(x)=/(x + i). 

(b) Explain why g(x) =/(x + i) -/(x) has g(£) = - g(0). 

(c) Deduce from (b) that (a) is always possible: There must 
be a point where #(x) = 0 and /(x) =/(x -F i). 

42 Create an /(x) that is continuous only at x = 0. 

43 If /(x) is continuous and 0 ^/(x) ^ 1 for all x, then there 
is a point where f(x*) = x*. Explain with a graph and prove 
with the intermediate value theorem. 

44 In the e-S definition of a limit, change 0 < |x — a\ < 3 to 
\x — a\< S. Why is /(x) now continuous at x = al 

45 A function has a at x = 0 if and only if 

(/(x) -/(0))/x is at x = 0. 
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CHAPTER 3 


Applications of the Derivative 


Chapter 2 concentrated on computing derivatives. This chapter concentrates on using 
them. Our computations produced dyjdx for functions built from x R and sin x and 
cos x. Knowing the slope, and if necessary also the second derivative, we can answer 
the questions about y ~J\x) that this subject was created for: 

1. How does y change when x changes? 

2. What is the maximum value of y? Or the minimum? 

3. How can you tell a maximum from a minimum, using derivatives? 

The information in dyjdx is entirely local It tells what is happening close to the point 
and nowhere else. In Chapter 2, Ax and Ay went to zero. Now we want to get them 
back. The local information explains the larger picture, because A y is approximately 
dyjdx times Ax. 

The problem is to connect the finite to the infinitesimal — the average slope to the 
instantaneous slope. Those slopes are close, and occasionally they are equal. Points 
of equality are assured by the Mean Value Theorem — which is the local-global 
connection at the center of differential calculus. But we cannot predict where dyjdx 
equals Ay/Ax. Therefore we now find other ways to recover a function from its 
derivatives — or to estimate distance from velocity and acceleration. 

It may seem surprising that we learn about y from dyjdx. All our work has been 
going the other way! We struggled with y to squeeze out dyjdx . Now we use dyjdx 
to study y. Thafs life. Perhaps it really is life, to understand one generation from 
later generations. 



3.1 Linear Approximation 



The book started with a straight line / = vt> The distance is linear when the velocity 
is constant. As soon as v begins to change, /= vt falls apart. Which velocity do we 
choose, when u{t) is not constant? The solution is to take very short time intervals, 
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in which v is nearly constant: 

/= i't is completely false 
A/'= uAt is nearly true 
df— vdt is exactly true. 

For a brief moment the function /(t) is linear — and stays near its tangent line. 

In Section 2.3 we found the tangent line to y = f(x). At x = a, the slope of the curve 
and the slope of the line are f\a). For points on the line, start at y = /(a). Add the 
slope times the “increment” x - a: 

Y-M+rmx-a). (i) 

We write a capital Y for the line and a small y for the curve. The whole point of 
tangents is that they are close ( provided we don't move too far from a): 

y ' Y or f(x) ft J\a)+f(a)(x- a). (2) 

That is the all-purpose linear approximation. Figure 3.1 shows the square root 
function y= Jx and its tangent line at x = a= 100. At the point y = v /l00= 10, 
the slope is I /2 ^/jc = 1/20. The table beside the figure compares y(x) with Y(x). 


v 



X 

Y 

y-s* 

100 

10 

10 

102 

10.1 

10.0995 

110 

10.5 

10.49 

200 

15 

14.1 

400 

25 

20 


Hg. 3.1 Hv) is the linear approximation to yfx near x = a = 100* 


The accuracy gets worse as x departs from 100* The tangent line leaves the curve* 
The arrow points to a good approximation at 102, and at 101 it would be even better* 
In this example Y is larger than y — the straight line is above the curve. The slope of 
the line stays constant, and the slope of the curve is decreasing. Such a curve will 
soon be called '"concave downward/’ and its tangent lines are above it* 

Look again at x = 102, where the approximation is good. In Chapter 2, when we 
were approaching dyjdx, we started with Ay/Ax: 




slope ft 


Jm-Jm 

102-100 


(3) 


Now that is turned around! The slope is 1/20. What we don't know is ^102: 

7102 ft yT00 + (slope)(102- 100). (4) 


You work with what you have. Earlier we didn’t know dyldx, so we used (3). Now 
we are experts at dyjdx, and we use (4). After computing /= 1/20 once and for 
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all, the tangent line stays near yfx for every number near 100. When that nearby 
number is 100 -I- Ax, notice the error as the approximation is squared: 

(yioo + = 100 + Ax + (Ax) 2 . 

The desired answer is 100 + Ax, and we are off by the last term involving (Ax) 2 , The 
whole point of linear approximation is to ignore every term after Ax, 

There is nothing magic about x = 100, except that it has a nice square root. Other 
points and other functions allow y 1 would like to express this same idea in 
different symbols. Instead of starting from a and going to x, we start from x and go a 
distance Ax to x + Ax. The letters are different but the mathematics is identical. 


3A At any point x, and for any smooth function y =/(x), 

1 . A x + Ax) ~fix) 

slope at x a --- 

Ax 


(5) 


For the approximation to f{x + Ax\ multiply both sides by Ax and add /(x): 

fix + Ax) « fix) + (slope at x)(Ax). (6) 


EXAMPLE 1 An important linear approximation: (1 + x) n ft 1 + nx for x near zero. 


EXAMPLE 2 A second important approximation: 1/(1 + xf ft 1 — nx for x near zero. 

Discussion Those are really the same. By changing n to — n in Example 1 , it becomes 
Example 2. These are linear approximations using the slopes n and — n at x = 0: 

(1 + x) n ft 1 + (slope at zero) times (x — 0) = 1 + nx. 

Here is the same thing with / (x) = x a . The basepoint in equation (6) is now 1 or x: 

(1 H- Axf ft 1 + nAx (x + Ax) ft ft x n + nx n ~ 1 Ax, 

Better than that, here are numbers. For n — 3 and —1 and 100, take Ax = ,01: 


(l,01) 3 ftl.03 




ft 2 


Actually that last number is no good. The 100th power is too much. Linear approxi- 
mation gives 1 + lOOAx = 2, but a calculator gives (l.Ol) 100 = 2.7... . This is close to 
e , the all-important number in Chapter 6. The binomial formula shows why the 
approximation failed: 

(1 + Ax) 100 = 1 + 100 Ax + (1 ( ^ 9 ) (A-x ) 2 + 

Linear approximation forgets the (Ax) 2 term. For Ax = 1/100 that error is nearly 7, 
It is too big to overlook. The exact error is j(Ax) 2 / w (c), where the Mean Value 
Theorem in Section 3.8 places c between x and x + Ax. You already see the point: 

y — Y is of order (Ax ) 2 . Linear approximation t quadratic error , 


DIFFERENTIALS 

There is one more notation for this linear approximation. It has to be presented, 
because it is often used. The notation is suggestive and confusing at the same time — 
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it keeps the same symbols dx and dy that appear in the derivative. Earlier we took 
great pains to emphasize that dy/dx is not an ordinary fraction.! Until this paragraph, 
dx and dy have had no independent meaning. Now they become separate variables, 
like x and y but with their own names. These quantities dx and dy are called 
differentials. 

The symbols dx and dy measure changes along the tangent line . They do for the 
approximation F(x) exactly what Ax and Ay did for y(x). Thus dx and Ax both 
measure distance across. 

Figure 3.2 has Ax = dx. But the change in y does not equal the change in Y. One 
is Ay (exact for the function). The other is dy (exact for the tangent line). The 
differential dy is equal to A Y, the change along ike Tangent line. Where Ay is the true 
change, dy is its linear approximation (dyjdx)dx> 

You often see dy written as f\x)dx. 



Ay = change in y (along curve) 
dy - change in Y (along tangent) 

Rg.3.2 The linear approximation to A y is 

dy=f'(x)dx. 


EXAMPLE 3 y = x 2 has dy/dx = 2x so dy = 2x dx . The table has basepoint x = 2. 
The prediction dy differs from the true Ay by exactly (Ax) 2 = .01 and .04 and .09. 



dx 

dy 

Ax 



y- x 2 

A 

0.4 

.1 

0.41 

Ay = (2 + Ax) 2 - 2 2 

dy = 4 dx 

.2 

0.8 

.2 

0.84 

Ay = 4Ax + (Ax) 2 


.3 

1.2 

.3 

1.29 



The differential dy— f\x)dx is consistent with the derivative dy/dx We 
finally have dy = (dy/dx)dx, but this is not as obvious as it seems! It looks like 
cancellation — it is really a definition. Entirely new symbols could be used, but dx 
and dy have two advantages: They suggest small steps and they satisfy dy=f(x)dx> 
Here are three examples and three rules: 

d(x n ) = nx" ’ l dx d(f + g) = df+ dg 

d { sin x) = cos x dx d(cf) — cdf 

d(tan x) = sec 2 x dx d(fg) -fdg + g df 

Science and engineering and virtually all applications of mathematics depend on 
linear approximation. The true function is “ linearized using its slope v: 

Increasing the time by At increases the distance by ^ vAt 

Increasing the force by A / increases the deflection by ^ vAf 

Increasing the production by Ap increases its value by % vAp. 


fFraction or not, it is absolutely forbidden to cancel the d's. 
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The goal of dynamics or statics or economics is to predict this multiplier v — the 
derivative that equals the slope of the tangent line. The multiplier gives a local 
prediction of the change in the function. The exact law is nonlinear — but Ohm’s law 
and Hooke’s law and Newton’s law are linear approximations. 


ABSOLUTE CHANGE, RELATIVE CHANGE, PERCENTAGE CHANGE 


The change Ay or Af can be measured in three ways. So can Ax: 


Absolute change 

Af 

Ax 

Relative change 

Af 

Ax) 

Ax 

X 

Percentage change 

^ x 100 

Ax) 

— x 100 

X 


Relative change is often more realistic than absolute change. If we know the distance 
to the moon within three miles, that is more impressive than knowing our own height 
within one inch. Absolutely, one inch is closer than three miles. Relatively, three miles 
is much closer: 


3 miles 1 inch 

< 

300,000 miles 70 inches 


or .001% <1.4%. 


EXAMPLE 4 The radius of the Earth is within 80 miles of r = 4000 miles. 

(a) Find the variation dV in the volume V = %nr 3 , using linear approximation. 

(b) Compute the relative variations dr/r and dV/V and AVj V 

Solution The job of calculus is to produce the derivative. After dV/dr — Anr 2 , its 
work is done. The variation in volume is dV = 4tt ( 4000) 2 (80) cubic miles. A 2% 
relative variation in r gives a 6% relative variation in V: 

dr = _80_ = dV_ 47r(4000f(80) 

r 4000 Z/0 V 4*(4000)V3 

Without calculus we need the exact volume at r = 4000 + 80 (also at r = 3920): 

AV_ 4;t(4080) 3 /3 — 4xr{4000) 3 / 3 
V ~ 4*r(4000) 3 /3 * ’ ° 

One comment on dV= An r 2 dr. This is (area of sphere) times (change in radius). It is 
the volume of a thin shell around the sphere. The shell is added when the radius 
grows by dr. The exact AV/V is 3917312/640000%, but calculus just calls it 6%. 


3.1 EXERCISES 


Read-through questions 

On the graph, a linear approximation is given by the o 
line. At x = a, the equation for that line is Y —fia) + b , 

Near x = a — 10, the linear approximation to y = x 3 is Y = 
1000 + c . At x = 1 1 the exact value is (1 l) 3 = ' d . The 
approximation is Y = • . In this case Ay = t and 

dy — q If we know sin x, then to estimate sin(x + Ax) we 

add h , 


In terms of x and Ax, linear approximation is 
fix + Ax) zzfjx) + l The error is of order (Ax)? or 

(x — of with p = 1 The differential dy equals fc 

times the differential 1 . Those movements are along the 

m line, where Ay is along the n 

Find the linear approximation Y to y ^/[x) near x = a\ 

1 fix) = x + X 4 , a = 0 2 fix) = l/*> a = 2 
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3 Applications of the Derivative 


3 f\x) = tan \\ a = tt/4 4 /(x) j= sin x, <j n! 2 

5 /i v) = a- sin x, a = 2 a 6 j[x) ^ $in 2 x. a = 0 

Compute 7—12 within .0] by deciding on /{a), choosing the 
basepoint a. and evaluating f(a) A-f(a )(* — a). A calculator 
shows the error, 

7 {2.001/ H sin(,02) 

9 cos{.03) 10 (15.99) 1 4 

11 1 / .98 12 sin(3.14) 

Calculate the numerical error in these linear approximations 
and compare with J(Ax) 2 /"(x): 

13 (1.01/ * I + 3(,0l) 14 cos(.Ol)* 1 +0( 01) 

15 (sin Ol) 2 ^ 0 + 0(.01) 16(1.01) 3 ^ 1 — 3{,01> 

17 (l +T y ,0 *2 18 ^/^99 ^3 

Confirm the approximations 19-21 by computing /'(0): 

19 

20 J/ V /I — a 2 ^ 1 + 4x 2 (use / = l/ v 1 — then put u = x 2 ) 

21 v V 2 | x 2 ^rc + | — (use f(u) = yje 2 + «, then put a = v 2 ) 

22 Write down the differentials df for /fx) = cosx and 
(a + l)/(.v — 1) and (a 2 + l) 2 


In 23-27 find the linear change dV in the volume or dA in the 
surface area 

23 dV if the sides of a cube change from 10 to 10.1. 

24 dA if the sides of a cube change from x to x + dx. 

25 dA if the radius of a sphere changes by dr. 

26 dV if a circular cylinder with r-2 changes height from 3 
to 3,05 (recall V=i ir 2 h). 

27 dV if a cylinder of height 3 changes from r - 2 to r = 1.9. 
Extra credit: What is dV if r and h both change ( dr and dhf} 

28 In relativity the mass is — {r/c) 2 at velocity v. By 

Problem 20 this is near + for small i\ Show that 

the kinetic energy [tnv 2 and the change in mass satisfy 
Einstein's equation e = (Anr)c 2 . 

29 Enter 1. 1 on your calculator. Press the square root key 5 

times (slowlyl. What happens each ti me to the number after 
the decimal point? This is because v ; 1 + .v ~ . 

30 In Problem 29 the numbers you see a re less than LOS, 

1.025, ... The second derivative of v - 1 + x is . so the 

linear approximation is higher than the curve. 

31 Enter 0.9 on your calculator and press the square root 
key A times Predict what will appear the fifth time and press 

again. You now have the root of 0.9. How many 

decimals agree with 1 jt(0. 1)? 


3.2 Maximum and Minimum Problems 


Our goal is (o learn about f(x) from dfidx. We begin with two quick questions. 
If dfidx is positive, what docs that say about/? If the slope is negative, how is that 
reflected in the function? Then the third question is the critical one: 

How do you identify a maximum or minimum ? Normal answer: The slope is zero. 

This may be the most important application of calculus, to reach dfidx - 0, 

Take the easy questions first. Suppose dfidx is positive for every v between a and h . 
All tangent lines slope upward. The function J(x) is increasing as x goes from a to h. 


3B If df/dx > 0 then f(x) is increasing. If dfidx < 0 then f(x) is decreasing. 


To define increasing and decreasing, look at any two points a < X . "Increasing" 
requires /{ a) < f(X ). "Decreasing" requires /(x) >/(A'). A positive slope does not mean 
a positive function , The function itself can be positive or negative. 

EXAMPLE 1 f(x) = x 2 - 2x has slope 2.x - 2. This slope is positive when x > J and 
negative when v< 1. The function increases after a = 1 and decreases before a = 1. 
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Fig. 3.3 Slopes are — I-. Slope is H h — h so / is up-down- up-down -up. 

We say that without computing /(x) at any point! The parabola in Figure 3.3 goes 
down to its minimum at x = 1 and up again, 

EXAMPLE 2 x 2 - 2x + 5 has the same slope, its graph is shifted up by 5 1 a number 
that disappears in dfjdx. All functions with slope 2x - 2 are parabolas x 2 - 2x + C, 
shifted up or down according to C. Some parabolas cross the a axis (those crossings 
are solutions to f(x) = 0). Other parabolas stay above the axis. The solutions to 
x 2 — 2x + 5 = 0 are complex numbers and we don't see them. The special parabola 
x 2 - 2x + 1 = - l) 2 grazes the axis at x = T It has a '‘double zero/ 1 where /(x) = 

dfjdx = 0. 

EXAMPLE 3 Suppose dfjdx = (x - l)(x - 2)(x - 3)(.v - 4). This slope is positive 
beyond x = 4 and up to x= 1 ( dfjdx = 24 at x = 0). And dfjdx is positive again 
between 2 and 3. At .x = 1, 2, 3, 4, this slope is zero and f(x) changes direction. 

Here/(x) is a Mth-degree polynomial, because /'(x) is fourth-degree. The graph of 
/ goes up-down-up-down-up, It might cross the x axis hve times. It must cross 
at least once (like this one). When complex numbers are allowed, every fifth-degree 
polynomial has five roots. 

You may feel that "positive slope implies increasing function" is obvious — perhaps 
it is. But there is still something delicate. Starting from dfjdx > Oat every single point, 
we have to deduce f[X) > /fx) at pairs of points. That is a '"local to global 11 question, 
to be handled by the Mean Value Theorem, It could also wait for the Fundamental 
Theorem of Calculus: The difference f{X ) —f(x) equals the area under the graph of 
dfjdx. That area is positive, so f(X) exceeds /(x), 

MAXIMA AND MINIMA 

Which x makes /(x) as large as possible? Where is the smallcst/(x)? Without calculus 
we are reduced to computing values of f(x) and comparing. With calculus, the infor- 
mation is in dfjdx. 

Suppose the maximum or minimum is at a particular point x. It is possible that 
the graph has a corner — and no derivative. But if dfidx exists^ it must be zero. The 
tangent line is level. The parabolas in Figure 3.3 change from decreasing to increasing. 
The slope changes from negative to positive. At this crucial point the slope is zero * 



98 


3 Applications of the Derivative 


3C Locai Maximum or Minimum Suppose the maximum or minimum 
occurs at a point x inside an interval where /(x) and dfjdx are defined. Then 
f'{x) = 0, 


The word “local" allows the possibility that in other intervals, /(x) goes higher or 
lower. We only look near x , and we use the definition of dfjdx. 

Start with f(x + Ax) — /(x). If /(x) is the maximum, this difference is negative or 
zero. The step Ax can be forward or backward: 


if Ax > 0: 

fix + Ax) -Ax) 

negative 
= — ^ ^ 0 

and in the limit 

df 


Ax 

positive 


(lx 

if Ax < 0: 

f{x + Ax) -/(x) 

negative 

= , 3*0 

and in the limit 

df 


Ax 

negative 


dx 


Both arguments apply. Both conclusions dfjdx ^0 and df/dx^ 0 are correct. Thus 
dfjdx = 0, 

Maybe Richard Feynman said it best. He showed his friends a plastic curve that 
was made in a special way — “no matter how you turn it , the tangent at the lowest 
point is horizontal" They checked it out. It was true. 

Surely You're Joking , Mr, Feynmanl is a good book (but rough on mathematicians). 

EXAMPLE 3 (continued) Look back at Figure 3.3b. The points that stand out 
are not the “ups 1 ’ or “downs” but the “turns.” Those are stationary points , where 
dfjdx = 0. We see two maxima and two minima. None of them are absolute maxima 
or minima, because /(x) starts at — x and ends at 4- x. 

EXAMPLE 4 f{x) = 4x 3 “ 3x 4 has slope 12x 2 - 12x 3 . That derivative is zero when 
x 2 equals x 3 , at the two points x = 0 and x= L To decide between minimum and 
maximum (local or absolute), the first step is to evaluate f(x ) at these stationary points. 
We find /(Q) = 0 and /( 1 ) — 1 . 

Now look at large x. The function goes down to - x in both directions. (You can 
mentally substitute x = 1000 and x = - 1000). For large x, - 3x 4 dominates 4x 3 . 

Conclusion /= 1 is an absolute maximum, /= 0 is not a maximum or minimum 
(local or absolute). We have to recognize this exceptional possibility, that a curve (or 
a car) can pause for an instant {/' = 0) and continue in the same direction. The reason 
is the “double zero” in 12x 2 — 12x 3 , from its double factor x 2 . 




3.2 Maximum and Minimum Problem* 
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EXAMPLE 5 Define J{x) = x + x~ l for x > 0, Its derivative 1 - 1/x 2 is zero at x = L 
At that point 7U)- 2 is the minimum value. Every combination like 3 + 3 or f + f 
is larger than / min = 2, Figure 3,4 shows that the maximum of x + x“ 1 is + oo.f 

Important The maximum always occurs at a stationary point (where dfjdx = 0) or a 
rough point (no derivative) or an endpoint of the domain. These are the three types 
of critical points. All maxima and minima occur at critical points! At every other 
point dfjdx > 0 or dfjdx < 0, Here is the procedure: 

1 , Solve dfjdx — 0 to find the stationary points J[x). 

2, Compute^*) at every critical point — stationary point, rough point , endpoint. 

3, Take the maximum and minimum of those critical values of^x), 

EXAMPLE 6 ( Absolute value j{x) = {x\) The minimum is zero at a rough point. The 
maximum is at an endpoint. There are no stationary points. 

The derivative of y = |x| is never zero. Figure 3,4 shows the maximum and mini- 
mum on the interval [“3, 2], This is typical of piecewise linear functions. 

Question Could the minimum be zero when the function never reaches Tfa) = 0? 
Answer Yes,f(x) = 1/(1 + x) 2 approaches but never reaches zero as x -► oo. 

Remark 1 x -* + oo and /(x) -► ± oo are avoided when / is continuous on a closed 
interval a < x ^ fc. Then^jc) reaches its maximum and its minimum (Extreme Value 
Theorem). But x -+ oo and f(x) -► oo are too important to rule out. You test x -*■ oo 
by considering large x . You recognize^*) oo by going above every finite value. 

Remark 2 Note the difference between critical points (specified by x) and critical 
values (specified by/(x)). The example x + x “ 1 had the minimum point x = 1 and the 
minimum value J[ 1 ) = 2 . 

MAXIMUM AND MINIMUM IN APPLICATIONS 

To find a maximum or minimum, solve f f (x) — 0. The slope is zero at the top and 
bottom of the graph. The idea is clear — and then check rough points and endpoints. 
But to be honest, that is not where the problem starts. 

In a rearapplication, the first step (often the hardest) is to choose the unknown 
and find the function. It is we ourselves who decide on x and^/W- The equation 
dfjdx - 0 comes in the middle of the problem, not at the beginning. I will start on 
a new example, with a question instead of a function. 

EXAMPLE 7 Where should you get onto an expressway for minimum driving time, 
if the expressway speed is 60 mph and ordinary driving speed is 30 mph? 

I know this problem well — it comes up every morning. The Mass Pike goes to MIT 
and I have to join it somewhere. There is an entrance near Route 128 and another 
entrance further in. I used to take the second one, now I take the first. Mathematics 
should decide which is faster — some mornings I think they are maxima. 

Most models are simplified, to focus on the key idea. We will allow the expressway 
to be entered at any point x (Figure 3.5). Instead of two entrances (a discrete problem) 


tA good word is approach when /(jc) oo. Infinity is not reached. But 1 still say "the maximum 
is x.” 
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3 Applications of the Derivative 


we have a continuous choice (a calculus problem). The trip has two parts, at speeds 
30 and 60: 


a distance x /a 2 + x 2 up to the expressway, in v /'a 2 + x 2 /30 hours 
a distance b x on the expressway, in (b — x)/60 hours 

Problem Minimize /(x) = total time = — v tr + x 2 + -^{b - x). 

We have the function /(x)* Now comes calculus. The first term uses the power rule: 
The derivative of u l:1 is ju~ lfl du;dx. Here u = a 2 + x 2 has duidx = 2x: 

(1) 

To solve /'(x) = 0, multiply by 60 and square both sides: 

(a 2 + x 2 )' 1; ' 2 {2x} = 1 gives 2 x = (a 2 ~\~ x 2 ) 1 2 and 4 x 2 = a 2 -\-x 2 . (2) 

Thus 3x 2 = a 2 . This yields two candidates, x=ajy / 3 and x=-a/ v / 3. But a 
negative x^vould mean useless driving on the expressway. In fact /' is not zero at 
x = — a,\/ 3, That false root entered when we squared 2x. 





Fig. 3.5 Join the freeway at v minimize the driving time f[x). 


\ notice something surprising. The stationary point x = a ; \/3 does not depend on 
b. The total time includes the constant hi 60, which disappeared in dfidx. Somehow 
b must enter the answer, and this is a warning to go carefully. The minimum might 
occur at a rough point or an endpoint. Those are the other critical points of f and 
our drawing may not be realistic. Certainly we expect x ^ b , or we are entering the 
expressway beyond MIT. 

Continue with calculus. Compute the driving time /(a) for an entrance at 
x* = q/y'3: 

The square root of 4a 2 _ 3 is 2n/ v /3, We combined 2/30 — 1/60= 3 60 and divided 
by v ' 3. Is this stationary value /* a minimum 7 You must look also at endpoints'. 

enter at x = 0: travel time is a 30 + h 60 -/** 

enter at \ = b: travel time is v a 1 4 - h 2 30 =/***, 
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The comparison /*</** should be automatic. Entering at x = 0 was a candidate 
and calculus didn't choose it. The derivative is not zero at x = 0. It is not smart to 
go perpendicular to the expressway. 

The second comparison has x = b. We drive directly to MIT at speed 30. This 
option has to be taken seriously. In fact it is optimal when b is small or a is large. 

This choice x — b can arise mathematically in two ways. If all entrances are between 
0 and b , then b is an endpoint. If we can enter beyond MIT, then b is a rough point. 
The graph in Figure 3.5c has a corner at x = b, where the derivative jumps. The 
reason is that distance on the expressway is the absolute value \b - x| — never negative. 
Either way x = b is a critical point. The optimal x is the smaller of aj^fl and b> 

if ajyj 3 ^ b: stationary point wins, enter at x = a\J 3, total time /* 

if a\J 3 ^ b: no stationary point, drive directly to MIT, time /*** 

The heart of this subject is in “word problems/’ All the calculus is in a few lines, 
computing/' and solving /'(x) = 0. The formulation took longer. Step 1 usually does; 

1. Express the quantity to be minimized or maximized as a function f(x ). 

The variable x has to be selected. 

2. Compute f ix), solve //*) = 0, check critical points for / min and / ma *. 

A picture of the problem (and the graph of /(*)) makes all the difference. 


EXAMPLE 7 (continued) Choose x as an angle instead of a distance. Figure 3.6 
shows the triangle with angle x and side a , The driving distance to the expressway is 
a sec x. The distance on the expressway is b — a tan x. Dividing by the speeds 30 and 
60, the driving time has a nice form: 


fix) = total 


time = 


a sec x 
30 


b - a tan x 
60 


( 3 ) 


The derivatives of sec x and tan x go into df/dx: 

df a a , 

-j- = — sec x tan x — — see x, 
dx 30 60 

Now set dfjdx = 0, divide by a, and multiply by 30 cos 2 x; 


( 4 ) 


sin x = {. (5) 

This answer is beautiful. The angle x is 30°! That optimal angle (jc/6 radians) has 
sin x = t. The triangle with side a and h ypotenu se a/y 3 is a 30 60 90 right triangle. 

I don’t know whether you prefer s Ja 1 + x 2 or trigonometry. The minimum is 
exactly as before — either at 30° or going directly to MIT. 
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3 Applications of the Derivative 


EXAMPLE 8 In mechanics, nature chooses minimum energy. A spring is pulled down 
by a mass, the energy is fix), and dfjdx = 0 gives equilibrium. It is a philosophical 
question why so many laws of physics involve minimum energy or minimum lime — 
which makes the mathematics easy. 

The energy has two terms — for the spring and the mass. The spring energy is 
\kx 2 — positive in stretching (x> 0 is downward) and also positive in compression 
(x < 0). The potential energy of the mass is taken as — mx — decreasing as the mass 
goes down. The balance is at the minimum of f(x } = \kx l - mx< 

I apologize for giving you such a small problem, but it makes a crucial point. 
When /(x) is quadratic , the equilibrium equation df/dx = 0 is linear. 

dfjdx = kx - m = 0. 

Graphically, x = mjk is at the bottom of the parabola. Physically, kx = m is a balance 
of forces — the spring force against the weight. Hooke's law for the spring force is 
elastic constant fc times displacement x. 

EXAMPLE 9 Derivative of cost — marginal cost (our first management example). 

The paper to print x copies of this book might cost C= 1000 + 3x dollars. The 
derivative is dCjdx = 3. This is the marginal cost of paper for each additional book. 
If x increases by one book, the cost C increases by S3. The marginal cost is like the 
velocity and the total cost is like the distance. 

Marginal cost is in dollars per book , Total cost is in dollars. On the plus side, the 
income is /(x) and the marginal income is dlfdx. To apply calculus, we overlook the 
restriction to whole numbers. 

Suppose the number of books increases by dx.t The cost goes up by (dCjdx) dx> 
The income goes up by (dljdx)dx, If we skip all other costs, then profit P[x) = 
income I(x) — cost C(x). In most cases P increases to a maximum and falls back. 

At the high point on the profit curve, the marginal profit is zero: 

dPulx — 0 or dldx = dCdx, (6) 

Profit is maximized when marginal income I equals marginal cost C. 

This basic rule of economics comes directly from calculus, and we give an example: 

C(x) = cost of x advertisements - 900 + 400x - x 2 

setup cost 900, print cost 4Q0x, volume savings x 2 

l(x) = income due to x advertisements = 600x — fix 2 

sales 600 per advertisement, subtract fix 2 for diminishing returns 

optimal decision dCjdx = dljdx or 400 - 2x = 600 — 12x or x = 20 
profit = income — cost = 9600 - 8500 — 1 100. 

The next section shows how to verify that this profit is a maximum not a minimum. 
The first exercises ask you to solve dfjdx = 0. Later exercises also look for fix). 


tMaybe dx is a differential calculus book, I apologize for that. 
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3.2 EXERCISES 


Read-through questions 

If dfjdx > 0 in an interval then /lx) is o , If a maximum 
or minimum occurs at x then f'(x) — b Points where 
f\x ) = 0 are called c points. The function/lx) = 3x 2 — x 
has a (minim um)(maximum) at x = d , A stationary point 
that is not a maximum or minimum occurs for/(x) = e . 

Extreme values can also occur where f is not defined 
or at the a of the domain. The minima of |x| and 5x for 
— 2 < x ^ 2 are at x= h and x = I even though 
dfjdx is not zero, x* is an absolute 1 when/(x*)>/lx) 
for all x, A * minimum occurs when /lx*) </(x) for 
all x near x*. 

The minimum of $ax 2 — bx is l at x = m , 

Find the stationary points and rough points and endpoints. 
Decide whether each point is a local or absolute minimum nr 
maximum. 

1 f[x) = x 2 + 4x + 5, — oo < x < oo 

2 f{x) = x 3 — I2x, — oo < x < oo 
3.flx) = x* + 3, -Ujc<4 
4f[x) = x 2 + (2lx), 1 

5 /W -(*-**)*. 1 

6 /(x) = l/{x — x 2 ), 0 < x < 1 

7 y?x) = 3x 4 + 8x 3 — 18x 2 , — oo < x < ao 

8 y?x) = {x J — 4x for 0 ^ x ^ 1, x 1 — 4 for x < 2} 

9 j{x) = y/x- 1 + y/9-x, 1 ^ x < 9 

10 f[x) = x + sin x, 0 ^ x < 2n 

11 /W = x 3 (l — x) 6 , — oo < x < oo 

12 /(x) = x/(l + x), 0 ^ x < 100 

13 f[x) = distance from x > 0 to nearest whole number 

14 f(x) — distance from x ^ 0 to nearest prime number 

15 f{x) = |x + 1 1 -h |x — 1 1, —3 ^ x ^ 2 

16 f(x) = Xyfl - x 2 , 

17/(x) = x 1/2 -x 3 ' 2 ,0^x^4 
18 f[x) — sin x + cos x, 0 a£ x ^ In 
19 Z(x) = x + sin x, 0<x 

20 f[B) = cos 2 0 sin 6, —n^B^n 

21 f[B) = 4 sin B - 3 cos B, 0 ^ B ^ 2* 

22 f[x) = (x 2 + 1 for x ^ 1, x 2 — 4x + 5 for x ^ 1}, 


In applied problems, choose metric units if you prefer. 

23 The airlines accept a box if length + width + height = 

l + w + h < 62" or 158 cm. If h is fixed show that the maxi- 
mum volume (62-w-h)wh is V=k(3l — }A) 2 , Choose h to 
maximize V. The box with greatest volume is a . 

24 If a patient's pulse measures 70, then 80, then 120, what 
least squares value minimizes (x — 70) 2 + (x — 80) 2 + 
(x- 120) 2 ? If the patient got nervous, assign 120 a lower 
weight and minimize (x — 70) 2 + (x — 8G) 2 + i(x — 120) 2 , 

25 At speed u, a truck uses av + (bjv) gallons of fuel per mile. 
How many miles per gallon at speed u? Minimize the fuel 
consumption. Maximize the number of miles per gallon. 

26 A limousine gets (120 — 2u)/5 miles per gallon. The 
chauffeur costs 510/hour, the gas costs Sl/gallon. 

(a) Find the cost per mile at speed v. 

(b) Find the cheapest driving speed, 

27 You should shoot a basketball at the angle B requiring 
minimum speed. Avoid line drives and rainbows. Shooting 
from (0,0) with the basket at (n,h), minimize f[B) = 
l/(asin 0cos $ — hcos 2 0). 

(a) If b — 0 you are level with the basket. Show that 
9 — 45° is best (Jabbar sky hook), 

(b) Reduce dffdB — 0 to tan 2 B-— ajb. Solve when a — b, 

(c) Estimate the best angle for a free throw. 

The same angle allows the largest margin of error (Sports 
Science by Peter Brancazio). Section 12,2 gives the fiight path. 

28 On the longest and shortest days, in June and December, 
why does the length of day change the least? 

29 Find the shortest Y connecting P t Q, and B in the figure. 
Originally B was a birdfeeder. The length of Y is L(x)“ 
(b - x) + 2 yfa 2 + x 2 . 

(a) Choose x to minimize L (not allowing x > h), 

(b) Show that the center of the Y has 120 11 angles, 

(c) The best Y becomes a V when ajb = , 




30 If the distance function is /(f) = (1 4 - 3t)/(l + 3f 2 ), when 
does the forward motion end? How far have you traveled? 
Extra credit: Graph /(r) and dfjdt 
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3 Applications of the Derivative 


In 31-34, we make and sell x pizzas. The income is R(x) = 
ax + bx 2 and the cost is C(x) = c ■ + dx + ex 2 . 

31 The profit is TTfx) = . The average profit per 

pizza is = . The marginal profit per additional pizza 

is dnidx = , We should maximize the 

(profit} (average profit) (marginal profit), 

32 We receive R(x) = ax -\- bx 2 when the price per pizza is 

p(x) — In reverse: When the price is p we sell x = 

pizzas (a function of p), We expect b< 0 because 


33 Find x to maximize the profit n(x). At that x the marginal 

profit is dnjdx = , 

34 Figure B shows ft(x) = 3x — x 2 and C!(x)=l+x 2 and 

C 2 (x) = 2 + x 2 , With cost C u which sales x makes a profit? 
Which x makes the most profit? With higher fixed cost in C 2 , 
the best plan is . 


The cookie box and popcorn box were created by Kay Dundas 
from a 1 2" x 1 2" square. A box with no top is a calculus classic. 



35 Choose x to find the maximum volume of the cookie box. 

36 Choose x to maximize the volume of the popcorn box. 

37 A high-class chocolate box adds a strip of width x down 
across the front of the cookie box. Find the new volume V(x ) 
and the x that maximizes it. Extra credit: Show that V miiX is 
reduced by more than 20%. 

38 For a box with no top, cut four squares of side x from the 
corners of the 12" square. Fold up the sides so the height is 
x. Maximize the volume. 


40 A fixed wall makes one side of a rectangle. We have 200 
feet of fence for the other three sides. Maximize the area A in 
4 steps: 

1 Draw a picture of the situation, 

2 Select one unknown quantity as x (but not Al ), 

3 Find all other quantities in terms of x. 

4 Solve dAjdx = 0 and check endpoints. 

41 With no fixed wall, the sides of the rectangle satisfy 
2x + 2y = 200. Maximize the area. Compare with the area of 
a circle using the same fencing. 

42 Add 200 meters of fence to an existing straight 100-meter 
fence, to make a rectangle of maximum area (invented by 
Professor Klee), 

43 How large a rectangle fits into the triangle with sides 
x = 0, y = 0, and x/4 + yj 6 = 1? Find the point on this third 
side that maximizes the area xy. 

44 The largest rectangle in Problem 43 may not sit straight 
up. Put one side along x/4 + yj6 = 1 and maximize the area. 

45 The distance around the rectangle in Problem 43 is 
P = 2x + 2>> Substitute for y to find F(x), Which rectangle 
has P max = 12? 

46 Find the right circular cylinder of largest volume that fits 
in a sphere of radius 1, 

47 How large a cylinder fits in a cone that has base radius ft 
and height ff? For the cylinder, choose r and h on the sloping 
surface r/ft + h!H — 1 to maximize the volume V = nr 2 h. 

48 The cylinder in Problem 47 has side area A = Inrh. 
Maximize A instead of V, 

49 Including top and bottom, the cylinder has area 

A - Inrh + 2;rr 2 - 2nrH{\ - (r/ft}} + Inr 2 . 

Maximize A when H > ft. Maximize A when ft > H. 

*50 A wall 8 feet high is 1 foot from a house. Find the length 
L of the shortest ladder over the wall to the house. Draw a 
triangle with height v\ base 1 + x, and hypotenuse L , 

51 Find the closed cylinder of volume V = nr 2 h = \6n that 
has the least surface area. 

52 Draw a kite that has a triangle with sides l, 1, 2x next to 
a triangle with sides 2x, 2, 2. Find the a rea A a nd the x that 
maximizes it. Hint: In dAjdx simplify 1 — x 2 - x 2 /^/ 1 — x 2 
to (l -2x 2 )lJ] -x 2 . 


Geometry provides many problems, more applied than they 
seem. 

39 A wire four feet long is cut in two pieces. One piece forms 
a circle of radius r, the other forms a square of side x. Choose 
r to minimize the sum of their areas. Then choose r to 
maximize. 


In 53-56, x and y are nonnegative numbers with x + y = 1 0. 
Maximize and minimize: 

53 xy 54 x 2 + y 2 55 v — (1/x) 56 sin x sin y 

57 Find the total distance/(x) from A to X to C. Show that 
dfidx = 0 leads to sin a = sin c. Light reflects at an equal angle 
to minimize travel time. 
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58 Fermat’s principle says that light travels from A to B on 
the quickest path. Its velocity above the x axis is v and below 
the x axis is tv. 

(a) Find the time Tfx) from A to X to B. On AX , time = 
distance/ velocity = ^ Jr 2 + x 2 /v. 

(b) Find the equation for the minimizing x. 

(c) Deduce SnelTs law (sin a)jv - (sin b)jw. 


“Closest point problems” are models for many applications. 

59 Where is the parabola y = x 2 closest to x = 0, y = 2? 

60 Where is the line y = 5 - 2x closest to (0, 0)? 

61 What point on y= — x 2 is closest to what point on 
y = 5 — 2x? At the nearest points, the graphs have the same 
slope. Sketch the graphs. 

62 Where is y = x 2 closest to (0, i)? Minimizing 
x 2 + (y — i) 2 4 y + (y “ i) 2 gives y < 0. What went wrong? 

63 Draw the lyie y — mx passing near (2, 3), (1, 1), and {—I, 1). 
For a least squares fit, minimize 

(3 — 2m) 2 + (1 - m) 2 + (1 + m) 2 . 


64 A triangle has comers (—1, l), (x, x 2 ), and (3, 9) on the 
parabola y = x 2 . Find its maximum area for x between —1 
and 3. Hint: The distance from (.Y, Y) to the line y = mx + b 
is |Y" — mX — b\jyfl + m 2 . 

65 Submarines are located at (2* 0) and (1, 1). Choose the 
slope m so the line y — mx goes between the submarines but 
stays as far as possible from the nearest one. 

Problems 66-72 go back to the theory. 

66 To find where the graph of j^x) has greatest slope, solve 

. For y = 1/(1 + x 2 ) this point is . 

67 When the difference between /(x) and g(x) is smallest, their 

slopes are . Show this point on the graphs of 

f=2 -bx 2 and g = 2x -x 2 . 

68 Suppose y is fixed. The minimum of x 2 + xy — y 1 (a func- 
tion of x) is niy) = . Find the maximum of m(y). 

Now x is fixed. The maximum of x 2 + xy — y 2 (a function 
of y) is M(x) = . Find the minimum of M(x). 

69 For each m the minimum value of f[x) — mx occurs at x = 
m. What is _ftx)? 

70 y — x + 2x 2 sin(l/x) has slope 1 at x = 0. But show that y 
is not increasing on an interval around x = 0, by finding points 
where dyjdx =1—2 cos(l/x) + 4x sin(l/x) is negative. 

71 True or false, with a reason: Between two local minima of 
a smooth function /(x) there is a local maximum. 

72 Create a function y(x) that has its maximum at a rough 
point and its minimum at an endpoint. 

73 Draw a circular pool with a lifeguard on one side and 
a drowner on the opposite side. The lifeguard swims with 
velocity v and runs around the rest of the pool with velocity 
w — 10r. If the swim direction is at angle $ with the direct 
line, choose 0 to minimize and maximize the arrival time. 
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When f{x) is positive, f(x) is increasing. When dyjdx is negative, y(x) is decreasing. 
That is clear, but what about the second derivative? From looking at the curve, 
can you decide the sign of f”(x) or d 2 y/dx 2 l The answer is yes and the key is in the 

bending, 

A straight line doesn’t bend. The slope of y = mx + b is m (a constant). The second 
derivative is zero. We have to go to curves, to see a changing slope. Changes in the 
derivative show up in f"{x)\ 

f= x 2 has /' = 2x and /" = 2 (this parabola bends up) 

y — sin x has dyjdx = cos x and d 2 yjdx 2 = — sin x (the sine bends down) 
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3 Applications of the Derivative 


The slope 2x gets larger even when the parabola is falling ♦ The sign of / or/' is not 
revealed by/''. The second derivative tells about change in slope. 

A function with f”{x) > 0 is concave up. It bends upward as the slope increases. It 
is also called convex ♦ A function with decreasing slope— this means /"(x) < 0™is 
concave down. Note how cos x and 1 + cos x and even 1 + ^x + cos x change from 
concave down to concave up at x = n/ 2. At that point f"= —cosx changes from 
negative to positive. The extra 1 -H^x tilts the graph but the bending is the same. 



Here is another way to see the sign of/". Watch the tangent lines. When the curve 
is concave up, the tangent stays below it A linear approximation is too low. This 
section computes a quadratic approximation — which includes the term with /" > 0. 
When the curve bends down (/" < 0), the opposite happens — the tangent lines are 
above the curve. The linear approximation is too high, and /" lowers it. 

In physical motion ,/"{r) is the acceleration — in units of distance/(time) 2 . Accelera- 
tion is rate of change of velocity. The oscillation sin 2 1 has v = 2 cos 2i (maximum 
speed 2) and a = — 4 sin 2f (maximum acceleration 4). 

An increasing population means /' > 0. An increasing growth rate means f"> 0. 
Those are different. The rate can slow down while the growth continues. 

MAXIMUM VS. MINIMUM 

Remember that /'(x) = 0 locates a stationary point. That may be a minimum or a 
maximum. The second derivative decides\ Instead of computing /(x) at many points, 
we compute /"(x) at one point— the stationary point. It is a minimum if f ,f (x)> 0. 


3D When/'(x) = 0 and f f, {x) > 0, there is a local minimum at x. 
When f f (x) = 0 and f fr (x) < 0, there is a local maximum at x. 


To the left of a minimum, the curve is falling. After the minimum, the curve rises. The 
slope has changed from negative to positive. The graph bends upward and /"(x) > 0. 

At a maximum the slope drops from positive to negative. In the exceptional case, 
when / r {x) = 0 and also /"(x) - 0, anything can happen. An example is x\ which 
pauses at x = 0 and continues up (its slope is 3x 2 ^ 0). However x* pauses and goes 
down (with a very flat graph). 

We emphasize that the information from /'(x) and f '(x) is only ‘'local." To be 
certain of an absolute minimum or maximum, we need information over the whole 
domain. 
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EXAMPLE 1 f{x) = x*-x 2 has f(x) = 3x 2 -2x and f"(x) = 6x-2. 

To find the maximum and/or minimum, solve 3x 2 ~ 2x = 0. The stationary points 
are x = 0 and x = f . At those points we need the second derivative. It is /"( 0) - —2 
(local maximum) and /"(f) = + 2 (local minimum). 

Between the maximum and minimum is the inflection point . Thai is where 
f fi (x ) = 0, The curve changes from concave down to concave up. This example has 
/"(*) = 6x — 2, so the inflection point is at x = 

INFLECTION POINTS 

In mathematics it is a special event when a function passes through zero. When the 
function is/, its graph crosses the axis. When the function is /', the tangent line is 
horizontal. When /" goes through zero, we have an inflection point . 

The direction of bending changes at an inflection point. Your eye picks that out in 
a graph. For an instant the graph is straight (straight lines have /" = 0), It is easy to 
see crossing points and stationary points and inflection points. Very few people can 
recognize where /'" = 0 or = 0. I am not sure if those points have names. 

There is a genuine maximum or minimum when f f {x) changes sign. Similarly, there 
is a genuine inflection point when /"(*) changes sign. The graph is concave down on 
one side of an inflection point and concave up on the other side. t The tangents are 
above the curve on one side and below it on the other side. At an inflection point, 
the tangent line crosses the curve (Figure 3,7b). 

Notice that a parabola y = ax 2 + bx + c has no inflection points: y" is constant. A 
cubic curve has one inflection point, because /" is linear, A fourth-degree curve might 
or might not have inflection points — the quadratic /"(x) might or might not cross 
the axis, 

EXAMPLE 2 x 4 — lx 2 is W-shaped, 4x 3 - 4x has two bumps, 12x 2 — 4 is U-shaped. 
The table shows the signs at the important values of x'. 

X -Ji -1 -1A/3 0 1/^/3 1 

fix) 0 - - 0, 0 - - 0 

fix) 0 + 0-0 

fix) 0-0 

Between zeros of f(x) come zeros of f*{x) (stationary points). Between zeros of f f (x ) 
come zeros of /"(x) (inflection points). In this example /(x) has a double zero at the 
origin, so a single zero of /' is caught there. It is a local maximum, since /"( 0) < 0, 
Inflection points are important — not just for mathematics. We know the world 
population will keep rising. We don’t know if the rate of growth will slow down. 
Remember: The rate of growth stops growing at the inflection point. Here is the 1990 
report of the UN Population Fund. 

The next ten years will decide whether the world population trebles or merely 
doubles before it finally stops growing. This may decide the future of the earth as 
a habitation for humans. The population, now 5.3 billion, is increasing by a quarter 
of a million every day. Between 90 and 100 million people will be added every year 

tThat rules out /{x) = x 4 , which has f" = 12x 2 > 0 on both sides of zero. Its tangent line is 
the x axis. The line stays below the graph — so no inflection point. 
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3 Applications of the Derivative 


during the 1990s; a billion people — a whole China — over the decade. The fastest 
growth will come in the poorest countries, 

A few years ago it seemed as if the rate of population growth was slowingt 
everywhere except in Africa and parts of South Asia. The world's population 
seemed set to stabilize around 10.2 billion towards the end of the next century. 

Today, the situation looks less promising. The world has overshot the marker 
points of the 1984 t4 most likely" medium projection. It is now on course for an 
eventual total that will be closer to 1 1 billion than to 10 billion. 

If fertility reductions continue to be slower than projected, the mark could be 
missed again. In that case the world could be headed towards a total of up to 14 
billion people. 

Starting w r ith a census, the UN follows each age group in each country. They 
estimate the death rate and fertility rate — the medium estimates are published. This 
report is saying that we are not on track with the estimate. 

Section 6.5 will come back to population, with an equation that predicts 10 billion. 
It assumes we are now r at the inflection point. But China's second census just started 
on July 1, 1990. When ifs finished we will know if the inflection point is still ahead. 

You now r understand the meaning of f ,r (.x). Its sign gives the direction of bending — 
the change in the slope. The rest of this section computes how much the curve bends — 
using the size of/" and not just its sign. We lind quadratic approximations based on 
/"(x), In some courses they arc optional — the main points arc highlighted. 

CENTERED DIFFERENCES AND SECOND DIFFERENCES 


Calculus begins with average velocities, computed on either side of x; 
f[x+Ax)-J\x) J j\x)-j\x-Ax) 


Ax 


and 


A.v 


are close to /'(x). 


( 1 ) 


We never mentioned it, but a better approximation to /'(.v) comes from averaging 
those two averages , This produces a centered difference, which is based on x + A.v 
and v - Ax. It divides bv 2 A.v: 


./» 


j\.x + A.v) -./(x) 
Ax 


+ JU) - /U - A 


Ax 


xf 


• Axi /(a Am 

2 Ax ' 


( 2 ) 


We claim this is better. The test is to try it on powers of x. 

For/(x) = x these ratios all give /' = 1 (exactly). For/(x) = x 2 , only the centered 
difference correctly gives/' = 2x. The one-sided ratio gave 2x + Ax (in Chapter I it 
was 2f + /t). It is only “first-order accurate," But centering leaves no error. We are 
averaging 2x — A.v with 2x — Ax. Thus the centered difference is “second-order 
accurate." 

I ask now: What ratio converges to the second derivative’ One answer is to take 
differences of the first derivative. Certainly A/'/ Ax approaches [". But we want a 
ratio involving / itself. A natural idea is to take differences of differences, which 
brings us to second differences 


fix + A x) — /(x) 
A.v 


/(x) -/(X - Ax) 
A.v 


fix + Ax) - 2/| M +j\x - Ax) 


Ax 


(Ax) 1 


dff 

dx : 


t4- ( 3 ) 


+The United Nations watches the second derivative! 
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On the top, the difference of the difference is A(A/) = A 2 /. It corresponds to d 2 f 
On the bottom, (Ax) 2 corresponds to dx 2 . This explains the way we place the 2’s in 
d 2 f/dx 2 . To say it differently: dx is squared, df is not squared — as in distance/(time) 2 . 

Note that (Ax) 2 becomes much smaller than Ax. If we divide A/ by (Ax) 2 , the ratio 
blows up. It is the extra cancellation in the second difference A 2 / that allows the limit 
to exist. That limit is /"(x). 

Application The great majority of differential equations can’t be solved exactly. 
A typical case is /"(x) = - sin /(x) (the pendulum equation). To compute a solution, 
I would replace /"(x) by the second difference in equation (3). Approximations at 
points spaced by A.x are a very large part of scientific computing. 


To test the accuracy of these differences, here is an experiment on /(x) = 
sin x + cos x. The table shows the errors at x = 0 from formulas (1), (2), (3): 


step length Ax one-sided errors centered errors second difference errors 


1/4 

.1347 

.0104 

1/8 

.0650 

.0026 

1/16 

.0319 

.0007 

1/32 

.0158 

.0002 


-.0052 
-.0013 
- .0003 
-.0001 


The one-sided errors are cut in half when Ax is cut in half. The other columns 
decrease like (Ax) 2 . Each reduction divides those errors by 4. The errors from one- 
sided differences are 0(Ax) and the errors from centered differences are O(Ax) 2 . 


The “big 0” notation When the errors are of order Ax, we write E = O(Ax). This 
means that E ^ CAx for some constant C. We don't compute C — in fact we don't 
want to deal with it. The statement “one-sided errors are Oh of delta x” captures 
what is important. The main point of the other columns is £ = 0(Ax) 2 . 


LINEAR APPROXIMATION VS. QUADRATIC APPROXIMATION 


The second derivative gives a tremendous improvement over linear approximation 
f(a) -I- f'(a)(x - a). A tangent line starts out close to the curve, but the line has no 
way to bend. After a while it overshoots or undershoots the true function (see 
Figure 3.8). That is especially clear for the model /(.x) = x 2 , when the tangent is the 
x axis and the parabola curves upward. 

You can almost guess the term with bending. It should involve f'\ and also (Ax) 2 . 
It might be exactly f"(x) times (Ax) 2 but it is not. The model function x 2 has/" = 2. 
There must be a factor 3 to cancel that 2: 


3E The quadratic approximation to a smooth function /(x) near x = a is 


f(x) * f(a) +f'(a)(x - a) + }/"(a)(x - a) 2 . (4) 


At the basepoint this is J\a) = f(a). The derivatives also agree at x = a. Furthermore 
the second derivatives agree. On both sides of (4), the second derivative at x = a is 

/*(«). 

The quadratic approximation bends with the function. It is not the absolutely 
final word, because there is a cubic term zf'"(a)(x - a) 3 and a fourth-degree term 
j$f""(a)(x - a) 4 and so on. The whole infinite sum is a “Taylor series.” Equation (4) 
carries that series through the quadratic term — which for practical purposes gives a 
terrific approximation. You will see that in numerical experiments. 
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3 Applications of Iho Derivative 


Two things to mention. First, equation (4) shows why /" > 0 brings the curve above 
the tangent line. The linear part gives the line, while the quadratic part is positive 
and bends upward. Second, equation (4) comes from (2) and (3), Where one-sided 
differences givtj\x + Ax) ^ f(x) + f*{x) Ax, centered differences give the quadratic: 

from (2): f(x + Ax) ^ f(x ~ Ax) + 2f f (x) Ax 

from (3): f(x + Ax) ^ 2f{x) —f(x — Ax) + /"(x)(Ax) 2 . 

Add and divide by 2. The result is f(x + Ax) &f(x)+ f\x)Ax + i/"(x)(Ax) 2 , This is 
correct through (Ax) 2 and misses by (Ax) 3 , as examples show: 



1 

near 7 — 
l -.v 

Fig 3.8 


EXAMPLE 3 (x + Ax) 3 % (x 3 ) + (3x 2 )(Ax) + *(6x)(Ax) 2 + error (Ax) 3 . 

EXAMPLE 4 (1+xr * 1 + nx + $n(n - l)x 2 . 

The first derivative at x = 0 is n. The second derivative is n(n — 1). The cubic term 
would be - l)(n — 2)x 3 , We are just producing the binomial expansion! 


EXAMPLE 5 # 1 + x + x 2 = start of a geometric series. 

1 - x 


1/(1 - x) has derivative 1/(1 — x) 2 . Its second derivative is 2/(1 - x) 3 . At x — 0 those 
equal U,2. The factor \ cancels the 2, which leaves 1,1,1. This explains 1 + x + x 2 . 
The next terms are x 3 and x 4 . The whole series is 1/(1 — x)— 1 + x + x 2 + x 3 H — . 

Numerical experiment 1/^1 + x& 1 - ±x + fx 2 is tested for accuracy. Dividing x 
by 2 almost divides the error by 8. If we only keep the linear part 1 - the error 
is only divided by 4, Here are the errors at x= 5, g, and 


linear approximation 
quadratic approximation 


( 

( 


error : 


■H 


- 5 

error ^ —— x 

16 




.0194 .0053 .0014 


-.00401 -.00055 


-.00007 


3.3 EXERCISES 


Read-through questions 

The direction of bending is given by the sign of a If the 
second derivative is b in an interval, the function is con- 
cave up (or convex). The graph bends c The tangent 
lines are d the graph. If /"(x) < 0 then the graph is con- 
cave e . and the slope is f 

At a point where f'{x) = 0 and /"(x) > 0, the function has a 
q . At a point where h . the function has a maximum. 
A point where /"(*) = 0 is an * point, provided f” 
changes sign. The tangent line J the graph. 

The centered approximation to f'{x) is [_*_]/2Ax, The 
3-point approximation to f ft (x) is [_!_]/(Ax} 2 . The second- 
order approximation to J[x + Ax) is f[x) + /'(x)Ax + m . 
Without that extra term this is just the ft approximation. 
With that term the error is Of o ). 


1 A graph that is concave upward is inaccurately said to 
“hold water ” Sketch a graph with / HH (x) > 0 that would not 
hold water. 

2 Find a function that is concave down for x < 0 and con- 
cave up for 0 < x < 1 and concave down for x > 1. 

3 Can a function be always concave down and never cross 
zero? Can it be always concave down and positive? Explain. 

4 Find a function with /"(2) — 0 and no other inflection 
point. 

True or false, when /(x) is a 9th degree polynomial with 
/'(l) = 0 and /'( 3) = 0. Give (or draw) a reason. 

5 J[x) = 0 somewhere between x = I and x = 3. 

6 /"(x) = 0 somewhere between x = 1 and x = 3. 
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7 There is no absolute maximum at x = 3. 

8 There are seven points of inflection, 

9 If fix) bas nine zeros, it has seven inflection points. 

10 VJ[x) has seven inflection points, it has nine zeros. 

In 11-16 decide which stationary points are maxima or 
minima. 

11 f(x) = x 2 - 6x 12 fix) = x 3 — 6x 2 

13 f[x) = x*-6x* 14/[x) = x n -6x 10 

15 /(x) = sin x — cos x 16 /(x) = x + sin 2x 

Locate the inflection points and the regions where /[x) e con- 
cave up or down. 

17 f[x) = x + x 1 - x 3 18 f{x) = sin x + tan x 

19 f[x) = (x “ 2) 2 (x — 4) 2 20 f[x) = sin x + (sin x) 1 

21 If f{x) is an even function, the centered difference 
[/(Ax) —/(—Ax)]/ 2Ax exactly equals /'(0) = 0. Why? 

22 If fix ) is an odd function, the second difference 
[/(Ax) — 2/(0) H-/( — Ax)]/(Ax) 2 exactly equals/ u (0) = 0. Why? 

Write down the quadratic /l0}-h/'(0)x + i/"(0)x 2 in 23-26. 

23 f[x) = cos x + sin x 24 fix) = tan x 

25 fix) = (sin x)/x 26 /(x) - 1 + x + x 2 

In 26, find/(l) 4/'(l)(x — 1) + i/"(l)(x — l) 2 around a = 1. 

27 Find A and B in ^1 — x ss 1 + Ax + Bx 2 . 

28 Find A and B in 1/(1 -x) 2 ^ 1 + Ax + Bx 2 . 

29 Substitute the quadratic approximation into 
[fix + Ax) — /(x)]/Ax, to estimate the error in this one-sided 
approximation to /'(x)- 

30 What is the quadratic approximation at x = 0 to/(— Ax)? 

31 Substitute for /(x + Ax) and f(x — Ax) in the centered 
approximation [ fix + Ax) — f(x - Ax)]/2Ax, to get 
Ax) 4- error Find the Ax and (Ax) 2 terms in this error Test 
on fix) = x* at x = 0. 

32 Guess a third-order approximation /(Ax) ss /(0) + 

/ r (G) Ax + i/" (0)(Ax) 2 + . Test it on fix) = x 3 . 


Construct a table as in the text, showing the actual errors at 
x-0 in one-sided differences, centered differences, second 
differences, and quadratic approximations. By hand take two 
values of Ax, by calculator take three, by computer take four. 

33 /(x) = x 3 + x 4 34 fix) = 1/(1 -x) 

35 fix) = x 2 + sin x 

36 Example 5 was 1/(1 — x) as 1 + x + x 2 . What is the error 
at x = 0. 1? What is the error at x = 2? 

37 Substitute x = ,01 and x= - 0.1 in the geometric series 
l/(l-x) = l + x + x 2 + - to find 1/.99 and 1/1.1— first to 
four decimals and then to all decimals. 

38 Compute cos T by equation (4) with a = 0. OK to check 
on a calculator. Also compute cos 1. Why so far off? 

39 Why is sin x x x not only a linear approximation but also 
a quadratic approximation? x = 0 is an _____ point. 

40 Uflx) is an even function, find its quadratic approximation 
at x = 0. What is the equation of the tangent line? 

41 For /(x)“x + x 2 + x 3 , what is the centered difference 
[/(3)— /(l)]/2, and what is the true slope /'(2)? 

42 For flx)-x + x 2 + x 3 , what is the second difference 
Lft3) — 2fi2) +/U)]/1 2 , a nd what is the exact A(2)? 

43 The error in fia)+f\a){x — a) is approximately 

— ft) 2 * This error is positive when the function is 
. Then the tangent line is the curve. 

44 Draw a piecewise linear y(x) that is concave up. Define 
“concave up T * without using the test d 2 y/dx 2 > 0. If derivatives 
don’t exist, a new definition is needed. 

45 What do these sentences say about / or /' or f" or /'"? 

1. The population is growing more slowly. 

2. The plane is landing smoothly. 

3. The economy is picking up speed. 

4. The tax rate is constant, 

5. A bike accelerates faster but a car goes faster. 

6. Stock prices have peaked. 

7. The rate of acceleration is slowing down. 

8. This course is going downhill. 

46 (Recommended) Draw a curve that goes up-down-up. 
Below it draw its derivative. Then draw its second derivative. 
Mark the same points on all curves — the maximum, minimum, 
and inflection points of the first curve. 

47 Repeat Problem 46 on a printout showing y(x) = 
x 3 - 4x 2 + x + 2 and dyjdx and d 2 y/dx 2 on the same graph. 
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3 Applications of the Derivative 
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REFERENCE 

500— 
400 

300 

200 

175 

150— 

140 

1 30 — — 

120 

110 — 

100 

95 — — 

90 

85“ — 

80 

75 

70 

65 

60 

55 

50 - — 

45 


40 


35 


Reading a graph is like appreciating a painting. Everything is there, but you have to 
know what to look for. One way to learn is by sketching graphs yourself, and in the 
past that was almost the only way. Now it is obsolete to spend weeks drawing 
curves — a computer or graphing calculator does it faster and better. That doesn’t 
remove the need to appreciate a graph (or a painting), since a curve displays a 
tremendous amount of information. 

This section combines two approaches. One is to study actual machine-produced 
graphs (especially electrocardiograms). The other is to understand the mathematics 
of graphs — slope, concavity, asymptotes, shifts, and scaling. We introduce the 
centering transform and zoom transform. These two approaches are like the rest of 
calculus, where special derivatives and integrals are done by hand and day-to-day 
applications are by computer. Both are essential — the machine can do experiments 
that we could never do. But without the mathematics our instructions miss the point. 
To create good graphs you have to know a few of them personally. 

READING AN ELECTROCARDIOGRAM (ECG Or EKG) 

The graphs of an ECG show the electrical potential during a heartbeat. There are 
twelve graphs - -six from leads attached to the chest, and six from leads to the arms 
and left leg. {It doesn’t hurt, but everybody is nervous. You have to lie still, because 
contraction of other muscles will mask the reading from the heart.) The graphs record 
electrical impulses, as the cells depolarize and the heart contracts. 

What can I explain in two pages? The graph shows the fundamental pattern of the 
ECG. Note the P wave , the QRS complex , and the T wave. Those patterns, seen 
differently in the twelve graphs, tell whether the heart is normal or out of rhythm — 
or suffering an infarction (a heart attack). 



First of all the graphs show the heart rate . The dark vertical lines are by convention 
3 second apart. The light lines are ^ second apart. If the heart beats every 5 second 
(one dark line) the rate is 5 beats per second or 300 per minute. That is extreme 
tachycardia — not compatible with life. The normal rate is between three dark lines 
per beat second, or 100 beats per minute) and five dark lines (one second between 
beats, 60 per minute). A baby has a faster rate, over 100 per minute. In this figure 

the rate is . A rate below 60 is bradycardia , not in itself dangerous. For a resting 

athlete that is normal. 

Doctors memorize the six rates 300, 150, 100, 75, 60, 50. Those correspond to 1,2, 
3, 4, 5, 6 dark lines between heartbeats. The distance is easiest to measure between 
spikes (the peaks of the R wave). Many doctors put a printed scale next to the chart. 
One textbook emphasizes that “Where the next wave falls determines the rate. No 
mathematical computation is necessary.” But you see where those numbers come 
from. 
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The next thing to look for is heart rhythm * The regular rhythm is set by the 
pacemaker, which produces the P wave. A constant distance between waves is good — 
and then each beat is examined* When there is a block in the pathway, it shows as 
a delay in the graph* Sometimes the pacemaker fires irregularly* Figure 3*10 shows 
sinus arrythmia (fairly normal)* The time between peaks is changing. In disease or 
emergency, there are potential pacemakers in all parts of the heart. 

I should have pointed out the main parts. We have four chambers, an atrium- 
ventricle pair on the left and right. The SA node should be the pacemaker* The 
stimulus spreads from the atria to the ventricles — from the small chambers that 
“prime the pump” to the powerful chambers that drive blood through the body* The 
P wave comes with contraction of the atria. There is a pause of jq second at the AV 
node. Then the big QRS wave starts contraction of the ventricles, and the T wave is 
when the ventricles relax* The cells switch back to negative charge and the heart cycle 
is complete* 



Rg* 3.9 Happy person with a heart and a normal electrocardiogram* 

The ECG shows when the pacemaker goes wrong. Other pacemakers take over — 
the AV node will pace at 60/minute* An early firing in the ventricle can give a wide 
spike in the QRS complex, followed by a long pause* The impulses travel by a slow 
path. Also the pacemaker can suddenly speed up (paroxysmal tachycardia is 
150-250/ minute)* But tbe most critical danger is fibrillation. 

Figure 3* 10b shows a dying heart. The ECG indicates irregular contractions— no 
normal PQRST sequence at all. What kind of heart would generate such a rhythm? 
The muscles are quivering or “fibrillaling” independently. The pumping action is 
nearly gone, which means emergency care. The patient needs immediate CPR— 
someone to do the pumping that the heart can’t do* Cardio-pulmonary resuscitation 
is a combination of chest pressure and air pressure (hand and mouth) to restart the 
rhythm. CPR can be done on the street, A hospital applies a defibrillator, which 
shocks the heart back to fife. It depolarizes all the heart cells, so the timing can be 
reset. Then the charge spreads normally from SA node to atria to AV node to 
ventricles* 

This discussion has not used all twelve graphs to locate the problem. That needs 
vectors. Look ahead at Section 11,1 for the heart vector, and especially at Section 11*2 
for its twelve projections * Those readings distinguish between atrium and ventricle, 
left and right, forward and back. This information is of vital importance in the event 
of a heart attack, A “heart attack” is a myocardial infarction (MI). 

An MI occurs when part of an artery to the heart is blocked (a coronary occlusion)* 
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3 Applications of Ifw Derivative 



Ftfl.3 .10 Doubtful rhythm. Serious fibrillation. Signals of a heart attack. 


An area is without blood supply — therefore without oxygen or glucose. Often the 
attack is in the thick left ventricle, which needs the most blood. The cells are first 
ischemic, then injured, and finally infarcted (dead). The classical ECG signals involve 
those three Vs: 

Ischemia : Reduced blood supply, upside-down T wave in the chest leads. 
Injury: An elevated segment between S and T means a recent attack. 
Infarction: The Q wave, normally a liny dip or absent, is as wide as a small 
square second). It may occupy a third of the entire QRS complex. 

The Q wave gives the diagnosis. You can find all three Vs in Figure 3.10c. 

It is absolutely amazing how much a good graph can do. 

THE MECHANICS OF GRAPHS 

From the meaning of graphs we descend to the mechanics. A formula is now given 
foryjx). The problem is to create the graph. It would be too old-fashioned to evaluate 
J\x) by hand and draw a curve through a dozen points. A computer has a much 
better idea of a parabola than an artist (who tends to make it asymptotic to a straight 
line). There are some things a computer knows, and other things an artist knows, 
and still others that you and I know — because we understand derivatives. 

Our job is to apply calculus. We extract information from /' and f" as well as /. 
Small movements in the graph may go unnoticed, but the important properties come 
through. Here are the main tests: 

1. The sign of f(x) (above or below axis: 0 at crossing point) 

2. The sign of/'(x) (increasing or decreasing: /' — 0 at stationary point) 

3. The sign of /"(*) (concave up or down: /" — 0 at inflection point) 

4. The behavior offlx) as x -► oo and x -> — oo 

5. The points at which y^x) -> oo otJ\x )-*♦ - oo 

6. Even or odd? Periodic? Jumps in / or /'? Endpoints? J\0)1 

EXAMPLE, /(I) . 

The sign of/x) depends on 1 - x 2 . Thus fix) > 0 in the inner interval where x 2 < 1. 
The graph bends upwards (/"(x) > 0) in that same interval. There are no inflection 
points, since /" is never zero. The stationary point where /' vanishes is x «= 0. We 
have a local minimum at x “ 0. 

The guidelines (or asymptotes) meet the graph at infinity. For large x the important 
terms are x 2 and -x 2 . Their ratio is + x 2 /- x 2 = — 1 — which is the limit as x oo 
and x - oo. The horizontal asymptote is the line y = — 1. 

The other infinities, where / blows up, occur when 1 - x 2 is zero. That happens at 
x = 1 and x = - 1. The vertical asymptotes are the lines x — 1 and x ™ — 1. The graph 
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in Figure 3.11a approaches those lines. 

if J{x) -► b as x -► + oo or - oo, the line y — b is a horizontal asymptote 
+ oo or - oo as x -► a, the line x = a is a vertical asymptote 
if^x) - (mx + b) -* 0 as x -* + oo or ™ oo, the line y = mx + his a sloping asymptote. 

Finally comes the vital fact that this function is eveniffx) =J{—x) because squaring 
x obliterates the sign. The graph is symmetric across the y axis. 

To summarize the effect of dividing by 1 ^ x 2 : No effect near x = 0, Blowup at 1 
and -1 from 2 ero in the denominator. The function approaches -1 as |x| -+ oo, 

example 


This example divides by x— 1. Therefore x= 1 is a vertical asymptote, where yix) 
becomes infinite. Vertical asymptotes come mostly from zero denominators. 

Look beyond x — L Both^x) and f"(x) are positive for x > L The slope is zero at 
x = 2. That must be a local minimum. 

What happens as x ^ oo? Dividing x 2 by x “ 1, the leading term is x. The function 
becomes large. It grows linearly — we expect a sloping asymptote . To find it, do the 
division properly: 


x 


2 


X 1 


= x+ 1 + 


1 

X 1 


(1) 


The last term goes to zero. The function approaches y — x + 1 as the asymptote. 
This function is not odd or even. Its graph is in Figure 3,11b, With zoom out you 
see the asymptotes. Zoom in for /= 0 or /' = 0 or /" = 0, 




Hg. 3,11 The graphs of x 2 j(l — x 2 } and x 2 /(x— 1) and sin x + i sin 3jt. 

EXAMPLE 3 J[x) — sin x + i sin 3x has the slope f f (x) ~ cos x + cos 3x, 

Above all these functions are periodic. If x increases by 2 tt, nothing changes. The 
graphs from 2 n to An are repetitions of the graphs from 0 to 2 it. Thus f(x + 2n) ~J{x) 
and the period is 2n> Any interval of length In will show a complete picture, and 
Figure 3.11c picks the interval from —n to n. 

The second outstanding property is that / is odd , The sine functions satisfy 
^-x) = The graph is symmetric through the origin. By reflecting the right half 

through the origin, you get the left half. In contrast, the cosines in /'(x) are even. 

To find the zeros of /(x) and /' (x) and /"{x), rewrite those functions as 

ffx) = 2 sin x- |sin 3 x /'{x) = -2 cos x + 4 cos 3 x /"(*)" - 10 sin x+ 12 sin 3 x. 
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3 Applications of the Dertvattvo 


We changed sin 3x to 3 sin x - 4 sin 3 x. For the derivatives use sin 2 * - 1 - cos 2 x. 
Now find the zeros — the crossing points , stationary points, and inflection points: 

f— 0 2 sin * — \ sin 3 * => sin * = 0 or sin 2 * = f x — 0, ±n 

/' = 0 2 cos * ~ 4 cos 3 * => cos * = 0 or cos 2 * = j => x = ± nf4, ±rc/2, ± 3n/4 

/" — 0 5 sin * = 6 sin 3 * => sin * = 0 or sin 2 * = f => * = 0, ±66% ±114% ± iz 

That is more than enough information to sketch the graph. The stationary points 
n/4, n/2 , 3n/4 are evenly spaced. At those points^*) is ^8/3 (maximum), 2/3 (local 
minimum), v/8/3 (maximum). Figure 3.1 1c shows the graph. 

I would like to mention a beautiful continuation of this same pattern: 

fix) — sin x + j sin 3* + ^ sin 5* + - ' f{x ) = cos * + cos 3* + cos 5* + ■ " 

If we stop after ten terms, fix) is extremely close to a step function . If we don’t stop, 
the exact step function contains infinitely many sines. It jumps from -ji/ 4 to + nj4 as 
* goes past zero. More precisely it is a "square wave” because the graph jumps back 
down at n and repeats. The slope cos * + cos 3* H — also has period In* Infinitely 
many cosines add up to a delta function] (The slope at the jump is an infinite spike.) 
These sums of sines and cosines are Fourier series. 

GRAPHS BY COMPUTERS AND CALCULATORS 

We have come to a topic of prime importance. If you have graphing software for a 
computer, or if you have a graphing calculator , you can bring calculus to life. A graph 
presents fix) in a new way — different from the formula. Information that is buried 
in the formula is clear on the graph. But don't throw away y(x) and dyfdx . The 
derivative is far from obsolete. 

These pages discuss how calculus and graphs go together. We work on a crucial 
problem of applied mathematics — to find where y(x) reaches its minimum. There is 
no need to tell you a hundred applications. Begin with the formula. How do you find 
the point x* where y(x) is smallest? 

First, draw the graph. That shows the main features. We should see (roughly) where 
x* lies. There may be several minima, or possibly none. But what we see depends on 
a decision that is ours to make — the range of x and y in the viewing window . 

If nothing is known about fix), the range is hard to choose. We can accept a default 
range, and zoom in or out. We can use the autoscaling program in Section 1.7. 
Somehow x* can be observed on the screen. Then the problem is to compute it. 

I would like to work with a specific example. We solved it by calculus— to find 
the best point x* to enter an expressway. The speeds in Section 3.2 were 30 and 60. 
The length of the fast road will be b = 6. The range of reasonable values for the entering 
po int is 0 x < 6. The distance to the road in Figure 3.12 is a = 3. We drive a distance 
+ * 2 at speed 30 and the remaining distance 6 — x at speed 60: 

driving time y(x) = Jl 2 + x 2 + ^ (6 - x). (2) 

This is the function to be minimized. Its graph is extremely flat. 

It may seem unusual for the graph to be so level. On the contrary, it is common. 
A fiat graph is the whole point of dyfdx = 0. 

The graph near the minimum looks like y-Cx 2 . It is a parabola sitting on a 
horizontal tangent. At a distance of Ax = .01, we only go up by C(A*) 2 — .0001 C. 
Unless C is a large number, this Ay can hardly be seen. 
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Rg.3 .12 Enter at x. The graph of driving time y(x). Zoom boxes locate x*. 


The solution is to change scale. Zoom in on x*. The tangent line stays flat, since 
dy/dx is still zero. But the bending from C is increased. Figure 3.12 shows the zoom 
box blown up into a new graph of y(x), 

A calculator has one or more ways to find x*. With a TRACE mode, you direct a 
cursor along the graph. From the display of y values, read y mai and x * to the 
nearest pixel. A zoom gives better accuracy, because it stretches the axes — each 
pixel represents a smaller Ax and Ay. The TI-81 stretches by 4 as default. Even 
better, let the whole process be graphical — draw the actual ZOOM BOX on the 
screen. Pick two opposite corners, press ENTER, and the box becomes the new 
viewing window (Figure 3.12). 

The first zoom narrows the search for x*. It lies between x = 1 and x = 3. We build 
a new ZOOM BOX and zoom in again. Now l.S < x* <2. Reasonable accuracy 
comes quickly. High accuracy does not come quickly. It takes time to create the box 
and execute the zoom. 

Question 1 What happens as we zoom in, if all boxes are square (equal scaling)? 
Answer The picture gets flatter and flatter. We are zooming in to the tangent line. 
Changing x to Xj4 and y to Yf 4, the parabola y = x z flattens to Y= X 2 j4. To see 
any bending, we must use a long thin zoom box. 


I want to change to a totally different approach. Suppose we have a formula for 
dyjdx. That derivative was produced by an infinite zoom! The limit of Ay/Ax came by 
brainpower alone: 


dy _ x l_ 

dx 30^/3 2 + x 1 60' 


Call this Ax). 



zero slope 
at minimum 


Fig. 3.13 


This function is zero at x*. The computing problem is completely changed: Solve 
Ax) = 0, It is easier to find a root of f(x) than a minimum of y(x). The graph of f(x) 
crosses the x axis. The graph of y(x) goes flat — this is harder to pinpoint. 

Take the model function y = x 2 for |x| < .01. The slope /= 2x changes from —.02 
to +.02. The value of x 2 moves only by .0001 — its minimum point is hard to see. 

To repeat: Minimization is easier with dyjdx. The screen shows an order of magni- 
tude improvement, when we trace or zoom on Ax) — 0. In calculus, we have been 
taking the derivative for granted, It is natural to get blase about dyjdx = 0. We forget 
how intelligent it is, to work with the slope instead of the function. 

Question 2 How do you get another order of magnitude improvement? 

Answer Use the next derivative! With a formula for dfjdx, which is d 2 yjdx 2 , tbe 
convergence is even faster. In two steps the error goes from .01 to .0001 to .00000001. 
Another infinite zoom went into the formula for dfjdx, and Newton’s method takes 
account of it. Sections 3.6 and 3.7 study /(x) = 0. 
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3 Application® of the Derivative 


The expressway example allows perfect accurac y. We can solve dyjdx = 0 by alge- 
bra. The equation simplifies to 60x = 30^/3 2 + x 2 . Dividing by 30 and squaring yields 
4x 2 — 3 2 + x 2 . Then 3x 2 = 3 2 . The exact solution is x* = ^/3 = 1.73205.., 

A model like this is a benchmark, to test competing methods. It also displays what 
we never appreciated — the extreme flatness of the graph. The difference in driving 
time between entering at x* = y/3 and x “ 2 is one second . 

THE CENTERING TRANSFORM AND ZOOM TRANSFORM 

For a photograph we do two things — point the right way and stand at the right 
distance. Then take the picture. Those steps are tbe same for a graph. First we pick 
the new center point. The graph is shifted, to move that point from (a, b) to (0, 0). 
Then we decide how far the graph should reach. It fits in a rectangle, just like the 
photograph. Rescaling to xjc and yjd puts the desired section of the curve into the 
rectangle. 

A good photographer does more (like an artist). The subjects are placed and the 
camera is focused. For good graphs those are necessary too. But an everyday calcula- 
tor or computer or camera is built to operate without an artist — just aim and shoot. 
I want to explain how to aim at y-f{x). 

We are doing exactly what a calculator does, with one big difference. It doesn't 
change coordinates . We do . When x = I, y — - 2 moves to the center of the viewing 
window, the calculator still shows that point as (1, - 2). When the centering transform 
acts on y + 2 - m(x - 1), those numbers disappear. This will be confusing unless x 
and y also change. The new coordinates are X — x — 1 and Y — y 4- 2. Then the new 
equation is Y—mX. 

The main point (for humans) is to make the algebra simpler. The computer has no 
preference for Y~ mX over y — y 0 — m(x — x 0 ). It accepts 2x 2 — 4x as easily as x 2 . 
But we do prefer Y= mX and y = x 2 , partly because their graphs go through (0, 0). 
Ever since 2 ero was invented, mathematicians have liked that number best. 


3F A centering transform shifts left by a and down by b\ 

X- x- a and Y = y - h change y -f[x) into T+ b ^f{X + a). 


EXAMPLE 4 The parabola y - 2x 2 - 4x has its minimum when dyjdx — 4x - 4 — 0. 
Thus x — 1 and y = — 2. Move this bottom point to the center: y — 2x 2 — 4x is 

Y + 2 = 2{X - l) 2 - MX - 1) or Y— 2X 1 . 

The new parabola Y = 2X 2 has its bottom at (0, 0). It is the same curve, shifted across 
and up. The only simpler parabola is y = x 2 . This final step is the job of the zoom. 

Next comes scaling. We may want more detail (zoom in to see the tangent line). 
We may want a big picture (zoom out to check asymptotes). We might stretch one 
axis more than the other, if the picture looks like a pancake or a skyscraper. 


3G A zoom transform scales the X and Y axes by c and d: 

x~cX and y = dY change T=F(A r ) to y -dF(x/c). 

The new x and y are boldface letters, and the graph is rescaled. Often c- d. 
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EXAMPLE 5 Start with Y- 2X 2 . Apply a square zoom with c-d. In the new xy 
coordinates, the equation is y/c = 2(x/c) 2 . The number 2 disappears if c = d - 2. With 
the right centering and the right 200 m, every parabola that opens upward is y = x 2 . 

Question 3 What happens to the derivatives ( slope and bending) after a zoom? 
Answer The slope (first derivative) is multiplied by d/c . Apply the chain rule to y = 
dF(xfc). A square zoom has d/c = 1 — fines keep their slope. The second derivative is 
multiplied by d/c 2 , which changes the bending. A 200 m out divides by small numbers 
c — d, so the big picture is more curved. 

Combining the centering and zoom transforms, as we do in practice, gives y in 
terms of x: 


y=/lx) becomes 


Y=j{X + a)-b 


and then 






flfl- 3.14 Change of coordinates by centering and zoom. Calculators still show (jc, y). 


Question 4 Find x and y ranges after two transforms. Start between -1 and 1. 
Answer The window after centering is — 1 < x — a ^ 1 and — l^y—b^l. The 
window after zoom is - 1< c{x - a) < 1 and - d(y - b) ^ 1. The point (1, 1) was 
originally in the corner. The point (tT 1 + a, d~ v + b) is now in the comer. 

The numbers a, b y c, d are chosen to produce a simpler function (like y = x 2 ). Or 
else — this is important in applied mathematics — they are chosen to make x and y 
“dimensionless.” An example is y -{ cos St. The frequency 8 has dimension 1/time. 
The amplitude ^ is a distance. With d = 2 cm and c = 8 sec, the units are removed 
and y = cos t. 

May I mention one transform that does change the slope? It is a rotation. The 
whole plane is turned. A photographer might use it — but normally people are sup- 
posed to be upright. You use rotation when you turn a map or straighten a picture. 
In the next section, an unrecognizable hyperbola is turned into Y — 1/X. 


3.4 EXERCISES 


Read-through questions 

The position, slope, and bending of y =/(x) are decided by 

0 b and c If [J{x)\ — ► go as x — ► a, the line x ~ 

a. is a vertical <* , Ify(x) — ► b for large x, then y = b is a 

e . If /(*) — mx->b for large x, then y=mx+b is a 

1 The asymptotes of y = x 2 /(x 2 - 4) are 9 . This 

function is even because W-x)= * . The function sin kx 

has period ( 

Near a point where dy/dx = 0, the graph is extremely 
) For the model y = Cx 2 , x — .1 gives y = * . A box 


around the graph looks long and r We g in to that 
box for another digit of x* But solving dy/dx = 0 is more 
accurate, because its graph » the x axis. The slope of 
dy/dx is £ Each derivative is like an p zoom. 

To move (a, b) to (0, Q\ shift the variables to X = Q 
and Y = t . This * transform changes y =f[x) to 
Y = t The original slope at (a, b) equals the new slope 
at u . To stretch the axes by c and d, set x^cX and 

v The w transform changes Y = F( X) to y = x 
Slopes are multiplied by y Second derivatives are 
multiplied by * 
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3 Applications of the Derivative 


1 Find the pulse rate when heartbeats are ± second or two 
dark lines or x seconds apart. 

2 Another way to compute the heart rate uses marks for 
6-second intervals. Doctors count the cycles in an interval. 

(a) How many dark lines in 6 seconds? 

(b) With 8 beats per interval, find the rate. 

(c) Rule: Heart rate = cycles per interval times . 


30 True (with reason) or false (with example). 

(a) Every ratio of polynomials has asymptotes 

(b) If J[x) is even so is /"(x) 

(c) If f\x) is even so is J\x) 

(d) Between vertical asymptotes, /'(x) touches zero. 

31 Construct an /(x) that is “even around x “ 3.” 

32 Construct g(x) to be “odd around x — 


Which functions id 3-18 are even or odd or periodic? Find all 
asymptotes: y = b or x — a or y — mx + b. Draw roughly by 
hand or smoothly by computer. 


3/(x) = *-(9/x) 

s /W=Y3^2 


7 f(x) = 


x 2 + 3 
x 2 + 1 


4 /(x) = xT (any integer n) 


«/w= 


4 — x 


8/(x) = 


x 2 -t-3 
x + 1 


Create graphs of 33-38 on a computer or calculator. 

33 y(x) = (l + t/x)*, -3^x^3 

34 y(x) = x I/I ,0.1^x^2 

35 y(x) — sin(x/3) + sin(x/5) 

36 y(x) = (2 — x)/(2 + x), — 3 s£ x 3 

37 y(x) = 2x 3 4- 3x 2 — 12x + 5 on [-3, 3] and [2.9, 3.1] 

38 100[sin(x -f .1) — 2 sin x + sin(x — .1)] 


9 /(x) = (sin x)(sin 2x) 10 /(x) = cos x + cos 3x + cos 5x 


11 /{x) = 


x 2 -l 


12 f(x) = — 
sin x 


aMm 7TP 

4 - 1 

,5 ^>=^T 

17 f(x) = x - sin x 


Uf(x) = — -2x 

. - , sin x + cos x 

16 f(x) = 

sin x — cos x 

1 *f(x)~(llx)-y/i 


Id 19-24 construct f[x) with exactly these asymptotes. 

19 x = 1 and y — 2 24) x=l,x = 2 T y = 0 

21 y = x and x = 4 22 y = 2x + 3 and x = 0 

23 y = x (x -► oo), y — — x(x — oo) 

24 x = 1, x = 3, y=*x 

25 For P{x)/Q(x) to have y — 2 as asymptote, the polynomials 

P and Q must be . 

26 For P[x)/Q(x) to have a sloping asymptote, the degrees of 

P and Q must be . 

27 For P(x)jQ{x) to have the asymptote y — 0, the degrees of 

P and Q must . The graph of x 4 /(l + x 2 ) has what 

asymptotes? 

28 Both l/(x — Ij and i/(x — l) 2 have x=l and y = 0 as 
asymptotes. The most obvious difference in the graphs is 


29 If /'(x) has asymptotes x — 1 and y = 3 then f{x) has 
asymptotes . 


In 39-40 show the asymptotes od large-scale computer graphs. 


39 (a) y = 


40 (a )y = 


x 3 + 8x- 15 
x 2 — 2 

x 2 — 2 

x 3 + 8x — 15 


(b )y = 
(b)y = 


x 4 — 6x 3 + 1 
2x 4 + x 2 

x 2 — x + 2 
x 2 - 2x + 1 


41 Rescale y = sin x so X is in degrees, not radians, and Y 
changes from meters to centimeters. 


Problems 42-46 minimize the driving time y(x) in the text. 
Some questions may not fit your software. 

42 Trace along the graph of y(x) to estimate x* Choose an 
xy range or use the default. 

43 Zoom in by c = d = 4. How many zooms until you reach 
x* = 1.73205 or 1.7320508? 

44 Ask your program for the minimum of y(x) and the solu- 
tion of dyjdx = 0. Same answer? 

45 What are the scaling factors c and d for the two zooms in 
Figure 3.12? They give the stretching of the x and y axes, 

46 Show that dyjdx = — 1/60 and d 2 y/dx 2 = 1/90 at x = 0. 
Linear approximation gives dyjdx se — 1/60 + x/90. So the 

slope is zero near x = . This is Newton's method, 

using the next derivative. 

Change the function to y(x) = ^15 + x 2 /30 + ( 10 — x)/60, 

47 Find x* using only the graph of y(x). 

48 Find x* using also the graph of dyjdx . 

49 What are the xy and XY and xy equations for the line in 
Figure 3. 14? 
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50 Define / fl (x) = sin x + jsin 3x -h^sin 5x + ii * (n terms). 
Graph f s and / 10 from — n to n. Zoom in and describe the 
Giftfcs phenomenon at x = 0. 

On the graphs of 51-56, zoom in to all maxima and minima 
(3 significant digits). Estimate inflection points. 

51 y = 2x 5 - 16x 4 + 5x 3 - 37x 2 + 21x 4- 683 

52 y = 3 c s -x 4 - s /3x + 1-2 

53 y = x{x- l)(x — 2X* - 4) 


54 y = 7 sin 2x + 5 cos 3x 

55 y = (x 3 -2x + l)/(x 4 - 3x 2 ™15), -3<x<5 

56 y = x sin (i/x), 0.1 ^x< 1 

57 A 10-digit computer shows y = 0 and dy/dx — .01 at x* = 1* 
This root should be correct to about (8 digits) (10 digits) 
(12 digits). Hint: Suppose y = .01 (x — 1 terror). What errors 
don’t show in 10 digits of y? 

58 Which is harder to compute accurately: Maximum point 
or inflection point? First derivative or second derivative? 
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Here is a list of the most important curves in mathematics, so you can tell what is 
coming. It is not easy to rank the top four: 

L straight lines 

2 . sines and cosines (oscillation) 

3* exponentials (growth and decay) 

4. parabolas, ellipses, and hyperbolas (using 1, x s y , x 2 , xy s y 2 ). 

The curves that I wrote last, the Greeks would have written first. It is so natural to 
go from linear equations to quadratic equations. Straight lines use l,x, y. Second 
degree curves include x 2 , xy, y 2 . If we go on to x 3 and y 3 , the mathematics gets 
complicated. We now study equations of second degree, and the curves they produce. 

It is quite important to see both the equations and the curves. This section connects 
two great parts of mathematics — analysis of the equation and geometry of the curve. 
Together they produce ‘analytic geometry You already know about functions and 
graphs. Even more basic: Numbers correspond to points. We speak about “ the point 
(5, 2).” Euclid might not have understood. 

Where Euclid drew a 45° line through the origin, Descartes wrote down y — x. 
Analytic geometry has become central to mathematics — we now look at one part of it. 



Hg.3 A5 The cutting plane gets steeper: circle to ellipse to parabola to hyperbola. 
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3 Applications el the Derivative 


CONIC SECTIONS 

The parabola and ellipse and hyperbola have absolutely remarkable properties. The 
Greeks discovered that all these curves come from slicing a cone by a plane. The 
curves are “conic sections.” A level cut gives a circle, and a moderate angle produces 
an ellipse. A steep cut gives the two pieces of a hyperbola {Figure 3.1 5d). At the 
borderline, when the slicing angle matches the cone angle, the plane carves out a 
parabola. It has one branch like an ellipse, but it opens to infinity like a hyperbola. 

Throughout mathematics, parabolas are on the border between ellipses and 
hyperbolas. 

To repeat: We can slice through cones or we can look for equations. For a cone 
of light, we see an ellipse on the wall. (The wall cuts into the light cone.) For an 
equation Ax 2 + Bxy + Cy 2 + Dx + Ey + F = 0, we will work to make it simpler. The 
graph will be centered and rescaled (and rotated if necessary), aiming for an equation 
like y = x 2 . Eccentricity and polar coordinates are left for Chapter 9. 

THE PARABOLA y=ax l +bx+c 

You knew this function long before calculus. The graph crosses the x axis when 
y = 0. The quadratic formula solves y = 3x 2 - 4x + 1 = 0, and so does factoring into 
(x - l)(3x - 1), The crossing points x = 1 and x = 3 come from algebra. 

The other important point is found by calculus. It is the minimum point, where 
dyjdx = 6x - 4 = 0. The x coordinate is % - \ , halfway between the crossing points. 
The height is y min = - 3. This is the vertex V in Figure 3.16a — at the bottom of the 
parabola. 

A parabola has no asymptotes. The slope 6x - 4 doesn’t approach a constant. 

To center the vertex Shift left by } and up by 5. So introduce the new variables 
X — x — $ and Y= y + 7, Then x = 5 and y = — 3 correspond to X = 7=0 — which 
is the new vertex: 

y=3x 2 -4x+l becomes Y=3X 2 . (1) 

Check the algebra. 7 = 3X Z is the same as y + $ = 3{x - §) 2 . That simplifies to the 
original equation y = 3x 2 - 4x + 1. The second graph shows the centered parabola 
7= 3X 2 , with the vertex moved to the origin. 

To zoom in on the vertex Rescale X and 7 by the zoom factor a: 

Y = 3X 2 becomes y/a = 3(x/a) 2 . 

The final equation has x and y in boldface. With a = 3 we find y = x 2 — the graph is 
magnified by 3. In two steps we have reached the model parabola opening upward. 





Fig. 3.16 Parabola with minimum at V. Rays reflect to focus. Centered in (b), rescaled in (c). 
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A parabola has another important point — the focus . Its distance from the vertex 
is called p, The special parabola y = x 2 3 has p- l/4> and other parabolas Y= aX 1 
have p = l/4a. You magnify by a factor a to get y = x 2 . The beautiful property of a 
parabola is that every ray coming straight down is reflected to the focus. 

Problem 2.3.25 located the focus F — here we mention two applications. A solar 
collector and a TV dish are parabolic. They concentrate sun rays and TV signals 
onto a point— a heat cell or a receiver collects them at the focus. The 1982 UMAP 
Journal explains how radar and sonar use the same idea. Car headlights turn the 
idea around, and send the light outward. 

Here is a classical fact about parabolas. From each point on the curve, the distance 
to the focus equals the distance to the “directrix" The directrix is the line y~ — p 
below the vertex (so the vertex is halfway between focus and directrix). With p = 
the distance down from any (x, y) is y + i. Match that with tbe distance to the focus 
a t {0, i) — this is the square root below. Out comes the special parabola y = x 2 : 

y + i - Jx 1 + (y - J) 2 > (square both sides) > y - x 2 . (2) 


The exercises give practice with all the steps we have taken — center the parabola to 
Y — aX 2 t rescale it to y = x 2 , locate the vertex and focus and directrix. 


Summary for other parabolas y = ax 2 + bx + c has its vertex where dyjdx is zero. 
Thus 2flx + b = 0 and x — - bjla . Shifting across to that point is “completing the 
square”: 


ax 2 + bx + c equals 



( 3 ) 


Here C = c - {b 2 jAa) is the height of the vertex. The centering transform X = x + (fc/2a), 
Y= y — C produces Y = aX 2 . It moves the vertex to (0, 0), where it belongs. 

For the ellipse and hyperbola, our plan of attack is the same: 


1. Center the curve to remove any linear terms Dx and Ey . 

2 . Locate each focus and discover the reflection property. 

3. Rotate to remove Bxy if the equation contains it. 


x 2 y 2 

ELLIPSES — 7 + — = A 
a 2 br 


(CIRCLES HAVE a = b) 


This equation makes the ellipse symmetric about (0, 0) — the center. Changing x to 
“X or y to -y leaves the same equation. No extra centering or rotation is needed. 

The equation also shows that x 2 ja 1 and y 2 jb 2 cannot exceed one, (They add to 
one and can’t be negative.) Therefore x 2 ^ a \ and x stays between — a and a. Similarly 
y stays between b and — b . The ellipse is inside a rectangle. 

By solving for y we get a function (or two functions!) of x: 





— x 


2 


The graphs are the top half (+) and bottom half {— ) of the ellipse. To draw the ellipse, 
plot them together. They meet when y = 0, at x — a on the far right of Figure 3.17 
and at x = - a on the far left. The maximum y = b and minimum y=—b are at the 
top and bottom of the ellipse, where we bump into the enclosing rectangle. 

A circle is a special case of an ellipse , when a~b. The circle equation x z + y 2 — r 2 
is the ellipse equation with a—b — r. This circle is centered at (0, 0); other circles are 



124 


3 Application! of the Derivative 


centered at x = h t y = k* The circle is determined by its radius r and its center ( h , It): 

Equation of circle : (x — h) 2 + (y - k) 2 = r 2 * (4) 

In words, the distance from (x, y) on the circle to (h, k) at the center is r* The 
equation has linear terms — 2hx and — 21 'ey — they disappear when the center is (0, 0). 


EXAMPLE i Find the circle that has a diameter from (1, 7) to (5, 7). 

Solution The center is halfway at (3, 7)* So r = 2 and (x - 3) 2 + (y — 7) 2 = 2 2 . 

EXAMPLE 2 Find the center and radius of the circle x 2 — 6x -I- y 2 — 14y = - 54* 

Solution Complete x 1 - 6x to the square ( x — 3) 2 by adding 9* Complete y 2 - 14 y 
to (y — 7) 2 by adding 49. Adding 9 and 49 to both sides of the equation leaves 
(x - 3) 2 + (y — l ) 2 = 4 — the same circle as in Example 1. 

Quicker Solution Match the given equation with (4). Then h = 3, k = 7, and r = 2: 

x 2 - 6 x + y 2 — 14 y = — 54 must agree with x 2 - 2 hx + h 1 + y 2 — 2 Icy + k 2 = r 2 * 

The change to X = x — h and Y = y — k moves the center of the circle from (h, It) 
to (0, 0). This is equally true for an ellipse: 

n . eUip , e , , blcomel *! + £_,. 

a b a b 

When we rescale by x = X/a and y— Yjb, we get the unit circle x 2 + y 1 = 1* 

The unit circle has area n. The ellipse has area nab (proved later in the book). The 
distance around the circle is 2ft. The distance around an ellipse does not rescale— it 
has no simple formula. 



Hg. 3*17 Uncentered circle* Centered ellipse x 2 /3 2 + y 2 /2 3 = 1. The distance from center to 
far right is also a = 3. All rays from F 2 reflect to * 


Now we leave circles and concentrate on ellipses* They have two foci (pronounced 
fo-sigh). For a parabola, the second focus is at infinity. For a circle, both foci are at 
the center* The foci of an ellipse are on its longer axis (its major axis), one focus on 
each side of the center: 

F x is at x = c = yja 2 - b 2 and F 2 is at x = - c* 

The right triangle in Figure 3*17 has sides a, c. From the top of the ellipse, the 
distance to each focus is a. From the endpoint at x = a, the distances to the foci are 
a + c and a — c. Adding (a + c) + (a — c ) gives 2a. As you go around the ellipse, the 
distance to plus the distance to F 2 is constant (always 2a)* 
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3H At all points on the ellipse, the sum of distances from the foci is 2a. This 
is another equation for the ellipse: 

from Fj and F 2 to (x, y): y/(x ~ c) 2 -I- y 2 + y/(x + c) 2 + y 2 = 2a. (5) 


To draw an ellipse, tie a string of length 2 a to the foci. Keep the string taut and your 
moving pencil will create the ellipse. This description uses a and c — the other form 
uses a and b (remember b 2 + c 2 = a 2 ). Problem 24 asks you to simplify equation (5) 
until you reach x 2 /a 2 + y 2 /b 2 = 1 . 

The “whispering gallery" of the United States Senate is an ellipse. If you stand at 
one focus and speak quietly, you can be heard at the other focus (and nowhere else). 
Your voice is reflected off the walls to the other focus — following the path of the 
string. For a parabola the rays come in to the focus from infinity — where the second 
focus is. 

A hospital uses this reflection property to split up kidney stones. The patient sits 
inside an ellipse with the kidney stone at one focus. At the other focus a lithotripter 
sends out hundreds of small shocks. You get a spinal anesthetic (I mean the patient) 
and the stones break into tiny pieces. 

The most important focus is the Sun. The ellipse is the orbit of the Earth. See 
Section 12.4 for a terrible printing mistake by the Royal Mint, on England's last 
pound note. They put the Sun at the center. 

Question 1 Why do the whispers (and shock waves) arrive together at the second 
focus? 

Answer Whichever way they go, the distance is 2a. Exception: straight path is 2c. 

Question 2 Locate the ellipse with equation 4x 2 + 9 y 2 = 36. 

Answer Divide by 36 to change the constant to 1. Now identify a and b: 

x 2 y2 

— + = 1 so a = v 9 and b = v /4. Foci at + ^9 - 4 = ± y/5. 

Question 3 Shift the center of that ellipse across and down to x = 1, y = - 5. 
Answer Change x to x-1. Change y to y + 5. The equation becomes 
(x — l) 2 /9 + (y + 5) 2 /4 = 1. In practice we start with this uncentered ellipse and go the 
other way to center it. 

y 2 x 2 

HYPERBOLAS - -= = 1 
a 2 b 2 


Notice the minus sign for a hyperbola. That makes all the difference. Unlike an ellipse, 
x and v can both be large. The curve goes out to infinity. It is still symmetric, since 
x can change to — x and y to —y. 

The center is at (0, 0). Solving for y again yields two functions (+ and -): 



gives 




or y = ± - yjb 2 4- x 2 . 


( 6 ) 


The hyperbola has two branches that never meet. The upper branch, with a plus sign, 
has y^a. The vertex V x is at x = 0, y = a — the lowest point on the branch. Much 
further out, when x is large, the hyperbola climbs up beside its sloping asymptotes. 

if = 1000 then ^ 7 = 1001. So - is close to - or - 
b z a z abb 
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3 Applications of the Derivative 




R9. ZAZ The hyperbola {y 2 —^x 2 = 1 has a- 2, b- 3, c = y/4 + 9, The distances to and 
F 2 differ by 2a = 4, 


The asymptotes are the lines yja = xjb and yja = - xjb. Their slopes are ajb and - ajb , 
You can't miss them in Figure 3,18. 

For a hyperbola, the foci are ins ide the two branches. Their distance from the 
center is still called c, But now c = > /a 2 + i> 2 , which is larger than a and b. The vertex 
is a distance c - a from one focus and c + a from the other. The difference (not the 
sum) is (c + a) - (c — a) = 2a, 

All points on the hyperbola have this property: The difference between distances to 
the foci is constantly 2a. A ray coming in to one focus is reflected toward the other. 
The reflection is on the outside of the hyperbola, and the inside of the ellipse. 

Here is an application to navigation. Radio signals leave two fixed transmitters at 
the same time, A ship receives the signals a millisecond apart. Where is the ship? 
Answer: It is on a hyperbola with foci at the transmitters. Radio signals travel 
186 miles in a millisecond, so 186 = 2a, This determines the curve. In Long Range 
Navigation (LORAN) a third transmitter gives another hyperbola. Then the ship 
is located exactly. 

Question 4 How do hyperbolas differ from parabolas, far from the center? 
Answer Hyperbolas have asymptotes. Parabolas don't. 

The hyperbola has a natural rescaling. The appearance of xjb is a signal to change 
to X, Similarly yja becomes Y. Then Y= 1 at the vertex, and we have a standard 
hyperbola: 

y 2 ja 2 - x 2 jb 2 = 1 becomes Y 2 - X 1 ~ 1, 

A 90° turn gives X 2 - Y 2 — 1 — the hyperbola opens to the sides. A 45° turn produces 
2XY-\. We show below how to recognize x 2 + xy + y 2 = I as an ellipse and 
x 2 + 3 xy + y 2 = 1 as a hyperbola. (They are not circles because of the xy term.) When 
the xy coefficient increases past 2, x 1 + y 2 no longer indicates an ellipse. 

Question 5 Locate the hyperbola with equation 9y 2 - 4x 2 = 36, 

Answer Divide by 36. Then y 2 j 4 - x 2 j9 = 1. Recognize a — yjA and b = v /9. 

Question 6 Locate the uncentered hyperbola 9 y 2 - 1 8y - Ax 1 - Ax = 28, 

Answer Complete 9y 2 -18y to 9(y-l ) 2 by adding 9. Complete 4* 2 + 4x to 
A(x + j) 2 by adding A(j) 2 - 1, The equation is rewritten as 9 (y— J) 2 - A{x + £) 2 - 
28 + 9-1. This is the hyperbola in Question 5 — except its center is (— j, 1). 
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To summarize: Find the center by completing squares. Then read off a and ft. 


THE GENERAL EQUATION Ax 2 4 Bxy 4 Cy 2 4 Dx + Ey 4 F = 0 

This equation is of second degree, containing any and all of 1, x, y t x 2 , xy, y 2 . 
A plane is cutting through a cone- Is the curve a parabola or ellipse or hyperbola! 
Start with the most important case Ax 2 4 Bxy + Cy 2 — 1. 


31 The equation Ax 2 + Bxy 4 Cy 2 = 1 produces a hyperbola if B 2 > 4 AC and 
an ellipse if B 2 < 4 AC. A parabola has B 2 = 4.4C. 


To recognize the curve, we remove Bxy by rotating the plane. This also changes A 
and C — but the combination B 2 - 4 AC is not changed (proof omitted). An example 
is 2 xy=L with B 2 = 4. It rotates to y 2 - x 2 = 1, with ~4,4C = 4. That positive 
number 4 signals a hyperbola — since A — — 1 and C = l have opposite signs. 

Another example is x 2 + y 2 - L It is a circle (a special ellipse). However we rotate, 
the equation stays the same. The combination B 2 - 4.4C = 0 - 4 * 1 • Ms negative, as 
predicted for ellipses. 

To rotate by an angle a, change x and y to new variables x' and /: 

x = x' cos % — y sin a x r = x cos a 4 y sin at 

and (7) 

y = x sin s 4 y cos a y = — y sin a 4 x cos at. 

Substituting for x and y changes Ax 2 4 Bxy 4 Cy 2 = 1 to A'x' 2 4 B'x'y' + C f y /Z = 1, 
The formulas for A\ B\ C' are painful so I go to the key point: 

fl' is zero if the rotation angle a has tan let = B/(A — C). 

With B' = 0, the curve is easily recognized from A'x' 2 4 C'y' 2 = 1* It is a hyperbola 
if A 1 and C have opposite signs. Then B 2 - 4 A'C is positive. The original B 2 - 4AC 
was also positive, because this special combination stays constant during rotation. 

After the xy term is gone, we deal with x and y — by centering. To find the center, 
complete squares as in Questions 3 and 6. For total perfection, rescale to one of the 
model equations y = x 2 or x 2 4 y 2 — 1 or y 2 — x 2 = 1, 

The remaining question is about F = 0, What is the graph of Ax 2 4 Bxy 4- Cy 2 = 0? 
The ellipse-hyperbola-parabola have disappeared. But if the Greeks were right, the 
cone is still cut by a plane. The degenerate case F = 0 occurs when the plane cuts 

right through the sharp point of the cone. 

A level cut hits only that one point (0, 0). The equation shrinks to x 1 4 y 2 — 0, a 
circle with radius zero. A steep cut gives two lines. The hyperbola becomes y 2 — x 2 = 0, 
leaving only its asymptotes y = 4 x, A cut at the exact angle of the cone gives only 
one line, as in x 2 = 0. A single point , two lines, and one line are very extreme cases of 
an ellipse, hyperbola, and parabola. 

All these "conic sections” come from planes and cones. The beauty of the geometry, 
which Archimedes saw, is matched by the importance of the equations. Galileo dis- 
covered that projectiles go along parabolas (Chapter 12). Kepler discovered that the 
Earth travels on an ellipse (also Chapter 12). Finally Einstein discovered that light 
travels on hyperbolas. That is in four dimensions, and not in Chapter 12. 
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3 Applications of the Derivative 


equation 

P y = ax 2 + bx + c 


E 



= 1, a > b 


H 



vertices 



(a, 0) and (— a, 0) 


foci 


— above vertex, also infinity 
4 a 


(c, 0) and (— c, 0): c — ^ ja 2 — b 2 


(0, a) and (0, - a) (0, c) and (0, —c)\ c = y ja 2 + b 2 


3.5 EXERCISES 


Read-through questions 

The graph of y = x 2 + 2x + 5 is a ° . Its lowest point 

(the vertex) is (x, y ) = f b ), Centering by X = x + 1 and 

c moves the vertex to (0, 0). The equation becomes 
Y = d . The focus of this centered parabola is * . All 

rays coming straight down are f to the focus. 

The graph of x 2 -h 4 y 2 = 16 is an O . Dividing by h 

leaves x 2 /d 2 -j- y 2 jb 2 = 1 with a — l and b = J The 

graph lies in the rectangle whose sides are k The area is 
nab = l The foci are at x = ±c— m . The sum of 
distances from the foci to a point on this ellipse is always 
" If we rescale to X = x/4 and Y = yf 2 the equation 
becomes o and the graph becomes a p 

The graph of y 2 — x 2 = 9 is a Q . Dividing by 9 leaves 

y 2 ja 2 — x 2 /b 2 = 1 with a = * and b = * On the 

upper branch y ^ * The asymptotes are the lines u 

The foci are at y = ± c = v . The * of distances from 
the foci to a point on this hyperbola is * 

All these curves are conic sections— the intersection of a 
y and a x A steep cutting angle yields a * , At 

the borderline angle we get a P . The general equation is 
j4x 2 + c + F = 0. If D = E = 0 the center of the graph is 
at p The equation Ax 2 + Bxy -h Cy 2 = 1 gives an ellipse 
when E t The graph of Ax 2 + 5xy + 6y 2 = 1 is a F 

1 The vertex of y = ax 2 + bx + c is at x = — bj2a . What is 

special about this x? Show that it gives y = c — {b 2 jAa) t 

2 The parabola y = 3x 2 — 1 2x has x mi * — . At this 

minim um, 3x 2 is as large as 12x. Introducing 

X = x — 2 and Y = y + 12 centers the equation to . 

Draw the curves 3-14 by hand or calculator or computer. 
Locate the vertices and foci. 

3 y = x 2 — 2x — 3 4p(r-l) 2 

5 4y = — x 2 6 4_x = y 2 

7 (x - l) 2 + (y - l) 2 = I 8 x 2 + 9/ = 9 

9 9x 2 + y 2 = 9 10 x 2 /4 — (y— 1> 2 = 1 


11 y 2 — Ax 2 = 1 12 (y — l) 2 — 4x 2 = 1 

13 y 2 — x 2 = 0 14 xy = 0 

Problems 15-20 are about parabolas, 21-34 are about ellipses, 
35-41 are about hyperbolas. 

15 Find the parabola y = ax 2 + bx + c that goes through 
(0,0) and (1, l)and (2, 12). 

16 y = x 2 — x has vertex at . To move the vertex to 

(0, 0) set X = and Y = . Then Y = X 2 . 

17 (a) In equation (2) change i to p. Square and simplify, 

(b) Locate the focus and directrix of Y = 3X 2 . Which 
points are a distance 1 from the directrix and focus? 

18 The parabola y = 9~-x 2 opens __ with vertex at 

♦ Centering by Y = y — 9 yields Y = — x 2 . 

19 Find equations for all parabolas which 

(a) open to the right with vertex at (0, 0) 

(b) open upwards with focus at (0, 0) 

(c) open downwards and go through {0, 0) and (1,0). 

20 A projectile is at x = r, y = t — t 2 at time L Find dxjdt and 
dyjdt at the start, the maximum height, and an xy equation 
for the path. 

21 Find the equation of the ellipse with extreme points at 
(±2, 0) and (0, +1). Then shift the center to (l, 1) and find the 
new equation. 

22 On t he ellip se x 2 /a 2 + y 2 jb 2 = 1, solve for y when 
x = c = A ja 2 — b 2 . This height above the focus will be valuable 
in proving Kepler's third law. 

23 Find equations for the ellipses with these properties: 

(a) through (5, 0) with foci at (±4, 0) 

(b) with sum of distances to (l s 1) and (5, 1) equal to 12 

(c) with both foci at (0, 0) and sum of distances = 
2a = 10. 

24 Move a square root to the right side of equation (5) and 
square both sides. Then isolate the remaining square root and 
square again. Simplify to reach the equation of an ellipse. 
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25 Decide between circle-ellipse-parabola-hyperbola, based 
on the X Y equation with X = x — 1 and Y = y + 3. 

(a) x 2 — 2x + y 2 + 6 y - 6 

(b) x 2 - 2.x - y 2 - 6y = 6 

(c) x 2 — 2.x + 2y 2 -I- 12y = 6 

(d) v 2 — 2x - y = 6. 

26 A tilted cylinder has equation (x - 2y — 2z) z + 
(y - 2.x - 2r) 2 = 1. Show that the water surface at z = 0 is an 
ellipse. What is its equation and what is B 2 - 4/1C? 

27 (4, 9/5) is above the focus on the ellipse x 2 /25 + > ,2 /9 = 1. 
Find dyidx at that point and the equation of the tangent line. 

28 (a) Check that the line -xx 0 + y>\) = r 2 is tangent to the 
circle x 2 + y 2 = r 2 at (x 0 , y i} ). 

(b) For the ellipse x 2 ja 2 + y 2 jb 2 = 1 show that the tangent 
equation is xx u ja 2 4- /ft 2 = 1- (Check the slope.) 



29 The slope of the normal line in Figure A is s = — 1 /(slope 

of tangent) = . The slope of the line from F 2 is 

S = . By the reflection property, 

5 = cot 20 = - (cot 0 tan (?) = ^ — - 

Test your numbers s and S against this equation. 

30 Figure B proves the reflecting property of an ellipse. 
R is the mirror image of F, in the tangent line; Q is any other 
point on the line. Deduce steps 2, 3, 4 from 1, 2, 3: 

1. PF t -f- PF 2 < QF , + QF 2 [left side = 2a, Q is outside) 

2. PR + PF 2 <QR + QF 2 ' 

3. P is on the straight line from f\ to R 

4. y. = fi: the reflecting property is proved. 

31 The ellipse [x - 3) 2 /4 -f (y — l) 2 /4 = I is really a 

with center at and radius . Choose X and 

Y to produce X 2 A- Y 2 = l. 

32 Compute the area of a square that just fits inside the 
ellipse x 2 :a 2 + y 2 /b 2 = 1. 


33 Rotate the axes of x 2 + xy + y 2 = l by using equation (7) 
with sin x = cos x = The xV equation should show an 
ellipse, 

34 What are a , b , c for the Earth's orbit around the sun? 

35 Find an equation for the hyperbola with 

(a) vertices (0, ;r 1 }, foci (0, ±2) 

(b) vertices (0, ±3), asymptotes y = ±2x 

(e) (2, 3) on the curve, asymptotes y = ± x 

36 Find the slope of > ,2 -x 2 =l at (x o ,v 0 )- Show that 
yy 0 — xx 0 = l goes through this point with the right slope (it 
has to be the tangent line), 

37 [f the distances from (x, y) to (8, 0) and (—8, 0) differ by 
10, what hyperbola contains (x, y)2 

38 If a cannon was heard by Napoleon and one second later 

by the Duke of Wellington, the cannon was somewhere on a 
with foci at , 

39 y 2 — 4 y is part of (y — 2) 2 = and 2x 2 + 12x 

is part of 2(x + 3) 2 = . Therefore y 2 - 4 y — 

2x l — 12x = 0 gives the hyperbola {y - 2) 2 — 2(x 4- 3) 2 = 

. Its center is and it opens to the 

40 Following Problem 39 turn y 2 + 2y = x 2 +10x into 
Y 2 = X 2 4- C with X , n and C equal to 

41 Draw' the hyperbola x 2 — 4y 2 =l and find its foci and 
asymptotes. 

Problems 42-46 are about second -degree curves (conics). 

42 For which A, C, F does Ax 2 + Cy 2 4- F = 0 have no solu- 
tion (empty graph)? 

43 Show that x 2 -I- 2 at -I- y 2 4- 2x + 2y + 1 = 0 is the equation 
(squared) of a single line. 

44 Given any points in the plane, a second-degree 

curve Ax 2 4 • + F = 0 goes through those points. 

45 (a) When the plane r = ax + by 4- < meets the cone 
z 2 = x 2 4- y 2 , eliminate z by squaring the plane equation. 
Re w rite in the form Ax 2 + Bxy + Cy 1 - Dx + Ey 4- F = 0. 

(b) Compute B 2 A AC in terms of a and b. 

(c) Show that the plane meets the cone in an ellipse if 
a 2 4- b 2 < l and a hyperbola if a 2 + b 2 > 1 (steeper)* 

46 The roots of ax 2 + bx 4- c = 0 also involve the special com- 
bination b 2 - Aac, This quadratic equation has two real roots 

if and no real roots if . The roots come 

together when b 2 = 4ac, which is the borderline case like a 
parabola. 



3 Applications of the Derivative 


3.6 Iterations x n+ 1 - F(x n ) 

Iteration means repeating the same function. Suppose the function is F(x) = cos x. 
Choose any starting value, say x 0 = L Take its cosine: = cos x 0 = .54. Then take 
the cosine of x x . That produces x 2 = cos .54 = ,86. The iteration is x n + l = cos x„. I 
am in radian mode on a calculator, pressing "cos” each time. The early numbers are 
not important, what is important is the output after 12 or 30 or 100 steps: 

EXAMPLE A x l3 = .75 t = .73, x u = .74, x 29 = .7391, * 30 = /7391. 

The goal is to explain why the x’s approach x* = .739085 Every starting value 

x 0 leads to this same number x*. What is special about .7391? 

Note on iterations Do x x = cos x 0 and x 2 — cos x x mean that x 2 = cos 3 x 0 ? Abso- 
lutely not! Iteration creates a new and different function cos (cos x). It uses the cos 
button, not the squaring button. The third step creates F(F(F(x))) + As soon as you 
can, iterate with x n + l = j cos x fl . What limit do the x's approach? Is it ^(.7931)? 

Let me slow down to understand these questions. The central idea is expressed by 
the equation x n+1 = F{xJ. Substituting x Q into F gives ^ . This output Xi is the input 
that leads to x 2 * In its turn, x 2 is the input and out comes x 3 = F(x 2 ). This is iteration, 
and it produces the sequence x 0 , x M x 2 , .... 

The x’s may approach a limit x*, depending on the function F. Sometimes x* also 
depends on the starting value x 0 , Sometimes there is no limit. Look at a second 
example, which does not need a calculator. 

EXAMPLE 2 x n +i = F(x n ) = i* n + 4. Starting from x 0 = 0 the sequence is 

*i= 2*0 + 4 = 4, x 2 = j*4 + 4 = 6, x 3 = i*6 + 4=7, -x 4 = i - 7 + 4 = 7i .... 

Those numbers 0, 4, 6, 7, 7j, ... seem to be approaching x* = 8. A computer would 
convince us. So will mathematics, when we see what is special about 8: 

When the x’s approach x*, the limit of x fl + 1 = jx n + 4 
is x* = ^x*+4. This limiting equation yields x* = 8. 

8 is the "steady state” where input equals output: 8 = F(8), it is the fixed point. 

If we start at x 0 = 8, the sequence is 8, 8, 8 When we start at x 0 = 12, the 

sequence goes back toward 8: 

x { = 12 + 4= 10, x 2 = i' 10 + 4 = 9, x 3 = i-9 + 4 = 8.5, .... 

Equation for limit : // the iterations x n + x = F(xJ converge to x*, then x* = F(x*). 

To repeat: 8 is special because it equals \ * 8 + 4. The number .7391 . , . is special because 
it equals cos .739L > . . The graphs of y = x and y = F(x) intersect at x*. To explain why 
the x’s converge (or why they don't) is the job of calculus. 

EXAMPLE 3 x n _ x = x 3 has two fixed points: 0 = 0 2 and 1 = I 3 . Here F(x) = x 3 . 

Starting from x 0 = j the sequence {,te, jh, ■■■ goes quickly to x* = 0. The only 
approaches to x* = 1 are from x 0 = 1 (of course) and from x 0 = — 1. Starting from 
x 0 = 2 we get 4, 16, 256, ... and the sequence diverges to + oc. 

Each limit x* has a 'basin of attraction." The basin contains all starting points x 0 
that lead to x*. For Examples 1 and 2, every x 0 led to .7391 and 8. The basins were 
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the whole line (that is still to be proved). Example 3 had three basins — the interval 
— 1 < x 0 < 1, the two points x 0 = ± 1, and all the rest. The outer basin |x 0 | > 1 led 
to + oo, 1 challenge you to find the limits and the basins of attraction (by calculator) 
for F(x) = x - tan x. 

In Example 3, x* = 0 is attracting. Points near x * move toward x *. The fixed point 
x* = 1 is repelling . Points near 1 move away. We now find the rule that decides 
whether x* is attracting or repelling. The key is the slope dfjdx at x*. 


3J Start from any x 0 near a fixed point x * — F(x*)\ 

x * is attracting if\dF/dx\ is below 1 at x* 
x* is repelling if \dF/dx\ is above 1 at x*. 


First I will give a calculus proof. Then comes a picture of convergence, by “cobwebs." 
Both methods throw light on this crucial test for attraction: \dF(dx \ < 1. 

First proof: Subtract x* = F(x*) from x n+L = F(xJ. The difference x n + l - x* is 
the same as F(x n ) - F(x *). This is A F. The basic idea of calculus is that A F is close 
to F'Ax: 

x n + , - x* = F(x n ) - F(x* ) * F' (x* )(x„ - x* ). ( 1 ) 

The ‘‘error’ 1 x n — x* is multiplied by the slope dfjdx. The next error x n+1 — x* is 
smaller or larger, based on |F'[ < 1 or |F'| > 1 at x*. Every step multiplies approxi- 
mately by F'(x*). Its size controls the speed of convergence. 

In Example 1, F(x) is cos x and F'(x) is -sinx. There is attraction to *7391 
because |sin x*| < 1. In Example 2, F is jx + 4 and F' is j. There is attraction to 8. 
In Example 3, F is x 2 and F' is 2x. There is superattraction to x* = 0 (where F* = 0). 
There is repulsion from x* = 1 (where F f = 2). 

I admit one major difficulty. The approximation in equation (1) only holds near 
x*. If x 0 is far away, does the sequence still approach x*? When there are several 
attracting points, which x* do we reach? This section starts with good iterations, 
w'hieh solve the equation x* = F(x*) or /(x) = 0. At the end we discover Newton's 
method , The next section produces crazy but wonderful iterations, not converging 
and not blowing up. They lead to “ fractals ' 1 and " Cantor sets " and “chaos” 

The mathematics of iterations is not finished. It may never be finished, but we are 
converging on the answers. Please choose a function and join in. 

THE GRAPH OF AN ITERATION: COBWEBS 

The iteration x n + 1 = F(xJ involves two graphs at the same time. One is the graph 
of y = F(x). The other is the graph of y = x (the 45"' line). The iteration jumps back 
and forth between these graphs. It is a very convenient way to see the whole process. 

Example 1 wasx n+1 = cosx H . Figure 3.19 shows the graph of cos x and the “cob- 
web” Starting at (x 0 , x 0 ) on the 45" line, the rule is based on — F(x 0 ): 

From (x 0 , x 0 ) go up or down to (x 0 , x t ) on the curve. 

From (x 0 , x } ) go across to (Xi , x x ) on the 45° line. 

These steps are repeated forever. From x x go up to the curve at F(x t ), That height 
is x 2 . Now cross to the 45 fJ line at (x 2 ,x 2 ), The iterations are aiming for (x*, x*} = 
(.7391, .7391), This is the crossing point of the two graphs y = F(x) and y = x. 
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Hg.3.19 Cobwebs go from (Xo, x 0 ) to (x 0 , x t ) to (x 1 , x t ) — line to curve to line. 


Example 2 was x n + 1 = \x n + 4. Both graphs are straight lines. The cobweb is one- 
sided, from (0, 0) to (0, 4) to (4, 4} to (4, 6) to (6, 6). Notice how y changes (vertical 
line) and then x changes (horizontal line). The slope of F(x) is so the distance to 8 
is multiplied by \ at every step. 

Example 3 was x B+1 = x 2 . The graph of y= x 2 crosses the 45° line at two fixed 
points: 0 2 = 0 and l 2 = 1. Figure 3.20a starts the iteration close to 1, but it quickly 
goes away. This fixed point is repelling because F'(l) = 2. Distance from x* = 1 is 
doubled (at the start). One path moves down to x* = 0 — which is superattractive 
because F = 0. The path from x 0 > 1 diverges to infinity. 

EXAMPLE 4 F(x) has two attracting points x* (a repelling x* is always between). 

Figure 3.20b shows two crossings with slope zero. The iterations and cobwebs con- 
verge quickly, In between, the graph of F(x) must cross the 45° line from below. That 
requires a slope greater than one. Cobwebs diverge from this unstable point, which 
separates the basins of attraction. The fixed point x = n is in a basin by itself! 

Note 1 To draw cobwebs on a calculator, graph y = F(x) on top of y= x. On a 
Casio, one way is to plot (x 0 ,x 0 ) and give the command LINE: PLOT X,Y 
followed by EXE. Now move the cursor vertically to y = F(x) and press EXE. Then 
move horizontally to y = x and press EXE. Continue. Each step draws a line. 




Rg. 3.20 Converging and diverging cobwebs: F(x) = x 2 and F(x) = x - sin x. 
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For the TI-81 (and also the Casio) a short program produces a cobweb. Store F(x) 
in the Y= function slot Y *| , Set the range (square window or autoscaling). Run the 
program and answer the prompt with x 0 : 

PrgmC : COBWEB :Disp "INITIAL X0 M : Input X : AL L-Of f 
:Yl-On :"X M -Y4 :LbL 1 :X-*S :Yi^T : L i ne ( S , S , S , T) 

: L i n e ( S , T f T , T) :T->X :Pause :Goto1 

Note 2 The x’s approach x* from one side when 0 < dF/dx < 1. 

Note 3 A basin of attraction can include faraway x 0 *s (basins can come in infinitely 
many pieces). This makes the problem interesting. If no fixed points are attracting, 
see Section 3,7 for “cycles” and “chaos,” 

THE ITERATION x n+1 = x n - 

At this point we offer the reader a choice. One possibility is to jump ahead to the 
next section on “Newton's Method,” That method is an iteration to solve /(x) = 0, 
The function F(x) combines x n and /(xj and /'(xj into an optimal formula for x n+ 1 , 
We will see how quickly Newton's method works (when it works). It is the outstanding 
algorithm to solve equations, and it is totally built on tangent approximations. 

The other possibility is to understand (through calculus) a whole family of itera- 
tions- This family depends on a number c t which is at our disposal. The best choice 
of c produces Newton’s method , I emphasize that iteration is by no means a new 
and peculiar idea. It is a fundamental technique in scientific computing , 

We start by recognizing that there are many ways to reach /(x*) = 0. (I write x* 
for the solution.) A good algorithm may switch to Newton as it gets close. The 
iterations use /(x„) to decide on the next point x n + 1 : 

*n+ 1 = F(*«) cf(x J. (2) 

Notice how F( x) is constructed from f(x) — they are different! We move / to the right 
side and multiply by a “preconditioner” c. The choice of c (or c fl , if it changes from 
step to step) is absolutely critical , The starting guess x 0 is also important — but its 
accuracy is not always under our control. 

Suppose the x n converge to x*. Then the limit of equation (2) is 

x* — x* — cf(x*). (3) 

That gives f(x*) — 0. If the x n 7 s have a limit, it solves the right equation. It is a fixed 
point of F (we can assume c n c ^ 0 and /(x„) -► /(x')), There are two key questions, 
and both of them are answered by the slope F'(x*): 

1. How quickly does x n approach x* (or do the x n diverge)? 

2. What is a good choice of c (or c a ) ? 

EXAMPLE 5 f(x) = ax - b is zero at x * = bja , The iteration x n+i =x n - c(ax n — b) 
intends to find bja without actually dividing. (Early computers could not divide; they 
used iteration.) Subtracting x* from both sides leaves an equation for the error: 

*,.+ 1 - x* = x„ - x* - c(ax n - b ). 

Replace b by ax*. The right side is (1 - ca)(x fl — x*). This “error equation” is 


(error) n+1 = (l-ca)(error) n . 


(4) 
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3 Application* of the Derivative 


At every step the error is multiplied by (1 — ea) y which is F f . The error goes to zero if 
|F'| is less than 1* The absolute value |1 — ca\ decides everything: 

x n converges to x* if and only if — l<\—ea<\. (5) 

The perfect choice (if we knew it) is c = 1/a, which turns the multiplier 1 — ca into 
zero. Then one iteration gives the exact answer: x l =x 0 — (l/a)(ox 0 — i) = bja. That 
is the horizontal line in Figure 3.21a, converging in one step. But look at the other 
lines. 

This example did not need calculus. Linear equations never do. The key idea is 
that close to x* the nonlinear equation f[x) = 0 is nearly linear. We apply the tangent 
approximation. You are seeing how calculus is used, in a problem that doesn’t start 
by asking for a derivative. 


THE BEST CHOICE OF c 

The immediate goal is to study the errors x n — x *. They go quickly to zero, if 
the multiplier is small. To understand * fl + 1 = x H — cf(x n ), subtract the equation 
x* = x* - cf(x*): 

x„ +1 -x* = x n -x*- c(f{x n ) -/(**)). (6) 

Now calculus enters. When you see a difference of /’s think of dfjdx . Replace 
/{*.) -/(**) by A(x n — x*) s where A stands for the slope dfjdx at x *: 

*fl + i - X* a (I - Ci4)(x„- JC*). (7) 

This is the error equation. The new error at step n + 1 is approximately the old error 
multiplied by m = l — cA. This corresponds to m— 1 - ca in the linear example. We 
keep returning to the basic test |m| = |F'(jc*)| < 1: 


3K Starting near x* t the errors x K — jc* go to zero if the multiplier has \m\ < L 
The perfect choice is c = XjA ~ 1/ /'(**), Then m - l - cA ** 0, 


There is only one difficulty: We don't know x*. Therefore we don’t know the perfect 
c. It depends on the slope A = /'(**) at the unknown solution. However we can come 
close, by using the slope at x A : 

Choose c„ = 1 //'(*„). Then x„ +1 = x n - f(x n )lf'(x„) = F(x„). 

This is Newton's method , The multiplier m — 1 - cA is as near to zero as we can make 
it. By building dfjdx into F(x), Newton speeded up the convergence of the iteration. 



F(.v) 

\ - c{dx - h ) : good 
x --(a* -b ) : best 


-h ) : fail 
a 


Fix) 


F\ a-*) 



Hg, 3,21 The error multiplier is m — 1 — Newton has c = \ jf’{x n ) and m -*0. 



3.6 Iteration* x n+ , = F(x„) 

EXAMPLE 6 Solve /(x) = 2x - cos x = 0 with different iterations (different c’s). 

The line y = 2x crosses the cosine curve somewhere near x = 7. The intersection 
point where 2x* = cos x* has no simple formula. We start from x 0 = i and iterate 
x K+1 = x B - c(2x„ — cos x„) with three different choices of c. 

Take c- 1 or c = l//'(x 0 ) or update c by Newton’s rule c„ = l//'(x„): 


* o = ,50 

c = I 

c=l//'(*o) 


*1 = 

.38 

.45063 

.45062669 


.55 

.45019 

.45018365 

*3 = 

.30 

.45018 

.45018361... 


The column with c= 1 is diverging (repelled from x*). The second column shows 
convergence (attracted to x * ). The third column (Newton’s method) approaches x* 
so quickly that .4501836 and seven more digits are exact forx 3 . 

How does this convergence match the prediction? Note that /'(x) = 2 + sin x so 
A = 2.435. Look to see whether the actual errors x„ - x*, going down each column, 
are multiplied by the predicted m below that column: 



c = 1 

c — 1/(2 + sin 7) 

c„ =1/(2 + sin x„; 

x 0 ~ X* = 

0.05 

4.98 -10“ 2 

4.98 -10" 2 

X[ — X* = 

-0.07 

4.43 -10" 4 

4,43 -10" 4 

X 2 - X* = 

0.10 

7.88 -lO" 6 

3.63 -lO" 8 

x 3 - X* = 

-0.15 

1.41 -10" 7 

2.78- 10“ 16 

multiplier 

m = - 1.4 

m = .018 

m -* 0 (Newton) 


The first column shows a multiplier below - 1 , The errors grow at every step. Because 
m is negative the errors change sign — the cobweb goes outward. 

The second column shows convergence with m = .018. It takes one genuine Newton 
step, then c is fixed. After n steps the error is closely proportional to m" = (.018)" — 
that is "linear convergence'’ with a good multiplier. 

The third column shows the “ quadratic convergence ” of Newton’s method. 
Multiplying the error by m is more attractive than ever, because m -» 0. In fact m 
itself is proportional to the error, so at each step the error is squared. Problem 3.8.31 
wili show that (error) B+ 1 ^ A# (error) 2 . This squaring carries us from 10" 2 to 10" 4 to 
10“ 8 to “machine e” in three steps. The number of correct digits is doubled at every 
step as Newton converges. 

Note 1 The choice c= 1 produces x„ + 1 = x„-/{x (t ). This is “successive substitu- 
tion.” The equation fix) = 0 is rewritten as x = x -/(x), and each x„ is substituted 
back to produce x„+i . Iteration with c = 1 does not always fail! 

Note 2 Newton’s method is successive substitution for ///', not f. Then maO. 

Note 3 Edwards and Penney happened to choose the same example 2x = cos x. But 
they cleverly wrote it as x B + 1 = j cos x„, which has |F'| = |£sin xj < 1. This iteration 
fits into our family with c = j, and it succeeds. We asked earlier if its limit is j(.7391). 
No, it is x* = .450.... 
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3 Application! of the Derivative 


Note 4 The choice c = l//'(x 0 ) is “modified Newton” After one step of Newton's 
method, c is fixed* The steps are quicker, because they don't require a new f’(x B ), 
But we need more steps* Millions of dollars are spent on Newton’s method, so speed 
is important* In all its forms, f(x) — 0 is the central problem of computing* 


3.6 EXERCISES 


Read-through questions 

x„ +1 = x* describes h an q . After one step x t — b t 
After two steps x 2 = F(x x ) = c - If it happens that input = 
output, or x * = d , then x* is a e point* F = x* has 
* fixed points, at x* = g , Starting near a fixed point, 
the x n will converge to it if ti < 1. That is because 

x a +i — x* = F(x„) — Ffx*)sfr I The point is called 

I The x„ are repelled if k * For F = x 3 the fixed 

points have F f = I . The cobweb goes from (x 0 , x Q ) to 

( , ) to ( , ) and converges to fx* x*t = m , This 

is an intersection of y = x 3 and v = n . and it is super- 

attracting because o * 

/(x) = 0 can be solved iteratively by x n + 1 = x n — cf(x n \ in 
which case F'(x*) = p * Subtracting x* = x* — c/(x*}, the 
error equation is x n + 1 — x* as mi o )* The multiplier is 
m = r . The errors approach zero if » , The choice 

= > produces Newton’s method. The choice c= 1 is 

“successive u ” and c — v is modified Newton. Con- 
vergence to x * is * certain. 

We have three ways to study iterations x H+l — F(xJ: 
(1) compute x 1( from difTerent x 0 (2) find the fixed 

points x + and test \dFjdx \ < 1 (3) draw cobwebs. 

In Problems 1-8 start from x 0 = .6 and x 0 = 2. Compute 



x 2l .. 

. to test convergence: 




1 


= * 2 -t 

n 2 

2 

*«+i 

= 2je»(1 -x„) 

3 

JC-.+ t 


4 


- 1 Iy/X H 

5 


= 3x,(l -jc„) 

6 


= xl + x„ - 2 

7 

Aff + 1 

= - 1 

8 

■^+i 

= KI 


9 Check dFjdx at all fixed points in Problems 1-6* Are they 
attracting or repelling? 

10 From x 0 = — 1 compute the sequence x„+ ] = — x„. Draw 

the cobweb with its “cycle,” Two steps produce x n+2 = x^ 1 
which has the fixed points _* 

11 Draw the cobwebs for x n + l =?x n - 1 and x n+1 = 1 — ?x n 

starting from x 0 = 2. Rule: Cobwebs are two-sided when 
dFjdx is , 

12 Draw the cobweb for x n+1 =x* — l starting from the 

periodic point x o = 0* Another periodic point is 

Start nearby at x 0 = T to see if the iterations are 
attracted to 0, -1,0, — 1, 


Solve equations 13-16 within 1% by iteration. 

13 x = cosyx 14 x=*cos 2 x 

15 x = cos yfx 16 x = 2x- 1 (V) 

17 For which numbers a does x n+ j = a(x H — x;J) converge to 
x* = 0? 

18 For which numbers a does x B+ , — a{x n — xj) converge to 
x* = (a— l)/a? 

19 Iterate x n+1 = 4(x„ — xj) to see chaos. Why don't the x n 
approach x* = J? 

20 One fixed point of F(x) = x 2 — { is attracting, the other is 
repelling. By experiment or cobwebs, find the basin of x 0 *s 
that go to the attractor* 

21 (important) Find the fixed point for F(x) = ax + s* When 
ts it attracting? 

22 What happens in the linear case x n + 1 — ax* + 4 when 
a = 1 and when a — — 1? 

23 Starting with S1000, you spend half your money each year 
and a rich but foolish aunt gives you a new $1000. What is 
your steady state balance x*? What is x* if you start with a 
million dollars? 

24 The US national debt was once $1 trillion. Inflation 
reduces its real value by 5% each year (so multiply by 
a^* 95), but overspending adds another $100 billion. What 
is the steady state debt x*? 

25x n41 — bjxp has the fixed point x* = y/b. Show that 
|dF/dx| = l at that point — what is the sequence starting 
from x 0 ? 

26 Show that both fixed points of x H + , = xl + x* — 3 are 
repelling. What do the iterations do? 

27 A 55 calculator ta kes s quare roots but not cube roots. 
Explain why x n + 1 = ■ x /2/x H converges to ^2* 

28 Start the cobwebs for x„ +x = sinx n and x n+1 =tanx n . In 
both cases dFjdx = 1 at x* = 0* (a) Do the iterations converge? 
(b) Propose a theory based on F" for cases when F 1 = 1. 

Solve f(x) = 0 in 29-32 by the iteration x r+ 1 = x* — c/(xj, to 
find a c that succeeds and a c that fails. 

29 /(x) — x 2 — 4 30 f(x) = x 2 — 4x -h 3 

31 /(x)-(x-2)*-l 32 /(x) = (l-x)- J -3 
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33 Newton’s method computes a new c= t/f'(x n ) at each 
step. Write out the iteration formulas for f(x) = x* — 2 = 0 
and f(x) = sin x — i = 0, 

34 Apply Problem 33 to find the first six decimals of 
and rc/6. 

35 By experiment find each x* and its basin of attraction* 
when Newton’s method is applied to f(x) = x 1 — 5x + 4* 

36 Test Newton’s method on x 2 — 1 — 0, starting far out at 
Xo = 10 6 * At first the error is reduced by about m = 5. Near 
x* = 1 the multiplier approaches m = 0. 

37 Find the multiplier m at each fixed point of x n+1 = 
x„ — c(x 2 — x„)* Predict the convergence for different c (to 
which x*?). 

38 Make a table of iterations for c = 1 and c — l//'(x 0 ) and 
c= l//'(xw)* when /(x) = x 2 — i and x 0 = 1. 

39 In the iteration for x 2 — 2 = 0* find dF/dx at x *: 



(b) Newton’s iteration has F(x) = x — /(x)//'(x)- Show 
that F* = 0 when /{x) = 0* The multiplier for Newton is 
m = 0 . 

40 What are the solutions of /(x) = x 2 + 2 = 0 and why is 
Newton’s method sure to fail? But carry out the iteration to 
see whether x n -> go* 

41 Computer project F(x) = x — tan x has fixed points where 
tan x * = 0. So x* is any multiple of n* From x 0 = 2.0 and 1*8 
and L9, which multiple do you reach? Test points in 
1*7 < x 0 < 1.9 to find basins of attraction to rc* 2n, 3rc, An. 

Between any two basins there are basins for every multiple 
of n. And more basins between these (a fractal)* Mark them 
on the line from 0 to n* Magnify the picture around x 0 = L9 
(in color?). 

42 Graph cos x and cos(cos x) and cos(cos(cos x))* Also 
(cos) s x. What are these graphs approaching? 

43 Graph sin x and sin(sin x) and (sin) 8 x* What are these 
graphs approaching? Why so slow? 



3.7 Newton’s Method (and Chaos) 



The equation to be solved is f(x ) = 0. Its solution x* is the point where the graph 
crosses the x axis. Figure 3.22 shows x* and a starting guess x 0 . Our goal is to come 
as close as possible to x*, based on the information f(x 0 ) and /'(jc 0 ). 

Section 3.6 reached Newton's formula for (the next guess). We now do that directly. 

What do we see at x 0 ? The graph has height f(x 0 ) and slope /'(jc 0 ). We know 
where we are, and which direction the curve is going. We don’t know if the curve 
bends (we don’t have /"). The best plan is to follow the tangent line, which uses all 
the information we have. 

Newton replaces f(x) by its linear approximation {- tangent approximation); 

/(*) » /(*o) +/'(*o)(* “ *o)- (1) 


We want the left side to be zero. The best we can do is to make the right side zero! 
The tangent line crosses the axis at r lt while the curve crosses at x*. The new guess 
*1 comes from /(jc 0 ) +/'(x 0 )(jc 1 — x 0 ) = 0. Dividing by f'(x 0 ) and solving for x j , this 
is step 1 of Newton’s method: 




Xf) - 


/(*o) 

/'(*>)' 


( 2 ) 


At this new point, compute /( jtj) and f(xj ) — the height and slope at x 1 . They 
give a new tangent line, which crosses at x 2 . At every step we want f{x„. L ) — 0 and 
we settle for f(x n ) + f'(x„)(x„ +1 - x„) = 0. After dividing by f'(x n ), the formula for 
x a+ , is Newton’s method. 
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3 Applications of the Derivative 


3L The tangent line from x„ crosses the axis at x R+1 : 

Newton ’s method x„ + l = x„- frrA. • (3) 

J P^n/ 

Usually this iteration x n+1 = FfxJ converges quickly to x*. 



2 



Fig, 3,22 Newton's method along tangent lines from x 0 to x t to x 2 * 


Linear approximation involves three numbers. They are Ax (across) and A f (up) 
and the slope f f (x). If we know two of those numbers, we can estimate the third. It 
is remarkable to realize that calculus has now used all three calculations — they are 
the key to this subject: 


1. Estimate the slope f'(x) from AfjAx 

2, Estimate the change A f from /'(x) Ax 
3- Estimate the change Ax from Afjf*(x) 


(Section 2.!) 
(Section 3.1) 
(Newton's method) 


The desired A f is -/(x B ), Formula (3) is exactly Ax = -f(x n )!f’(x n ). 


EXAMPLE 1 (Square roots) f(x) = x 2 — b is zero at x* = v b and also at — yf b . 

Newton's method is a quick way to find square roots — probably built into your 

calculator* The slope is /'(xj = 2x n , and formula (3) for the new guess becomes 

x 2 — b 1 h 

2 x " + 2^ (4) 

This simplifies to x n + i ~ i{.v n + bfx „ )* Guess the square root , divide into b , and average 
the two numbers. The ancient Babylonians had this same idea, without knowing 
functions or slopes. They iterated x n+1 = F(x„ ): 

Fw = l( x+ x) and (5) 

The Babylonians did exactly the right thing. The slope F* is zero at the solution , when 
x 2 — b. That makes Newton’s method converge at high speed. The convergence test 
is |F'(x*)| < L Newton achieves F'(x*) = Q — which is superconvergence * 
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To find ^/4, start the iteration x B+1 = i(x„ + 4/x„) at x 0 = 1. Then Xj = ;(1 + 4): 

x* = 2.5 x 2 = 2.05 x 3 = 2.0006 x 4 = 2.000000009. 

The wrong decimal is twice as far out at each step. The error is squared. Subtracting 
x* = 2 from both sides of x„ + t = F(x„) gives an error equation which displays that 
square: 

(6) 

This is (error) n+i & i(error) 2 . It explains the speed of Newton's method. 


Remark 1 You can't start this iteration at x 0 = 0, The first step computes 4/0 and 
blows up. Figure 3.22a shows why — the tangent line at zero is horizontal. It will 
never cross the axis. 


Remark 2 Starting at x 0 = — 1, Newton converges to - yfl instead of + That 
is the other x*. Often it is difficult to predict which jc* Newton's method will choose. 
Around every solution is a "‘basin of attraction," but other parts of the basin may be 
far away. Numerical experiments are needed, with many starts x 0 . Finding basins of 
attraction was one of the problems that led to fractals. 

EXAMPLE 2 Solve - — a — 0 to find x* = - without dividing by a. 
x a 


Here f(x) — ( 1/x) - a . Newton uses /'(x) = - 1/x 2 . Surprisingly, we don't divide: 


(W-fl 


-1 /*2 


x n + x n -ax 2 . 


( 7 ) 


Do these iterations converge? I will take a = 2 and aim for x * = Subtracting £ from 

both sides of (7) changes the iteration into the error equation: 

x R + ! - 2x„ - 2xl becomes x n + l -%=- 2(x n - i) 2 . (8) 


At each step the error is squared. This is terrific if (and only if) you are close to 
x* = 7. Otherwise squaring a large error and multiplying by - 2 is not good: 


x 0 = .70 = .42 * 2 = .487 x 3 = .4997 x 4 = .4999998 

x 0 = 1.21 x 2 = -.5 x 2 = — 1.5 x 3 = -7.5 x 4 = -127.5 


The algebra in Problem 18 confirms those experiments. There is fast convergence if 
0 < x 0 < 1. There is divergence if x 0 is negative or x 0 > 1. The tangent line goes to a 
negative Xj. After that Figure 3.22 shows a long trip backwards. 

In the previous section we drew F(x). The iteration x rt+1 = F{x n ) converged to the 
45° line, where x* — F(x*). In this section we are drawing f(x). Now x* is the point 
on the axis where /(x*) = 0. 

To repeat: It is f{x*) = 0 that we aim for. But it is the slope F'(x*) that decides 
whether we get there. Example 2 has F(x) — 2x — 2x 2 . The fixed points are x* — ^ 
(our solution) and x* = 0 (not attractive). The slopes F'(x*) are zero (typical Newton) 
and 2 (typical repeller). The key to New ton's method is F' = 0 at the solution: 


The slope of F(x) — x — 


m . mr(x) 

/'(*> t/'W ) 2 


Then F'(x) = 0 w henf(x) = 0. 
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3 Applications of the Derivative 


The examples x 1 = b and 1 jx = a show fast convergence or failure. In Chapter 13, 
and in reality, Newton’s method solves much harder equations. Here I am going to 
choose a third example that came from pure curiosity about what might happen. The 
results are absolutely amazing. The equation is x 2 = - 1. 

EXAMPLE 3 What happens to Newton's method if you ask it to solve f(x) = x 2 + 1 = 0? 

The only solutions are the imaginary numbers x* = i and x* = — t. There is no real 
square root of -1. Newton’s method might as well give up. But it has no way to 
know that! The tangent line still crosses the axis at a new point x n + x , even if the 
curve y = x 2 + 1 never crosses. Equation (5) still gives the iteration for b = - 1: 



The x’s cannot approach i or -i {nothing is imaginary). So what do they do? 

The starting guess x 0 = 1 is interesting. It is followed by x 1 = 0. Then x 2 divides 
by zero and blows up. [ expected other sequences to go to infinity. But the experiments 
showed something different (and mystifying). When x„ is large, x n + 1 is less than half 
as large. After x n — 10 comes x n + 1 = i(10^ its) — 4.95. After much indecision and a 
long wait, a number near zero eventually appears. Then the next guess divides by 
that small number and goes far out again. This reminded me of “chaos.” 

It is tempting to retreat to ordinary examples, where Newton’s method is a big 
success. By trying exercises from the book or equations of your own, you will see 
that the fast convergence to ^/4 is very typical. The function can be much more 
complicated than x 2 — 4 (in practice it certainly is). The iteration for 2x = cos x was 
in the previous section, and the error was squared at every step. If Newton’s method 
starts close to x*, its convergence is overwhelming. That has to be the main point of 
this section: Follow the tangent line . 

Instead of those good functions, may I stay with this strange example x 2 + 1 = 0? 
It is not so predictable, and maybe not so important, but somehow it is more interest- 
ing. There is no real solution x*, and Newton’s method x R + 1 = ?(x n - l/x B ) bounces 
around. We will now discover x*. 


A FORMULA FOR X n 


The key is an exercise from trigonometry books. Most of those problems just give 
practice with sines and cosines, but this one exactly fits ^(x n — 1/xJ: 


1 / cos 0 ^ sin 6 \ _ cos 26 

2 \ sin 6 cos 6/ sin 26 


^ (cot e - 



= cot 26 


In the left equation, the common denominator is 2 sin 6 cos 6 (which is sin 26). The 
numerator is cos 2 0- sin 3 6 (which is cos 2 6). Replace cosine/sine by cotangent, 
and the identity says this: 


If x 0 = cot 6 then Xi = cot 20. Then x 2 = cot 40. Then x„ = cot 2" 6. 

This is the formula. Our points are on the cotangent curve. Figure 3.23 starts from 
x 0 = 2 = cot 0, and every iteration doubles the angle. 


Example A The sequence x 0 = 1, x x = 0, x 2 = oo matches the cotangents of ti/4, tt/ 2, 
and n. This sequence blows up because x 2 has a division by x x = 0. 
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Rg. 3.23 Newton’s method for x 2 + 1 = 0, Iteration gives x n = cot 2 n 0 L 


Example £ The sequence 1/^/3, — 1/^/3, 1/^/3 matches the cotangents of w/3, 2n/3, 
and 4rc/3. This sequence cycles forever because x 0 = x 2 = x 4 = .... 

Example C Start with a large x 0 (a small 0). Then x 1 is about half as large (at 20)* 
Eventually one of the angles 4 0, 8 0, . . . hits on a large cotangent, and the x’s go far 
out again. This is typical . Examples A and B were special, when 9Jk was \ or y. 

What we have here is chaos. The x’s can’t converge. They are strongly repelled by 
all points. They are also extremely sensitive to the value of 0. After ten steps 0 is 
multiplied by 2 10 = 1024. The starting angles 60° and 61° look close, but now they 
are different by 1024°. If that were a multiple of 180°, the cotangents would still be 
close. In fact the x 10 ’s are 0.6 and 14. 

This chaos in mathematics is also seen in nature. The most familiar example is the 
weather, which is much more delicate than you might think. The headline 14 Fore- 
casting Pushed Too Far” appeared in Science (1989). The article said that the snow- 
balling of small errors destroys the forecast after six days. We can’t follow the weather 
equations for a month — the flight of a plane can change everything. This is a revolu- 
tionary idea, that a simple rule can lead to answers that are too sensitive to compute. 

We are accustomed to complicated formulas (or no formulas). We are not 
accustomed to innocent-looking formulas like cot 2 n 0, which are absolutely hopeless 
after 100 steps. 


CHAOS FROM A PARABOLA 


Now I get to tell you about new mathematics. First I will change the iteration x n + x = 
^(x„ - l/x n ) into one that is even simpler. By switching from x to z — 1/(1 + x 2 ), each 
new z turns out to involve only the old i and z 2 : 

z» + i = 4z„-4 z*. (10) 

This is the most famous quadratic iteration in the world. There are books about it, 
and Problem 28 shows where it comes from. Our formula for x„ leads to z n : 


1 

i +*. 2 


i 

1 + (cot 2 a 0) z 


= {sin 2 n 0) 2 . 
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z, 


( 11 ) 


The sine is just as unpredictable as the cotangent, when 2 "0 gets large. The new thing 
is to locate this quadratic as the last member (when a = 4) of the family 

z„ + i = az n - az 2 , 0^a^4. (12) 

Example 2 happened to be the middle member a = 2, converging to -j. I would like 
to give a brief and very optional report on this iteration, for different a’s. 

The general principle is to start with a number 2 0 between 0 and 1, and compute 

Zi, z 2 , z 3 , .... It is fascinating to watch the behavior change as a increases. You can 

see it on your own computer. Here we describe some things to look for. All numbers 
stay between 0 and 1 and they may approach a limit. That happens when a is small: 

for 0 ^ a ^ 1 the z„ approach z* = 0 

for 1 ^ a ^ 3 the z H approach z* = (a- 1 )ja 

Those limit points are the solutions of z = F(z). They are the fixed points where 
z* - as* - a{z*) 2 . But remember the test for approaching a limit: The slope at z* 
cannot be larger than one. Here F = az — az 2 has F' = a - 2az. It is easy to check 
1F'| ^ 1 at the limits predicted above. The hard problem — sometimes impossible — 
is to predict what happens above a = 3. Our case is a = 4. 

The z’s cannot approach a limit when |F'(z*)| > 1. Something has to happen, and 
there are at least three possibilities: 

The z„’s can cycle or fill the whole interval (0, 1) or approach a Cantor set. 

I start with a random number z 0 , take 100 steps, and write down steps 101 to 105: 



a = 3.4 

a = 3.5 

a = 3.8 

a = 4.0 

*101 = 

.842 

.875 

.336 

.169 


.452 

.383 

.848 

.562 

*103 = 

.842 

.827 

.491 

.985 

*104 = 

.452 

.501 

.950 

.060 

*105 ” 

.842 

.875 

.182 

.225 


The first column is converging to a “2-cycle,” It alternates between x = .842 and 
y = .452. Those satisfy y — F(x) and x = F(y) = F(F(x)). If we look at a double step 
when a = 3.4, x and y are fixed points of the double iteration z K+1 = F(F(z„)). When 
a increases past 3.45, this cycle becomes unstable. 

At that point the period doubles from 2 to 4. With a = 3.5 you see a “4-cycle" in 
the table — it repeats after four steps. The sequence bounces from .875 to .383 to .827 
to .501 and back to .875. This cycle must be attractive or we would not see it. But it 
also becomes unstable as a increases. Next comes an 8-cycle, which is stable in a little 
window (you could compute it) around a = 3.55. The cycles are stable for shorter and 
shorter intervals of a' s. Those stability windows are reduced by the Feigenbaum shrink- 
ing factor 4.6692.... Cycles of length 16 and 32 and 64 can be seen in physical 
experiments, but they are all unstable before a = 3.57. What happens then? 

The new and unexpected behavior is between 3.57 and 4. Down each line of 
Figure 3.24, the computer has plotted the values of z 1001 to z 20 oo — omitting the first 
thousand points to let a stable period (or chaos) become established. No points 
appeared in the big white wedge. I don’t know why, In the window for period 3, you 
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Rfl. 3.24 Period doubling and chaos from iterating F{z) (stolen by special permission from a = 4 

Introduction to Applied Mathematics by Gilbert Strang, Wellesley -Cam bridge Press), 


see only three z' s. Period 3 is followed by 6 , 12, 24, There is period doubling at the 

end of every window (including all the windows that are too small to see). You can 
reproduce this figure by iterating z n+i =az„- az* from any z 0 and plotting the results. 

CANTOR SETS AND FRACTALS 

I can’t tell what happens at a = 3.8. There may be a stable cycle of some long period. 
The z’s may come close to every point between 0 and 1. A third possibility is to 
approach a very thin limit set, which looks like the famous Cantor set: 

To construct the Cantor set, divide [0, 1] into three pieces and remove the open 
interval (j, §). Then remove ( 5 , 9 ) and ( 9 , §) from what remains. At each step 
take out the middle thirds. The points that are left form the Cantor set. 

All the endpoints §, 9 , 9 , ... are in the set. So is | (Problem 42). Nevertheless the 
lengths of the removed intervals add to 1 and the Cantor set has “measure zero.” 
What is especially striking is its self-similarity: Between 0 and j you see the same 
Cantor set three times smaller. From 0 to £ the Cantor set is there again, scaled down 
by 9. Every section, when blown up, copies the larger picture. 

Fractals That self-similarity is typical of a fractal. There is an infinite sequence of 
scales. A mathematical snowflake starts with a triangle and adds a bump in the 
middle of each side. At every step the bumps lengthen the sides by 4/3. The final 
boundary is self-similar, like an infinitely long coastline. 

The word “fractal" comes from fractional dimension. The snowflake boundary has 
dimension larger than 1 and smaller than 2. The Cantor set has dimension larger 
than 0 and smaller than 1 . Covering an ordinary line segment with circles of radius 
r would take cjr circles, For fractals it takes c/r° circles — and D is the dimension. 
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Rg.3 .25 Cantor set (middle thirds removed). Fractal snowflake (infinite boundary). 


Our iteration z n + 1 = 4z„ — Az\ has a — 4, at the end of Figure 3.24. The sequence 
z 0 ,z u ... goes everywhere and nowhere. Its behavior is chaotic, and statistical tests 
find no pattern. For all practical purposes the numbers are random. 

Think what this means in an experiment (or the stock market). If simple rules 
produce chaos, there is absolutely no way to predict the results. No measurement can 
ever be sufficiently accurate. The newspapers report that Pluto's orbit is chaotic — 
even though it obeys the law of gravity. The motion is totally unpredictable over 
long times. I don't know what that does for astronomy (or astrology). 

The most readable book on this subject is Gleick’s best-seller Chaos: Making a 
New Science* The most dazzling books are The Beauty of Fractals and The Science 
of Fractal Images , in which Peitgen and Richter and Saupe show photographs that 
have been in art museums around the world. The most original books are Mandel- 
brot's Fractals and Fractal Geometry t Our cover has a fractal from Figure 13.1 1. 

We return to friendlier problems in which calculus is not helpless. 

NEWTON'S METHOD VS. SECANT METHOD: CALCULATOR PROGRAMS 


The hard part of Newton’s method is to find dfjdx. We need it for the slope of the 
tangent line. But calculus can approximate by A//Ax — using the values of f(x) 
already computed at x„ and x n _ t . 

The secant method follows the secant line instead of the tangent line: 


Secant : 




/(*„) 
( 4 fl^)„ 


where 



/(*„)-/(*„- l) 


(13) 


The secant line connects the two latest points on the graph of /(x). Its equation is 
y = (Af/Ax)(x - xj. Set y = 0 to find equation (13) for the new x = x n +i, 

where the line crosses the axis. 

Prediction: Three secant steps are about as good as two Newton steps. Both should 
give four times as many correct decimals: {error) {error) 4, . Probably the secant 
method is also chaotic for x 1 + 1 = 0. 

These Newton and secant programs are for the TI-81. Place the formula for /(x) 
in slot Y 'l and the formula for f\x) in slot Vj on the Y = function edit screen. 
Answer the prompt with the initial x 0 = XQ, The programs pause to display each 
approximation x n , the value /(*„), and the difference Press ENTER to 

continue or press ON and select item 2 : Qu i t to break. If f{x n ) = 0, the programs 
display ROOT AT and the rootx n . 



3.7 Nowton’s M e th od (and Chaos) 


145 


PrgmN : NEWTON 

: Di sp "ENTER FOR MORE" 

PrgmS : SECANT 

: Y-T 

:Disp "X0 = " 

: Di sp "ON 2 TO BREAK" 

: D i sp , •X0 = , ’ 

: Y i -»Y 

: I npu t X 

: Di sp " " 

: Input X 

: Di sp "ENTER FOR MORE" 

: X-> S 

:Di sp "XN FXN XN-XNM1 " 

: X->S 

: Di sp "XN FXN XN-XNM1 " 

: Y 1 -* Y 

: D i sp X 

: Y*| ->T 

: D i s p X 

: Lb l 1 

: D i s p Y 

: Di sp "XI =" 

: D i sp Y 

:X-Y/Y2^X 

: D i sp D 

: Input X 

: D i s p D 

:X-S->D 

: Pa u s e 

: Y-|->Y 

: Pa us e 

: X->S 

:If Y#0 

: Lb L 1 

: If Y*0 

: Y 1 Y 

:Goto 1 

:X-S-*D 

: Goto 1 


: Di sp "ROOT AT" 

:X->S 

; Di sp "ROOT AT" 


: Di sp X 

:X-YD / ( Y - T ) ->X 

: Di sp X 


3.7 

EXERCISES 



Read-through questions 

When /(x) = 0 is linearized to f(x n ) +/'(x„X* — xj = 0, the 
solution x = o is Newton’s x n+i . The b to the curve 
crosses the axis at x K + j , while the c crosses at x *. The 
errors at x„ and x n+} are normally related by 
(errorL-ii^M d . This ts e convergence. The 
number of correct decimals f at every step. 

For f(x) = x 2 — b t Newton’s iteration is x M+ j ~ 0 . The 

x„ converge to h if x 0 > 0 and to [ if x Q < 0. For 
f{x) = x 2 + 1, the iteration becomes jc„ f x = I . This can- 
not converge to k . Instead it leads to chaos. Changing 
to z — 1 f(x 2 + 1) yields the parabolic iteration z n+i = l 

For a < 3, z n ^ x = az n — az\ converges to a single m . 
After a — 3 the limit is a 2-cycle, which means n . Later 
the limit is a Cantor set, which is a one-dimensional example 
of a £ . The Cantor set is self- P . 

1 To solve f{x) = x* — b = 0, what iteration comes from 
Newton’s method? 

2 For f(x) = (x — l)/(x + 1} Newton’s formula is x n+l = 

Fix.) = Solve x* = F(x*) and find F(x*). What 

limit do the x„'s approach? 

3 I believe that Newton only applied his method in public 
to one equation x 3 — 2x — 5 = 0. Raphson carried the idea 
forward but got partial credit at best. After two steps from 
Xq = 2, how many decimals in x* = 2.09455148 are correct? 

4 Show that Newton’s method for f(x) = x 1/3 gives the 
strange formula x„ +1 = — 2_r n . Draw a graph to show the 
iterations. 

5 Find x x if (a) f(x 0 ) = 0; (b) f(x 0 ) = 0. 

6 Graph /(x) = x 3 — 3x — 1 and estimate its roots x *. Run 
Newton’s method starting from 0, 1, — and LI. Experiment 
to decide which x 0 converge to which root. 


7 Solve x 2 — 6x + 5 = 0 by Newton’s method with x 0 = 2.5 
and 3. Draw a graph to show which x 0 lead to which root. 

8 If f(x ) is increasing and concave up {/' > 0 and f* > 0) 
show by a graph that Newton’s method converges. From 
which side? 

Solve 9-17 to four decimal places by Newton's method with a 
computer or calculator. Choose any x 0 except x*. 

9 x 2 — 10 = 0 

10 x 4 5 6 — 100 = 0 (faster or slower than Problem 9?) 

11 x 2 — x = 0 (which x 0 to which root?) 

12 x 3 — x = 0 (which x 0 to which root?) 

13 x + 5 cos x = 0 (this has three roots) 

14 x -I- tan x = 0 (find two roots) (are there more?) 

15 1/(1 -x) = 2 

16 1 + x + x 2 + x 3 + x 4 = 2 

17 x 3 + (x -f- 1) 3 = 10 3 

18 (a) Show that x„+ { = 2x„ — 2x 2 in Example 2 is the same 
as (1 — 2x„+ i) = (I — 2x„) 2 . 

(b) Prove divergence if |1 — 2x 0 | > L Prove convergence 
if 1 1 — 2x 0 | <1 or 0 <x 0 < L 

19 With a = 3 in Example 2, experiment with the Newton 
iteration x„ + i = 2x„ — 3x 2 to decide which x 0 lead to x* = 

20 Rewrite x n+ , = 2x„ — ax 2 as (1 — ax n+ j) = (1 — ax n ) 2 . For 
which x 0 does the sequence 1 — ax n approach zero (so 

-► 1/®)? 

21 What is_Newton’s method to find the kXh root of 7? 
Calculate jfl to 7 places. 

22 Find all solutions of x 3 = 4x — 1 (5 decimals). 
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3 Application* of the Derivative 


Problems 23-29 are about x z + 1 = 0 and chaos. 

23 For 0 = tt/ 16 when docs x n = cot 2 n 6 blow up? For 
0 = njl when does cot 2 "0 = cot 0? (The angles 2"0 and 0 
differ by a multiple of tt.) 

24 For 0 = jt/ 9 follow the sequence until x K = x 0 . 

25 For 0=1* x n never returns to x 0 = cot L The angles 2* 
and 1 never differ by a multiple of n because ______ 

26 If z 0 equals sin 2 0 , show that z t = 4 z Q — 4 1 \ equals sin 2 20, 

27 If y = x 2 4- 1, each new y is 



Show that this equals y 2 /4(y ff - 1). 

28 Turn Problem 27 upside down, l/y rt+J = 4(y n — l)/y 2 , to 
find the quadratic iteration (10) for z n — 1/y* = 1/(1 + x 2 ). 

29 If F{z) = 4 z — 4 z 2 what is F(F(z))? How many solutions to 
z = F(F(z))? How many are not solutions to z = F(z)? 

30 Apply Newton’s method to x 3 — .64x -36 = 0 to find the 
basin of attraction for x* = 1. Also find a pair of points for 
which y = F{z) and z = F(y). In this example Newton does 
not always find a root- 

31 Newton’s method solves jc/(I — x) = 0 by x B+1 = 

, From which x 0 does it converge? The distance to 

x* = 0 is exactly squared. 

Problems 33—41 are about competitors or Newton* 

32 At a double root, Newton only converges linearly. What 
is the iteration to solve x 2 = 0? 

33 To speed up Newton’s method, find the step Ax from 
Ax a n&tf(x a ) + K&xfr(x H ) = Q. Test on f(x) = x 2 - 1 
from x 0 = 0 and explain, 

34 Halley’s method uses /„ + Ax f n + x(-/ H //')/" = 0, For 

f(x) = x 2 — 1 and *0 = 1+^ show that = 1 H- 
which is cubic convergence. 

35 Apply the secant method to /(x) = x 2 — 4 - 0, starling 

from x 0 = 1 and x, =2.5, Find Af/Ax and the next point x 2 
by hand. Newton uses = 5 to reach x 2 = 2.0 5. Which 

is closer to x* = 2? 

36 Draw a graph of f{x ) = x 1 — 4 to show the secant line in 
Problem 35 and the point x 2 where it crosses the axis. 


Bisection method If f{x) changes sign between x 0 aud xj, find 
its sign at the midpoint x 2 = 4(x 0 + Decide whether /(x) 
changes sign between x 0 and x 2 or x 2 and x t . Repeat on that 
half-length (bisected) interval. Continue, Switch to a faster 
method when the interval is small enough. 

37 f(x) = x 2 - 4 is negative at x = I, positive at x = 2.5, and 
negative at the midpoint x = 1.75, So x* lies in what interval? 
Take a second step to cut the interval in half again, 

38 Write a code for the bisection method. At each step print 
out an interval that contains x*, The inputs are x 0 and x t ; 
the code calls f(x). Stop if /(x 0 ) and /(jq) have the same 
sign. 

39 Three bisection steps reduce the interval by what 
factor? Starting from x 0 = 0 and x ^ — 8, take three steps for 
f(x) = x 2 — 10. 

40 A direct method is to zoom in where the graph crosses the 
axis. Solve 10x 3 — 8.3x 2 + 2.295x — .21 141 =0 by several 
zooms, 

41 If the zoom factor is 10, then the number of correct 

decimals for every zoom. Compare with Newton. 

42 The number | equals 4(1 + 4 + Vr + *")■ Show that it is in 
the Cantor set. It survives when middle thirds are removed. 

43 The solution to /(x) = (x- l,9)/(x — 2.0) = 0 is x* = 1.9. 
Try Newton’s method from x 0 = l,5, 2.1, and 1.95. Extra 
credit: Which x 0 ’s give convergence? 

44 Apply the secant method to solve cosx = 0 from 
x 0 = ,308. 

45 Try Newton’s method on cos x = 0 from x 0 - *308, If 
cot x 0 is exactly ti, show that X] = x 0 + n (and x 2 = x x + *). 
From x 0 = .308169071 does Newton’s method ever stop? 

46 Use the Newton and secant programs to solve 
x 3 — 10x 2 + 22x + 6 = 0 from x 0 = 2 and 1,39. 

47 Newton’s method for sin x = 0 is x „+ , = x n — tan x„. 
Graph sin x and three iterations from x 0 = 2 and x 0 = 1.8, 
Predict the result for x 0 = 1.9 and test. This leads to the com- 
puter project in Problem 3,6,41, which finds fractals. 

48 Graph yjx) = 3.4(x — x 2 ) and y 2 (x) = Y^Yifx)) in the 
square window (0, 0) ^ (x, y) ^ (1, 1). Then graph y 2 (x) = 
Y 2 (YM) and y 4 , ..., y,. The cycle is from ,842 to .452. 

49 Repeat Problem 48 with 3,4 changed to 2 or 3,5 or 4, 
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Now comes one of the cornerstones of calculus: the Mean Value Theorem . It connects 
the local picture (slope at a point) to the global picture (average slope across an 
interval). In other words it reiates dfjdx to A// Ax. Calculus depends on this conneo 


3.8 The Mean Value Theorem and I'Hopitars Rule 


147 




tion, which we saw first for velocities. If the average velocity is 75, is there a moment 
when the instantaneous velocity is 75? 

Without more information, the answer to that question is no. The velocity could 
be 100 and then 50 — averaging 75 but never equal to 75. If we allow a jump in 
velocity, it can jump right over its average. At that moment the velocity does not 
exist. (The distance function in Figure 3.26a has no derivative at x = 1.) We will take 
away this cheap escape by requiring a derivative at all points inside the interval. 

In Figure 3.26b the distance increases by 150 when t increases by 2. There is a 
derivative df/dt at all interior points (but an infinite slope at t = 0). The average 
velocity is 

A/_ /(2)-/(0) _ 150 
At 2-0 2 

The conclusion of the theorem is that df/dt = 75 at some point inside the interval. 
There is at least one point where f'(c) = 75. 

This is not a constructive theorem. The value of c is not known. We don't find c, 
we just claim (with proof) that such a point exists. 


3M Mean Value Theorem Suppose f{x) is continuous in the closed interval 
a^x^b and has a derivative everywhere in the open interval a< x<b. Then 

= /'(c) at some point a<c< b. (1) 


The left side is the average slope Af/Ax. It equals df/dx ac c. The notation for a 
closed interval [with endpoints] is [a, />]. For an open interval (without endpoints) 
we write ( a , b ). Thus /' is defined in (a, 6), and / remains continuous at a and b. A 
derivative is allowed at those endpoints too — but the theorem doesn't require it. 

The proof is based on a special case — when /(a) = 0 and f(b) = 0. Suppose the 
function starts at zero and returns to zero. The average slope or velocity is zero. We 
have to prove that /'(c) = 0 at a point in between. This special case (keeping the 
assumptions on f(x)) is called Rollers theorem. 

Geometrically, if / goes away from zero and comes back, then /' = 0 at the turn. 


3N Rollers theorem Suppose f(a) =f(b) = 0 (zero at the ends). Then /'(c) = 0 
at some point with a<c <b. 


Proof At a point inside the interval where f(x) reaches its maximum or minimum, 
df/dx must be zero. That is an acceptable point c. Figure 3.27a shows the difference 
between /= 0 (assumed at a and b) and /' = 0 (proved at c). 
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3 Application* at the Derivative 


Small problem: The maximum could be reached at the ends a and b , if f(x) < 0 in 
between. At those endpoints dfjdx might not be zero. But in that case the minimum 
is reached at an interior point c, which is equally acceptable. The key to our proof 
is that a continuous function on [a, b] reaches its maximum and minimum . This is the 
Extreme Value Theorem 

It is ironic that Rolle himself did not believe the logic behind calculus. He may not 
have believed his own theorem! Probably he didn't know what it meant — the lan- 
guage of “evanescent quantities” (Newton) and “infinitesimals” (Leibniz) was exciting 
but frustrating. Limits were close but never reached. Curves had infinitely many flat 
sides. Rolle didn't accept that reasoning, and what was really serious, he didn't accept 
the conclusions. The Academie des Sciences had to stop his battles (he fought against 
ordinary mathematicians, not Newton and Leibniz). So he went back to number 
theory, but his special case when f{a) =f{b) = 0 leads directly to the big one. 



Rq. 3.27 Rollers theorem is when f[a)=J{b) = 0 in the Mean Value Theorem. 


Proof of the Mean Value Theorem We are looking for a point where df jdx equals 
A//Ax. The idea is to tilt the graph back to Rolle' s special case (when A/ was zero). 
In Figure 3.27b, the distance F(x) between the curve and the dotted secant line comes 
from subtraction: 


F[x) = f{x) - 




( 2 ) 


At a and b , this distance is F(a) = F(b) = 0. Rolle' s theorem applies to F(x). There is 
an interior point where F'(c) = 0. At that point take the derivative of equation (2): 
0 = f(c) - (A//Ax). The desired point c is found, proving the theorem. 


EXAMPLE 1 The function f(x) — ^ fx goes from zero at x = 0 to ten at x = 100. Its 
average slope is A//Ax = 10/100. The derivative f r (x)= 1/2^/x exists in the open 
interval (0, 100), even though it blows up at the end x = 0. By the Mean Value 
Theorem there must be a point where 10/100 = /'(c) “ 1/2^/c. That point is c = 25, 
The truth is that nobody cares about the exact value of c. Its existence is what 
matters. Notice how it affects the linear approximation f{x)?zf{a)+ f f {a){x — a), 
which was basic to this chapter. Close becomes exact ( becomes = ) when /' is 
computed at c instead of a: 


flf f(x) doesn’t reach its maximum M y then \j{M — f(x)) would be continuous but also 
approach infinity. Essential fact: A continuous function on [a, b~\ cannot approach infinity. 
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30 The derivative at c gives an exact prediction of f(x): 

f(x) =f(a) +f'(c)(x - a). (3) 

The Mean Value Theorem is rewritten here as A f = f*(c) Ax. Now a <c < x. 


EXAMPLES The function /(x) = sinx starts from /( 0) = 0, The linear prediction 
(tangent line) uses the slope cos 0=1. The exact prediction uses the slope cos c at an 
unknown point between 0 and x: 

(approximate) sin xxzx (exucf) sin x = (cos c)x. (4) 

The approximation is useful, because everything is computed at x = a = 0* The exact 
formula is interesting, because cos c < I proves again that sin x < x. The slope is 
below 1, so the sine graph stays below the 45° line. 

EXAMPLE 3 If /'(c) — 0 at all points in an interval then f(x } is constant. 

Proof When /' is everywhere zero, the theorem gives A/= 0. Every pair of points 
has f(h)=f(a). The graph is a horizontal line. That deceptively simple case is a key 
to the Fundamental Theorem of Calculus. 

Most applications of A f =f*(c ) Ax do not end up with a number. They end up with 
another theorem (like this one). The goal is to connect derivatives (local) to differences 
(global). But the next application — THopitaTs manages to produce a number 

out of 0/0. 

L'HdPITAL'S rule 


When f(x) and g(x) hath approach zero , what happens to their ratio /(x)/g(x)? 


f(x) x 2 

^TV =— or 
gW x 


sin x x- sin x t 0 

or all become - at x — 0. 

x l “ cos x 0 


Since 0/0 is meaningless, we cannot work separately with f(x) and g(x), This is a 
“ race toward zero" in which two functions become small while their ratio mighl do 
anything. The problem is to find the limit of /(x)/g(x). 

One such limit is already studied. It is the derivative] A// Ax automatically builds 
in a race toward zero, whose limit is df/dx: 


f(x) - f(a ) 0 

x — a -+ 0 


but 






(5) 


The idea of PHopital is to use fjg to handle fjg. The derivative is the special case 
g(x}= x - a . with g f — L The Rule is followed by examples and proofs. 


3P VHopitaVs Rule Suppose f(x) and g(x) both approach zero as x -► a. Then 
f(x)/g(x) approaches the same limit as f(x)/g t (x), if that second limit exists: 

lim = lim — — , Normally this limit is (6) 

g(x) — g M g (a) 


This is not the quotient rulel The derivatives of /(x) and g(x) are taken separately. 
Geometrically, THopital is saying that when functions go to zero their slopes control 
their size . An easy case is /= 6(x - a) and g = 2(x a). The ratio / jg is exactly 6/2, 
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3 Application! ol flic Derivative 




the ratio of their slopes. Figure 3.28 shows these straight lines dropping to zero, 
controlled by 6 and 2. 

The next figure shows the same limit 6 / 2 , when the curves are tangent to the lines. 
That picture is the key to l’HdpitaPs rule. 

Generally the limit of f/g can be a finite number L or + oo or - oo. (Also the limit 
point x = a can represent a finite number or +oo or - oo. We keep it finite.) The 
one absolute requirement is that f(x) and g(x) must separately approach zero — we 
insist on 0 / 0 . Otherwise there is no reason why equation (6) should be true. With 
f(x) = x and g(x) = x - 1 , don't use l’Hopital: 

M^JL- but £W_I 

*(*) a-\ g(x) 1 

Ordinary ratios approach lim f(x) divided by lim^(x). THopital enters only for 0 / 0 , 


EXAMPLE 4 


(an old friend) 


„ 1 - cos x . sin x __ . 

lim equals lim — - — . This equals zero. 

0 x x-0 1 


EXAMPLE 5 


/ _ tan x 
g sin x 


leads to 


r 

g' 


sec z x 1 

. At x = 0 the limit is 7 , 

cos x 1 


EXAMPLE 6 


/ 

g 


x — sin x 
1 - COS .X 


leads to 


r 

g f 


1 - cos X 
sin x 


At x - 0 this is still - . 


Solution 


Apply the Rule to f'!g'- 




r 

g 


0 

0 


It has the same limit as 

t /"(x) sin x 0 ^ 

then compute — — = ► 7 = 0. 

g {x) cos x 1 


The reason behind VH&pitaVs Rule is that the following fractions are the same : 

/(*) _ /(*) -/(a) / g(x)-g(fl) 
g(x) x — a I x — a 


That is just algebra; the limit hasn’t happened yet. The factors x - a cancel, and the 
numbers f(a) and g(a) are zero by assumption. Now take the limit on the right side 
of ( 7 ) as x approaches a . 

What normally happens is that one part approaches /' at x = a. The other part 
approaches g'(a ). We hope g'(a) is not zero. In this case we can divide one limit by 
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the other limit. That gives the “normal" answer 


lim = limit of (7) = 
***■ gM 


m 

g'w 1 


( 8 ) 


This is also I’Hopital’s answer. When f'{x) -* f'(a) and separately jf'(x) — ► his 

overall limit is f'(a)/g'(a). He published this rule in the first textbook ever written on 
differential calculus. (That was in 1696 — the limit was actually discovered by his 
teacher Bernoulli.) Three hundred years later we apply his name to other cases 
permitted in (6), when f'jg might approach a limit even if the separate parts do not. 

To prove this more general form of I'Hopital’s Rule, we need a more general Mean 
Value Theorem. 1 regard the discussion below as optional in a calculus course 
(but required in a calculus book). The important idea already came in equation (8). 


Remark The basic “ indeterminate " is co — oo. If f(x) and g{x) approach infinity, 
anything is possible for f{x) — g(x). We could have x 2 — x or x - x 2 or (jc + 2) - x. 
Their limits are oo and — oo and 2. 

At the next level are 0/0 and oo/oo and 0* oo. To find the limit in these cases, try 
I’Hopital’s Rule. See Problem 24 when f(x)/g{x) approaches oo/oo. When f(x ) -* 0 
and g(x) -*■ oo, apply the 0/0 rule to /(jc)/(l/g(x)). 

The next level has 0° and I" and oo°. Those come from limits of /(x)* (Jc) . If f(x) 
approaches 0, t, or oo while g(x) approaches 0, oo, or 0, we need more information. 
A really curious example is x 1/ln \ which shows all three possibilities 0° and 1* and 
oo°. This function is actually a constant! It equals e. 

To go back down a level, take logarithms. Then g(x) In f(x) returns to 0/0 and 
0* oo and l'Hdpital’s Rule. Rut logarithms and e have to wait for Chapter 6, 


THE GENERALIZED MEAN VALUE THEOREM 


The MVT can be extended to two functions. The extension is due to Cauchy, who 
cleared up the whole idea of limits. You will recognize the special case g = x as the 
ordinary Mean Value Theorem. 


3Q Generalized MVT If f(x ) and g(x) are continuous on [a, ft] and 
differentiable on ( a , ft), there is a point a < c < ft where 

[/(ft) = I Jib) - g{a)-]f'(c% (9) 


The proof comes by constructing a new function that has F(a) = F(b): 

F(x) = [/(ft) -f(a)]g(x) - lg(b) - g(a)]f(x). 

The ordinary Mean Value Theorem leads to F'(c)= 0 — which is equation (9). 

Application I (Proof of THopital's Rule) The rule deals with f(a)jg(a) = 0/0. Insert- 
ing those zeros into equation (9) leaves f(b)g , (c) - g(ft)/'(c). Therefore 


/(ft) _ fjc) 

g(b ) g f {c) ‘ 


( 10 ) 


As ft approaches a, so does c. The point c is squeezed between a and ft. The limit of 
equation (10) as ft a and c -> a is PHopitaPs Rule. 
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3 Applications of tho Derivative 


Application 2 (Error in linear approximation) Section 3.2 stated that the distance 
between a curve and its tangent line grows like (x - a) 2 . Now we can prove this, and 
find out more. Linear approximation is 

fix) =m +/»(x - a) + error e(x). (11) 

The pattern suggests an error involving f"{x) and ( x - a) 2 . The key example /= jc 2 
shows the need for a factor j (to cancel f" = 2). The error in linear approximation is 

e(x) = jf”(c)(x — a) 2 with a<c<x. (12) 

Key idea Compare the error c(jc) to (jc — a) 2 . Both are zero at x = a: 

e =/( x) -/(a) (a)(x - a) e' =/'(x) -/'(a) =/"(*) 

g = (x-a) 2 g' = 2(x-a) g ,f = 2 

The Generalized Mean Value Theorem finds a point C between a and x where 
e(x)/g(x) — e t (C)/g'(C). This is equation (10) with different letters. After checking 
e f (a) = g / (a) = 0, apply the same theorem to e'(x) and g'(x). It produces a point c 
between a and C — certainly between a and x — where 


e\C) _ e"(c) 
g'(C) g"(c) 


and therefore 


g(x) 


e v (c) 

ficY 


With g = (x — a) 2 and g" “ 2 and c" = /", the equation on the right is e(x) = 
i a) 2 . The error formula is proved. A very good approximation is 

~ <*) 2 - 


EXAMPLE 7 f(x) = fx near a = 100: * 10+ 2 + 2 2 . 

That last term predicts e = — .0005. The actual error is ^f\02 — 10- 1 = — ,000496. 


3.8 EXERCISES 


Read-through questions 

The Mean Value Theorem equates the average slope A//Ax 
over an o [a, h] to the slope dfjdx at an unknown b . 
The statement is c , It requires f{x) to be d _ on the 
» interval [ a , h], with a { on the open interval (a, b). 

Rolle’s theorem is the special case when /{a) = f(b ) = 0, and 
the point c satisfies fl . The proof chooses c as the point 
where / reaches its h . 


Find all points 0 < c < 2 where /( 2) — /( 0) - /'(c)( 2 — 0). 

1 f(x) = x 3 2 /(x) = sin ?rx 

3 /(x) = tan2nx 4 /(x)=l+x + x 2 

5/W = (x-l)‘° 6 /(x)=(x-l) 9 

Id 7-10 show that no point c yields /(l)— /(— l)=/'(cX2). 
Explain why the Mean Value Theorem fails to apply. 


Consequences of the Mean Value Theorem include: 
If /'(x) = 0 everywhere in an interval then /(*) = _ j_. 
The prediction f(x) = fia)-\- 1 (x-a) is exact for 

some c between a and x. The quadratic prediction 
/(x) = f(a) + /'(a)(x — a) + It (x - a ) 2 is exact for another 
c. The error in f(a)+f'(a)(x — a) is less than \M{x — a) 2 
where M is the maximum of I 

A chief consequence is THopital’s Rule, which applies when 
/(x) and jelxl -> m as x-*a. In that case the limit of 
f(x)/g{x) equals the limit of n , provided this limit exists. 
Normally this limit is /'(aj/g'fa). If this is also 0/0, go on to 
the limit of o 


7 /(*) = |x - y| 8 f(x) = unit step function 

9 /(x) = |x| 1 ' 2 10 f(x)=\jx 2 

11 Show that sec 2 x and tan 2 x have the same derivative, and 
draw a conclusion about f{x) = sec 2 x — tan 2 x . 

12 Show that csc 2 x and cot 2 x have the same derivative and 
find /(x) = esc 2 jc — cot 2 x. 


Evaluate the limits in 13-22 by I’Hfipital’s Rule. 


13 lim 

j-*3 


x 2 — 9 
x — 3 


14 lim 

x-3 


x 2 — 9 
x 4- 3 
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15 lim 

*-►0 


(1 + x)" 2 -! 


17 lim : 


19 lim 

x-hO 


21 lim 
*-o 


sin x 

(l+xf-l 


, 6 Ita 


18 lim ' 

x-+i sin x 

(l+xf-l-nx 


sin x — tan x 


20 lim 

x^O 


22 lim 

x-0 


^/\+x-J\ ~x 


23 For f—x 1 —4 and g = x + 2, the ratio fix' approaches 4 
as x -* 2. What is the limit of f(x)/g(x)'l What goes wrong in 
1’HopitaTs Rule? 

24 VIlopitaTs Rule still holds for f(x)lg(x) -* oo/oo: L is 

I im /W iWM _ , 2 .. rn 

jrW /te" /'M/rw - /'W 

Then L equals lim Lf , (x)/g f (x)J if this limit exists. Where did 
we use the rule for 0/0? What other limit rule was used? 


25 Compute lim 


1+0/x) 


*-o 1 — (1 /jc) 


26 Compute lim 


x 2 + x 


*-►*> 2x 2 


M x + cos X . 

27 Compute lim : by common sense. Show that 

x + sin x 

PHopital gives no answer. 


CSC X 

28 Compute lim by common sense or trickery. 

x -*0 cot x 

29 The Mean Value Theorem applied to /(x) = x 3 guarantees 
that some number c between 1 and 4 has a certain property. 
Say what the property is and find c * 

30 If \dfjdx\ < 1 at all points, prove this fact: 

\f(x)“-f{y)\<\x-y \ at all x and y. 

31 The error in Newton’s method is squared at each step: 
[x fl+1 — x*| ^ M|x n — x*[ 2 . The proof starts from 0 =f(x*) = 
/(^)+/ J (xJ^-x„) + ir( C )(x*-x n ) 2 . Divide by f’(x K ), 
recognize x H+lr and estimate M. 


32 (Rollers theorem backward) Suppose /'(c) = 0. Are there 
necessarily two points around c where /(fl) =/(&)? 

33 Suppose f[0) = 0. If /(x)/x has a limit as x 0, that limit 

is better known to u$ as , L’Hopital’s Rule looks 

instead at the limit of . 

Conclusion from PHopital: The limit of /'(x), if it exists, 
agrees with /'(Q). Thus / r (x) cannot have a “removable 


34 It is possible that /'(x)/g r (x) has no limit but f{x)fg(x) -* L. 
This is why PHopital included an “if.” 

(a) Find Las x -*0 when /(x) = x 2 cos(l/x)and£(x) = x. 
Remember that cosines are below 1. 

(b) From the formula /'(x) = sin(l/x) + 2x cos{t/x) show 
that f'jgf has no limit as x -+ 0. 

35 Stein’s calculus book asks for the limiting ratio of 
/(x) — triangular area ABC to g(x) = curved area ABC , 

(a) Guess the limit of f/g as the angle x goes to zero, 

(b) Explain why /(x) is i(sin x — sin x cos x) and g(x) is 
i(x — sin x cos x). (c) Compute the true limit of /(x)/g(x). 



36 If you drive 3000 miles from New York to L.A. in 100 

hours (sleeping and eating and going backwards are allowed) 
then at some moment your speed is . 

37 As x -+ oo PHopital’s Rule still applies. The limit of 
f(x)jg{x) equals the limit of / r (iMx) t if that limit exists. 
What is the limit as the graphs become parallel in Figure B? 

38 Prove that /(x) is increasing when its slope is positive: // 
/'(c) >0 at all points c, then f[b) > f(a ) at all pairs of points 
b>a. 
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CHAPTER 4 


Derivatives by the Chain Rule 


4.1 The Chain Rule 


You remember that the derivative of f(x)g(x) is not (dfjdx)(dgfdx}. The derivative 
of sin * times x 2 is not cos * times 2x , The product rule gave two terms, not one 
term. But there is another way of combining the sine function / and the squaring 
function g into a single function. The derivative of that new function does involve 
the cosine times 2x (but with a certain twist). We will first explain the new function, 
and then find the ^ chain rule ” for its derivative. 

May I say here that the chain rule is important. It is easy to learn, and you will 
use it often. I see it as the third basic way to find derivatives of new functions from 
derivatives of old functions. (So far the old functions are x rt , sin x, and cos x. Still 
ahead are e x and logx.) When /and g are added and multiplied, derivatives come 
from the sum rule and product rule. This section combines /and g in a third way. 

The new function is sin(.v 2 ) — the sine of x 2 . It is created out of the two original 
functions: if x = 3 then x 2 = 9 and sin(x 2 ) = sin 9. There is a "chain"' of functions, 
combining sin x and x 2 into the composite function sin(x 2 ). Yon start with x, then 
find g(xl then find f{g(x\): 

The squaring function gives y = x 2 . This is #(x). 

The sine function produces z = sin y = sinfx 2 ), This is /(£(*)), 

The "inside function" #(x) gives \\ This is the input to the "outside function ' f{y). That 
is called composition. It starts with x and ends with z. The composite function is 
sometimes written fg (the circle shows the difference from an ordinary product fg). 
More often you will see /(#(x)); 

z{x)=f:g{x)=j\g{x)h (1) 

Other examples are cos 2x and (2x)\ with g = 2x. On a calculator you input x, then 
push the “g*' button, then push the L /" button: 

From x compute y = g{x) From y compute z =/{}')♦ 

There is not a button for every function! But the squaring function and sine function 
are on most calculators, and they are used in that order. Figure 4.1a shows how 
squaring will stretch and squeeze the sine function. 
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That graph of sin x 2 is a crazy FM signal (the Frequency is Modulated). The wave 
goes up and down like sin x , but not at the same places. Changing to sin g(x) moves 
the peaks left and right. Compare with a product g(x) sin x , which is an AM signal 
(the Amplitude is Modulated). 

Remark /fg(x)) is usually different from g(/(x)). The order of f and g is usually 
important. Foij[x) = sin x and g(x ) = x 2 , the chain in the opposite order g(/fx)) gives 
something different: 

First apply the sine function: y - sin x 
Then apply the squaring function: z = (sin x) 2 . 

That result is often written sin 2 x, to save on parentheses. It is never written sin x 2 , 
which is totally different. Compare them in Figure 4.1. 





Rg-4,1 Rg(x)) is different from g(.ftx)). Apply g then /, or / then g. 


EXAMPLE 1 The composite function/*# can be deceptive. If g(x)- x 3 and/fy) = y 4 , 
how dots J\g(x)) differ from the ordinary product _ftx)g(x)? The ordinary product is 
x 7 . The chain starts with y = x 3 , and then z = y 4 -x 12 . The composition of j t* and 
/ gives/{g(x))= x 12 . 


EXAMPLE 2 In Newton’s method, F(x) is composed with itself This is iteration . 
Every output x n is fed back as input, to find x„ + { = F(x J. The example F(x) = ^x + 4 
has F(F(x)) — ^(^x + 4) + 4. That produces z — £x + 6. 

The derivative of F(x) is 7, The derivative of z = F(F(x)) is which is 7 times 7, 
We multiply derivatives. This is a special case of the chain rule. 

An extremely special case is /(x) = x and g(x) = x. The ordinary product is x 2 . The 
chain /f#(x)) produces only x! The output from the “identity function M is g(x) = x.t 
When the second identity function operates on x it produces x again. The derivative 
is 1 times L I can give more composite functions in a table: 


,F = 8(*) 


z=/ljrtx» 

x 2 -\ 

Vy 

jx 2 -l 

COS X 

f 

(cos x) 3 

2 X 

2 y 

2i x 

x + 5 

y~ 5 

X 


The last one adds 5 to get y. Then it subtracts 5 to reach z. So z — x. Here output 


f A calculator has no button for the identity function. It wouldn’t do anything. 
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equals input: _%{*)) = x. These “inverse functions” are in Section 4.3. The other 
examples create new functions z(x) and we want their derivatives. 


THE DERIVATIVE OF ApU)) 


What is the derivative of z = sin x 2 ? It is the limit of AzjAx. Therefore we look at a 
nearby point x 4- Ax. That change in x produces a change in y = x 1 — which moves 
to y + Ay = (x + Ax) 2 , From this change in y, there is a change in z=j\y). It is a 
“domino effect,” in which each changed input yields a changed output: Ax produces 
Ay produces Az. We have to connect the final Az to the original Ax. 

The key is to write AzjAx as AzjAy times Ay I Ax. Then let Ax approach zero. 
In the limit, dzjdx is given by the “chain rule”: 


Az Az A y , . dz dz dy 

— = — — becomes the chain rule — = 

Ax Ay Ax dx dydx 


( 2 ) 


As Ax goes to zero, the ratio Ay /Ax approaches dyjdx. Therefore Ay must be going 
to zero, and AzjAy approaches dzjdy. The limit of a product is the product of the 
separate limits (end of quick proof). We multiply derivatives 


4A Chain Rule Suppose g(x) has a derivative at x and/fy) has a derivative 
at y — g(x ). Then the derivative of z =y(g(x)) is 

= (3> 

The slope at x is dfjdy (at y) times dgjdx (at x). 


Caution The chain rule does not say that the derivative of sin x 2 is (cos x)(2x). 
True, cos y is the derivative of sin y. The point is that cos y must be evaluated at y 
(not at x). We do not want dfjdx at x, we want dfjdy at y - x 2 : 

The derivative of sin x 2 is (cos x 2 } times (2x). (4) 

EXAMPLE 3 If z = (sin x) 2 then dzjdx = (2 sin x)(cos x). Here y = sin x is inside. 

In this order, z = y 2 leads to dzjdy = 2 y. It does not lead to 2x. The inside function 
sin x produces dyjdx = cos x. The answer is 2y cos x. We have not yet found the 
function whose derivative is 2x cos x. 

EXAMPLE 4 The derivative of z = sin 3x is ^ ^ ^ = 3 cos 3x. 

dx dy dx 




Az Az Ay 
Ax Ay Ax 



dx dy dx 



4.1 The Chain Rule 


157 


The outside function is z = sin y. The inside function is y = 3x. Then dzjdy = cos y — 
this is cos 3x, not cos x. Remember the other factor dyjdx = 3. 

I can explain that factor 3, especially if x is switched to t . The distance is z = sin 3r. 
That oscillates like sin t except three times as fast. The speeded-up function sin 3 1 
completes a wave at time 2n/3 (instead of In). Naturally the velocity contains the 
extra factor 3 from the chain rule. 


EXAMPLE 5 Let z =f(y) = y n . Find the derivative of f(g(x}) = [£(*)]". 

In this case dzjdy is ny"~\ The chain rule multiplies by dyjdx : 

f x ( 5 ) 

This is the power rule\ It was already discovered in Section 2,5, Square roots (when 
ft = 1/2) are frequent and important. Suppose y = x 2 — 1: 




(x 2 


!)“ 1/2 (2x) = 


x 

J X 1 - 1 


( 6 ) 


Question A Buick uses 1/20 of a gallon of gas per mile. You drive at 60 miles per 
hour. How many gallons per hour? 

Answer {Gallons j hour) = {gallons! mile)(milesi hour). The chain rule is (dyjdt) = 
(dyjdx)(dxjdt). The answer is (1/20)(60) = 3 gallons/hour. 


Proof of the Chain rule The discussion above was correctly based on 
Az Az Ay , dz dz dy 

= and ~r = T~r- ( ? ) 

Ax Ay Ax dx dy dx 

It was here, over the chain rule, that the “battle of notation” was won by Leibniz. 
His notation practically tells you what to do: Take the limit of each term. (1 have to 
mention that when Ax is approaching zero, it is theoretically possible that Ay might 
hit zero. If that happens, Az/Ay becomes 0/0. We have to assign it the correct meaning, 
which is dzjdy.) As Ax -► 0, 


Ay 

Ax 


g'W 


and 


Az 

Av 


/'M =/'(£(*)). 


Then Az/Ax approaches f(y) times g r (x), which is the chain rule (dzjdy)(dyidx). In the 
table below, the derivative of (sin x) 3 is 3(sin x) 2 cos x. That extra factor cos x is easy 
to forget. It is even easier to forget the -1 in the last example. 

z = (x 3 + I) 5 dzjdx = 5(x 3 + l) 4 times 3x 2 

z = (sin x) 3 dzjdx = 3 sin 2 x times cos x 

z = (1 — x) 2 dzjdx = 2(1 — x) times - 1 

Important All kinds of letters are used for the chain rule. We named the output z. 
Very often it is called y, and the inside function is called u: 

The derivative of y = sin u(x) is -j- = cos u — , 

dx dx 


Examples with dujdx are extremely common. I have to ask you to accept whatever 
letters may come. What never changes is the key idea — derivative of outside function 
times derivative of inside function. 
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EXAMPLE 6 The chain rule is barely needed for sin(x- 1). Strictly speaking the 
inside function is u — x— 1* Then dujdx is just 1 (not —1). If y = $m(x — 1) then 
dyjdx = cos(jc — 1 ). The graph is shifted and the slope shifts too. 

Notice especially: The cosine is computed at jc - 1 and not at the unshifted jc. 


RECOGNIZING f(y) AND g(x) 


A big part of the chain rule is recognizing the chain . The table started with (jc 3 + l) 5 . 
You look at it for a second. Then you see it as u 5 . The inside function is u = jc 3 + L 
With practice this decomposition (the opposite of composition) gets easy: 

cos (2jc + 1) is cos u ^/l + sin t is Ju x sin jc is ... (product rule!) 

In calculations, the careful way is to write down all the functions: 

t — cos u u = 2x + 1 dzjdx — (— sin u)(2) — — 2 sin (2x + 1). 

The quick way is to keep in your mind “the derivative of what’s inside.” The slope 
of cos(2:x + 1) is -sin(2x + 1), times 2 from the chain rule. The derivative of 2x 4- 1 
is remembered — without z or u or /or g. 


EXAMPLE 7 sin yj 1 — jc is a chain of z = sin y, y = v /u, u— 1 - jc (three functions). 
With that triple chain you will have the hang of the chain rule: 


The derivative of sin J 1 - jc is (cos J\ x) Gypp)' 1) - 


This is (dzjdy)(dyjdu)(du/dx) r Evaluate them at the right places y, u, jc. 

Finally there is the question of second derivatives. The chain rule gives dzjdx as a 
product, so d 2 zjdx 2 needs the product rule: 


dz dzdy d 2 z dzd 2 y dfdz\dy 

dx dydx 63 S t0 tfjc 2 dydx 2 + dx \dy/ dx' 


( 8 ) 


u v 


u v f + U* V 


That last term needs the chain rule again. It becomes dfzjdy 2 times (dyjdx) 2 . 


EXAMPLES The derivative of sin* 2 is 2jccosx 2 . Then the product rule gives 
d 2 z/dx 2 — 2 cos jc 2 — 4jc 2 sin x 2 . In this case y" = 2 and (/) 2 = 4x 2 . 


4.1 EXERCISES 


Read~through questions 

z =y(g(x)) comes from z = f(y ) and y - a . At x — 2, the 
chain [x 2 — l) 3 equals b . Its inside function is y = c . 
its outside function is z = d , Then dzjdx equals * . 
The first factor is evaluated at y = t (not at y = x). 
For z — sin(x 4 — 1) the derivative is a . The triple chain 
z = cos(x + l) 2 has a shift and a n and a cosine. Then 
dzidx = I . 

The proof of the chain rule begins with Az/Ax — 
(_!_)(_*_) and ends jvith I Changing letters, y = 


cos u(x) has dyjdx = m . The power rule for y = M*)T i s 
the chain rule dyjdx — » . The slope of 5g(x) is p and 
the slope of g(5x) is p . When / = cosine and g = sine and 
x = 0 t the numbers y(g(x)) and g(/(x)) and f(x)g(x) are q . 

dz 

In 1-10 identify f[y) and g(x). From their derivatives find — . 

1 z = (x 2 -Zf 2 z = (x 3 — 3) 2 

3 i = cos(x 3 ) 4 z — tan 2x 

Si — ^/sin x 6 z ™ sin ^ fx 
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7 z = tan(l/x) + 1/tan x 8 z = sin(cosx) 

9 z = cos(x 2 + x + 1) 10 z = y/x 2 

in 11-16 write down dzjdx, Don’t write down / and g. 

11 z = sin(17x) 12 z = tan(x+I) 

13 z = cosfcos x) 14 z = (x 2 ) 3f2 

15 z = x 2 sin x 16 z = (9x + 4) 3 ' 2 

Problems 17-22 involve three functions z(y), y(u), and u(x). 
Find dzjdx from {dz!dy){dyidu)(dujdx\ 

17 z = sin y/x + 1 18 z = v /sin(x -1- 1) 

19 z = yf 1 + sin x 20 z = sinf^x + 1) 

21 z = sin< 1/sin x) 22z = (sinx 2 ) 2 

In 23-26 find dzjdx by the chain rule and also by rewriting z* 

23 z = ((x 2 ) 2 ) 2 24 z = (3x) 3 

25 z - (x + l) 2 + sin(x -h ti) 26 z = x / 1 - cos 2 x 


because is continuous, there is a 6 such that 

\g{x}- 7 1 <6 whenever |x — 4| <B> Conclusion: lf\x -4| < 0 
then . This shows that/(£{x)) approaches f(g{ 4)). 

37 Only six functions can be constructed by compositions (in 
any sequence) of g(x) = 1 — x and f{x) = 1/x. Starting with g 
and / find the other four. 

38 If x(x) =\-x then g(g(x)) = 1 -(1 - x) = x. If g(x) = 1/x 
then g(g(x))= l/(l/x) = x, Draw graphs of those g ' s and 
explain from the graphs why g(g(x)) = x, Find two more 
with this special property. 

39 Construct functions so that/(g(x)) is always zero, but/(y) 
is not always zero. 

40 True or false 

(a) If/(x) =/(-x) then /'(x) =/'(-*). 

(b) The derivative of the identity function is zero. 

(c) The derivative of /(1/x) is — l/(/(x)) 2 , 

(d) The derivative of /(I + x) is /'(l + x). 

(e) The second derivative of /(g(x)) is f"{g(x))g"{x). 


27 If f(x) = x 2 + 1 what is /(/{x))'* If L r (x) is the unit step 
function {from 0 to 1 at x = 0) draw the graphs of sin [;(x) 
and £/(sin x). If R(x) is the ramp function j(x + [x|), draw the 
graphs of R(x) and sin x). 

28 (Recommended) If g(x) = x 3 Rnd /(y) so that /(jf(x)) = 
x 3 + 1. Then find fi(y) so that /j(£(x)) = x, Then find Jt(y) so 
that /^x))- 1. 

29 If^y) = y-2 find g(x) so that /(g(x)) = x, Then find h(x) 
so that /^(x)) = x 2 . Then find k{x) so that /(Jt(x)) = 1 . 

30 Find two different pairs /(y), #(,*) so that /(j^x)) = 
v r l-x 2 . 

31 The derivative of /(/(x)) is __ . Is it (df!dx) 2 '? Test 

your formula on j\x) = 1/x. 


41 On the same graph draw the parabola y = x 2 and the 
curve z — sin y (keep y upwards, with x and z across). Starting 
at x = 3 find your way to z = sin 9. 

42 On the same graph draw y — sin x and z = y 2 (y upwards 
for both). Starting at x = ?i/4 find z = (sin x) 2 on the graph. 

43 Find the second derivative of 

(a) sin(x 2 + 1) (b) y/x 2 — 1 (c)cos y/x 

44 Explain why £(~) = (^)(£) equation (8). 
Check this when z = y 2 , y — x\ 

Final practice with the chain rule and other rules (and other 
letters!). Find the x or t derivative of z or y. 


32 If/(3) = 3 and g(3) = 5 and /'( 3) = 2 and £ H (3) = 4, find the 
derivative at x — 3 if possible for 

(a)/(x)g(x) (b)/(g(x)) (c) £(/(x)) (d) /(ylx)) 

33 For F(x) = \x + 8, show' how r iteration gives F(F(x)) = 

ix + 12, Find F(F[F(x)}} — also called F (3> (x), The derivative 
of F w (x) is , 

34 In Problem 33 the limit of F Crt) (x) is a constant C = 

. From any start (try x = 0) the iterations x rt+] = 

F(x n ) converge to C. 

35 Suppose g{x) = 3x+ 1 and f{y} = ^(y- 1). Then/(g(x)) = 

and g(/(y)) = . These are inverse functions. 

36 Suppose g(x) is continuous at x = 4, say g(4) = 7. Suppose 
f(y) is continuous at y = l y say /(7) = 9. Then /(g(x)) is con- 
tinuous at x = 4 and /(#(4)) = 9. 

Proof e is given. Because is continuous, there is a 

d such that |/0?(x}) — 9| <z whenever |#(x) — 7[ < Then 


45 z=/(u<0) 

47 y = sin w(x)cos u(x) 

49 y = x 2 n(x) 

51 z — v 1 — M, u — v / 1 — X 
53 z =f{u\ u = r 2 , l- = y/t 


46 i = u 3 , u = x 3 
48 y = v u(f) 

50 y=/(x 2 ) + (/(x)) 2 
52 z — 1/w"(i) 

54 v = u, u = x, x = 1/r 


55 If/ - x 4 and g = x 3 then/' = 4x 3 and g — 3x 2 , The chain 
rule multiplies derivatives to get 12.v 5 , But f(g(x)} = x 12 and 
its derivative is not I2x 5 . Where is the flaw? 


56 The derivative of y = sin(sin x) is dyjdx = 

cos(cos x) sin(cos x)cos x cos(sin x)cos x cos(cos x)cos x, 

57 (a) A book has 400 words per page. There arc 9 pages per 

section. So there are words per section, 

(b) You read 200 words per minute. So you read „ 

pages per minute. How many minutes per section? 
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58 (a) You walk in a train at 3 miles per hour. The train 
moves at 50 miles per hour. Your ground speed is 
miles per hour. 

(b)You walk in a train at 3 miles per hour The train is 
shown on TV (1 mile train = 20 inches on TV screen). 
Your speed across the screen is inches per hour. 


59 Coke costs l / 3 dollar per bottle. The buyer gets _____ 

bottles per dollar. If dyjdx = 1/3 then dxjdy = « 

60 (Computer) Graph F(x) = sin.x and G(x) = sin (sin x ) — 
not much difference. Do the same for F(.x) and G'fx). Then 
plot F"(x) and G"(x) to see where the difference shows up. 



4.2 Implicit Differentiation and Related Rates 



We start with the equations xy — 2 and jr 1 xy = 3. As x changes, these v’s will 
change — to keep (x, y ) on the curve. We want to know dyjdx at a typical point . For 
xy — 2 that is no trouble, but the slope of y 5 + xy = 3 requires a new idea. 

In the first case, solve for y = 2 jx and take its derivative: dyjdx = - 2jx 2 . The curve 
is a hyperbola. At x = 2 the slope is -2/4 = -1/2. 

The problem with y 5 + xy = 3 is that it can’t be solved for y. Galois proved that 
there is no solution formula for fifth-degree equations.! The function y(x) cannot 
be given explicitly. All we have is the implicit definition of y, as a solution to 
y 5 + xy = 3. The point x = 2, y = 1 satisfies the equation and lies on the curve, but 
how to find dyjdx' ? 

This section answers that question. It is a situation that often occurs. Equations 
like sin y + sin x = 1 or y sin y = x (maybe even sin y = x) are difficult or impossible 
to solve directly for y. Nevertheless we can find dyjdx at any point. 

The way out is implicit differentiation. Work with the equation as it stands. Find 
the x derivative of every term fn y 5 + xy = 3. That includes the constant term 3, whose 
derivative is zero. 


EXAMPLE 1 The power rule for y 5 and the product rule for xy yield 


dx 


.4y 

dx 


5y*T- + x~ + y = 0. 


( 1 ) 


Now substitute the typical point x = 2 and y = 1, and solve for dyjdx: 

dy dy dy l 

5-^- + 2-r - + 1 = 0 produces — = — . (2) 

dx dx F dx 1 ' 

This is implicit differentiation (ID), and you see the idea: Include dyjdx from the 
chain rule, even if y is not known explicitly as a function of x. 


EXAMPLE 2 


ay 

sin y + sin x = 1 leads to cos y — + cos x = 0 

dr 


EXAMPLE 3 


y sin y = x leads to y cos y 


dy 

dx 


dy 

+ sin y — 
dx 


= 1 


Knowing the slope makes it easier to draw the curve. We still need points (x, y) 
that satisfy the equation. Sometimes we can solve for x . Dividing y 5 + xy = 3 by y 


tThat was before he went to the famous duel, and met his end. Fourth-degree equations do 
have a solution formula, but it is practically never used. 
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gives x = 3 \y - y 4 . Now the derivative (the x derivative!) is 

1= (-?- v )* = - 7 £ ,3) 

Again dy/dx = — 1/7. All these examples confirm the main point of the section: 


4B ( Implicit differentiation) An equation F(x, y) = 0 can be differentiated 
directly by the chain rule, without solving for y in terms of x. 


The example xy = 2, done implicitly, gives x dy/dx + y = 0. The slope dy/dx is -y/x. 
That agrees with the explicit slope — 2/x 2 . 

ID is explained better by examples than theory (maybe everything is). The essential 
theory can be boiled down to one idea: “Go ahead and differentiate." 


EXAMPLE 4 Find the tangent direction to the circle x 2 + y 2 = 25. 

We can solve for y = ± ^25 - x 2 , or operate directly on x 2 + / = 25: 

dy x 

or — = - 
dx y 


dy 

2x + 2y/ = 0 
dx 


(4) 


Compare with the radius, which has slope y/x. The radius goes across x and up y. 
The tangent goes across - y and up x. The slopes multiply to give (- x/y)(y/x) = - 1 . 

To emphasize implicit differentiation, go on to the second derivative. The top of the 
circle is concave down, so d 2 y/dx 2 is negative. Use the quotient rule on -x/y: 

dy x d 2 y y dx/dx — x dy/dx y + (* 2 /y) y 2 + x 2 

j 2 2 2 3 * (5) 

dx y dx y y y 


RELATED RATES 


There is a group of problems that has never found a perfect place in calculus. They 
seem to fit here — as applications of the chain rule. The problem is to compute 
df/dt. but the odd thing is that we are given another derivative dg/dt. To find df/dt , 
we need a relation between/ and g. 

The chain rule is df/dt = ( df/dg)(dg/dt ). Here the variable is / because that is typical 
in applications. From the rate of change of g we find the rate of change of f. This is 
the problem of related rates , and examples will make the point. 

EXAMPLE 5 The radius of a circle is growing by dr/dt = 7. How fast is the circum- 
ference growing? Remember that C = 2nr (this relates C to r). 

I I I 

Solution — = — = (2 tt)(7) = 1 4tt . 

dt dr dt 

That is pretty basic, but its implications are amazing. Suppose you want to put a 
rope around the earth that any 7-footer can walk under. If the distance is 24,000 
miles, what is the additional length of the rope? Answer: Only 147r feet. 

More realistically, if two lanes on a circular track are separated by 5 feet, how 
much head start should the outside runner get? Only 10^ feet. If your speed around 
a turn is 55 and the car in the next lane goes 56, who wins? See Problem 14. 

Examples 6-8 are from the 1988 Advanced Placement Exams (copyright 1989 by 
the College Entrance Examination Board). Their questions are carefully prepared. 
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Rg. 4.3 Rectangle for Example 6, shadow for Example 7, balloon for Example 8. 

EXAMPLE 6 The sides of the rectangle increase in such a way that dzjdt= 1 and 
dxjdt = 3dyjdt. At the instant when x = 4 and y = 3, what is the value of dxjdt"? 

Solution The key relation is x 2 + y 2 — z 2 . Take its derivative (implicitly): 

* dx dy dz n dx ,dy 

2 x — -I- 2y — = 2z — produces 8 — + 6 — = 10 . 

dt dt dt dt dt 

We used all information, including z — 5, except for dxjdt = 3dyjdt. The term 6 dyjdt 
equals Idxjdt, so we have 10 dxjdt = 10 . Answer: dxjdt — 1 . 


EXAMPLE 7 A person 2 meters tall walks directly away from a streetlight that is 8 
meters above the ground. If the person’s shadow is lengthening at the rate of 4/9 
meters per second, at what rate in meters per second is the person walking? 

Solution Draw a figure! You must relate the shadow length s to the distance x from 
the streetlight. The problem gives dsjdt= 4/9 and asks for dxjdt: 

_ . „ . x s dx 6 ds 

By similar triangles - - - so — = - — - (3) 

6 2 dt 2 dt 

Note This problem was hard. I drew three figures before catching on to x and $. 
It is interesting that we never knew x or s or the angle , 



EXAMPLE 8 An observer at point A is watching balloon B as it rises from point C. 
(The figure is given.) The balloon is rising at a constant rate of 3 meters per second 
(this means dyjdt = 3) and the observer is 100 meters from point C. 

(a) Find the rate of change in z at the instant when y — 50. (They want dzjdt.) 

z 2 = y 2 + 100 2 => 2z — = 2 y — 
dt dt 


z — y/50 2 + 100 2 = 50^/5 => ^ = — 1 °'^ = Vg, 

v dt 2-50^5 5 

(b) Find the rate of change in the area of right triangle BCA when y= 50. 

1 dA dy 

A ~ -(100){y) = 50y — = 50-^ = 50-3 = 150, 

2 at dt 


(c) Find the rate of change in 0 when y = 50. (They want dOjdt.) 


tan 



„ 100 2 

= 50 => cos 0 = 7 - = — 7 = 

50^5 75 

sec2 ^ = 2_^^^=m 2 j_ 

dt mdt dt yjs) 100 


3 

125 
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In oil problems I first wrote down a relation from the figure . Then I took its derivative. 
Then I substituted known information. (The substitution is after taking the derivative 
of tan 8 - yj 100. If we substitute y- 50 too soon, the derivative of 50/100 is useless.) 

“Candidates are advised to show their work in order to minimize the risk of not 
receiving credit for it” 50% solved Example 6 and 21% solved Example 7. From 
12,000 candidates, the average on Example 8 (free response) was 6.1 out of 9. 

EXAMPLE 9 A is a lighthouse and BC is the shoreline (same figure as the balloon). 
The light at A turns once a second [d8jdt = 2n radians/second). How quickly does 
the receiving point B move up the shoreline? 

Solution The figure shows y = 100 tan 8. The speed dyjdt is 100 sec 2 0 ddjdt. This is 
200* sec 2 0, so B speeds up as sec 0 increases. 

Paradox When 8 approaches a right angle, sec0 approaches infinity. So does 
dyjdt. B moves faster than lightl This contradicts Einstein’s theory of relativity. The 
paradox is resolved (I hope) in Problem 18. 

If you walk around a light at A , your shadow at B seems to go faster than light. 
Same problem. This speed is impossible — something has been forgotten. 

Smaller paradox (not destroying the theory of relativity). The figure shows y = z sin 8. 
Apparently dyjdt = (dz/dt) sin 8. This is totally wrong . Not only is it wrong, the exact 
opposite is true: dz/dt — [dyjdt) sin 8. If you can explain that (Problem 15), then ID 
and related rates hold no terrors. 


4.2 EXERCISES 


Read-through questions 

For x 3 + y 3 = 2 the derivative dyjdx comes from o 
differentiation . We don’t have to solve for b , Term by 
term the derivative is 3x 2 + c = 0. Solving for dyjdx gives 
d . At x = y — 1 this slope is • . The equation of the 

tangent line is y — 1 = t . 

A second example is y 1 = x. The x derivative of this 
equation is o . Therefore dyjdx = h . Replacing y 
by yfx, this is dyjdx - 1 . 

In related rates, we are given dgjdt and we want dfjdt. We 
need a relation between / and J . If /= g 2 , then [dfjdt) — 

fc (dgjdt). If / 2 +£ 2 = l, then dffdt= , , I . If the 
sides of a cube grow by dsjdt = 2 > then its volume grows by 
dVjdt = m , To find a number (8 is wrong), you also need 
to know n . 


By implicit differentiation find dyjdx in 1-10. 


1 y* + x* = 1 
3 (x — y) 2 = 4 
5 x = F(y) 

7 x 2 y = y 2 x 


2 x 2 y + y 2 x = 1 
4 y/x + yfy = 3 at x = 4 
6fix) + F(y) = xy 
8 x = sin y 


9 x = tan y 10 / - x at x - 1 

11 Show that the hyperbolas xy = C are perpendicular to the 
hyperbolas x 1 — y 2 —D. (Perpendicular means that the pro- 
duct of slopes is —1.) 

12 Show that the circles (x — 2) 2 4- y 2 = 2 and x 2 + (y — 2) 2 = 
2 are tangent at the point (1,1). 

13 At 25 meters/second, does your car turn faster or slower 
than a car traveling 5 meters further out at 26 meters/second? 
Your radius is (a) 50 meters (b) 100 meters. 

14 Equation (4) is 2x + 2 y dyjdx *= 0 (on a circle). Directly by 
ID reach d 2 yjdx 2 in equation (5). 


Problems 15-18 resolve the speed of light paradox in 
Example 9. 

15 (Small paradox first) The right triangle has z 2 = y 2 4- 100 2 . 
Take the t derivative to show that z* — / sin 6. 

16 (Even smaller paradox) As B moves up the line, why is 

dyjdt larger than dzfdtf Certainly z is larger than y. But as 6 
increases they become . 

17 (Faster than light) The derivative of y =100 tan 6 in 
Example 9 is y — 100 sec 2 0 O' - 200 n sec 2 0. Therefore / 
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passes c (the speed of light) when sec 2 0 passes . 

Such a speed is impossible — we forget that light takes time 
to reach B . 


9 increases by 2 n 
in 1 second 

t is arrival time 
of light 

6 is different from 2nt 

18 (Explanation by ID) Light travels from A to B in time 
zjc t distance over speed. Its arrival time is t = 9j2n + zjc so 
9 t j2n — 1 — z'jc* Then z' = y'sin0 and f — 100 sec 2 0 O' (all 
these are ID) lead to 

/ = 20Ottc/(c cos 2 0 + 2Q0n ain 0) 

As B approaches *r/2, this speed approaches . 

Note: y still exceeds c for some negative angle. That is for 
Einstein to explain. See the 1985 College Moth Journal, page 
186, and the 1960 Scientific American , “Things that go faster 
than light” 

19 If a plane follows the curve y =/(x), and its ground speed 
is dxjdt = 500 mph, how fast is the plane going up? How fast 
is the plane going? 

20 Why can’t we differentiate x = 7 and reach 1=0? 
Problems 21-29 are applications of related rates. 
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shadow is 8 miles from you (the sun is overhead); (c) the plane 
is 8 miles from you (exactly above)? 

25 Starting from a 3-4—5 right triangle, the short sides 
increase by 2 meters/second but the angle between them 
decreases by 1 radian/second. How fast does the area increase 
or decrease? 

26 A pass receiver is at x = 4, y = 8i. The ball thrown at 
I — 3 is at x — c(t — 3), y — 10c(t — 3), 

(а) Choose c so the ball meets the receiver 

*(b) At that instant the distance D between them is chang- 
ing at what rate? 

27 A thief is 10 meters away (8 meters ahead of you, across 
a street 6 meters wide). The thief runs on that side at 7 meters/ 
second, you run at 9 meters/second. How fast are you 
approaching if (a) you follow on your side; (b) you run toward 
the thief; (c) you run away on your side? 

28 A spherical raindrop evaporates at a rate equal to twice 
its surface area. Find drjdt. 

29 Starting from P=V = 5 and maintaining PV=T, find 
dVjdt if dPjdt = 2 and dTjdt = 3. 

JO (a) The crankshaft AB turns twice a second so dOjdt — 

(б) Differentiate the cosine law 6 2 = 3 2 + x 2 — 2 (3* cos 0) 
to find the piston speed dxjdt when 0 = tt/2 and 0 =f n. 


21 (Calculus classic) The bottom of a 10-foot ladder is going 
away from the wall at dxjdt = 2 feet per second. How fast is 
the top going down the wall? Draw the right triangle to find 
dyjdt when the height y is (a) 6 feet (b) 5 feet (c) zero. 

22 The top of the 10-foot ladder can go faster than light. At 
what height y does dyjdt = — c? 

23 How fast does the level of a Coke go down if you drink 
a cubic inch a second? The cup is a cylinder of radius 
2 inches — first write down the volume. 

24 A jet flies at 8 miles up and 560 miles per hour. How fast 
is it approaching you when (a) it is 16 miles from you; (b) its 


31 A camera turns at C to follow a rocket at R. 

(a) Relate dzjdt to dyjdt when y = 10. 

(b) Relate dOjdt to dyjdt based on y = 10 tan 0. 

(c) Relate d 2 0jdt 2 to d 2 yjdt 2 and dyjdt. 





4.3 Inverse Functions and Their Derivatives 



There is a remarkable special case of the chain rule. It occurs when^y) and gfx) are 
“ inverse functions That idea is expressed by a very short and powerful equation: 
f(g(x )) = Here is what that means. 

Inverse functions : Start with any input, say x — 5. Compute y = g(x), say y = 3. Then 
compute J{y), and the answer must be 5. What one function does, the inverse function 
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undoes* If g(S) = 3 then /(3) = 5. The inverse function f takes the output y back to the 
input x * 

EXAMPLE 1 g(x) = x — 2 and J\y) = y + 2 are inverse functions* Starting with x = 5, 
the function g subtracts 2. That produces y = 3* Then the function / adds 2* That 
brings back x — 5* To say it directly: The inverse of y — x — 2 is x = y + 2. 

EXAMPLE 2 y = g(x) = f {x - 32) and x-J\y) = jy + 32 are inverse functions (for 
temperature). Here x is degrees Fahrenheit and y is degrees Celsius* From x = 32 
(freezing in Fahrenheit) you find y = 0 (freezing in Celsius)* The inverse function takes 
y = 0 back to x — 32* Figure 4.4 shows how x = 50° F matches y = 10°C* 

Notice that %(x- 32) subtracts 32 first. The inverse fy + 32 adds 32 last. In the 
same way g multiplies last by § while / multiplies first by f * 



domain of/ -range of g 
y> 0 


■o. 


x>0 

range of /= domain of g 
Fig* 4.4 V F to °C to °F. Always £“ *(#(*)) = x and gfg _1 (y)) = >. If/=jf _1 then g=f~ 1 > 


The inverse function is written f— g ~ 1 and pronounced “ g inverse*’" It is not l/g(x)* 

If the demand y is a function of the price x, then the price is a function of the demand. 
Those are inverse functions* Their derivatives obey a fundamental rule : dyjdx times 
dxjdy equals I. In Example 2, dyjdx is 5/9 and dxjdy is 9/5. 

There is another important point* When /and g are applied in the opposite order , 
they still come back to the start. First / adds 2, then g subtracts 2. The chain g(/(y)) — 
(y + 2) — 2 brings back y* Iff is the inverse of g then g is the inverse off The relation 
is completely symmetric, and so is the definition: 

Inverse function : If y = g(x) then x = g ~ L (y) . If x = g ~ *(y) then y = g(x). 

The loop in the figure goes from x to y to x. The composition g~ l (g{x)) is the “identity 
function.” Instead of a new point z it returns to the original x * This will make the 
chain rule particularly easy — leading to {dyjdx)(dxjdy) = 1. 

EXAMPLE 3 y = g(x) = yfx and x = f(y) = y 2 are inverse functions. 

Starting from x = 9 we find y = 3* The inverse gives 3 2 = 9. The square of yfx is 
/Kg(x)) = x. In the opposite direction, the square root of y 2 is g{f{y)) = y. 

Caution That example does not allow x to be negative. The domain of g — the set 
of numbers with square roots — is restricted to x ^ 0* This matches the range of g~ l . 
The outputs y 1 are nonnegative. With domain of g = range of g~ 1 ^ the equation x = 
U/xf is possible and true. The nonnegative x goes into g and comes out of g *. 

In this example y is also nonnegative. You might think we could square anything, 
but y must come back as the square root of y 2 . So y ^0. 

To summarize: The domain of a function matches the range of its inverse. The inputs 
to g^ 1 are the outputs from g * The inputs to g are the outputs from g -1 . 
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/jf g(x) = y then solving that equation for x gives x = g~ *(y): 

if y = 3jc — 6 then x = i(y + 6) (this is g " A (y)) 

if y = x 3 + 1 then x = y - 1 (this is g~ x (y)) 

In practice that is how g~ x is computed: Solve g(x) — y. This is the reason inverses 
are important. Every time we solve an equation we are computing a value of g ~ 1 * 
Not all equations have one solution* Not all Junctions have inverses * For each y , 
the equation g(x ) — y is only allowed to produce one x , That solution is x — g~ *(y). 
If there is a second solution, then g _1 will not be a function — because a function 
cannot produce two x’s from the same y , 

EXAMPLE 4 There is more than one solution to sinx = ^, Many angles have the 
same sine* On the interval 0 < x < rc, the inverse of y = sin x is not a function* 
Figure 4*5 shows how two x’s give the same y. 

Prevent x from passing te/2 and the sine has an inverse. Write x = sin” *y. 

The /unction g has no inverse if two points x x and x 2 give gix^^ g(x 2 )* Its inverse 

would have to bring the same y back to x x and x 2 , No function can do that; g _l (y) 

cannot equal both x x and x 2 , There must be only one x for each y. 

To be invertible over an interval , g must be steadily increasing or steadily decreasing. 



Fig. 4,5 Inverse exists (one x for each y). No inverse function (two x’s for one y). 


THE DERIVATIVE OF g ^ 

It is time for calculus. Forgive me for this very humble example, 

EXAMPLE 5 (ordinary multiplication) The inverse of y ~ g(x) = 3x is x ~J[y) ~ jy. 

This shows with special clarity the rule for derivatives: The slopes dyjdx = 3 and 
dxjdy — ^ multiply to give 1 * This rule holds for all inverse functions, even if their 
slopes are not constant. It is a crucial application of the chain rule to the derivative 
offlsto) = x. 


4C (Derivative of inverse Junction) From J[g{x)) ^ x the chain rule gives 
/'(ffWM*) * 1* Writing y — g(x) and x =/(y), this rule looks better: 

dx dy dx _ 1 

dy dx dy dyjdx ‘ 

The slope of x ^ g~ x (y) times the slope of y = g(x) equals one* 


This is the chain rule with a special feature. Since /(g(x)) = x, the derivative of both 
sides is 1* If we know g* we now know/'. That rule will be tested on a familiar 
example. In the next section it leads to totally new derivatives* 
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EXAMPLE 6 The inverse of y = x 3 is x = y 1/3 , We can find dxjdy two ways: 


directly: 


dx 

dy 



. . dx 1 1 

mdirCCtly: ^ = d^ = 3? 


1 

3 ^ 2 / 3 ' 


The equation (dxjdy){dyjdx) = 1 is not ordinary algebra, but it is true. Those deriva- 
tives are limits of fractions. The fractions are (Ax/Ay)(Ay/Ax) = 1 and we let Ax -► 0, 



Hg. 4.6 Graphs of inverse functions: x — iy is the minor image of y — 3x. 


Before going to new functions, I want to draw graphs. Figure 4.6 shows y — yfx 
and y= 3x. What is special is that the same graphs also show the inverse functions . 
The inverse of y = is x = y 2 . The pair x = 4, y = 2 is the same for both. That is 
the whole point of inverse functions — if 2 = g(4) then 4 = g _1 (2). Notice that the 
graphs go steadily up. 

The only problem is, the graph of x = g~ i (y) is on its side. To change the slope 
from 3 to 5, you would have to turn the figure. After that turn there is another 
problem — the axes don’t point to the right and up. You also have to look in a mirror! 
(The typesetter refused to print the letters backward. He thinks it's crazy but it’s not.) 
To keep the book in position, and the typesetter in position, we need a better idea. 

The graph of x-^y comes from turning the picture across the 45 0 line. The y axis 
becomes horizontal and x goes upward. The point (2, 6) on the line y = 3x goes into 
the point (6,2) on the line x-^y. The eyes see a reflection across the 45° line 
(Figure 4.6c). The mathematics sees the same pairs x and y. The special properties of 
g and g~ L allow us to know two functions — and draw two graphs — at the same 
time.t The graph of x = g~ 1 (y) is the mirror image of the graph of y = g(x). 


EXPONENTIALS AND LOGARITHMS 


I would like to add two more examples of inverse functions, because they are so 
important. Both examples involve the exponential and the logarithm. One is made up 
of linear pieces that imitate 2 X ; it appeared in Chapter 1. The other is the true function 
2 X , which is not yet defined — and it is not going to be defined here. The functions b x 
and log b y are so overwhelmingly important that they deserve and will get a whole 
chapter of the book (at least). But you have to see the graphs. 

The slopes in the linear model are powers of 2. So are the heights y at the start of 
each piece. The slopes 1, 2, 4, ... equal the heights 1, 2, 4, ... at those special points. 
The inverse is a discrete model for the logarithm (to base 2). The logarithm of 1 is 

0, because 2° = 1. The logarithm of 2 is 1, because 2 1 = 2. The logarithm of 2 J is the 
exponent _/. Thus the model gives the correct jc = log2y at the breakpoints y = 

1, 2, 4, 8 , .... The slopes are 1, i, ... because dxjdy = 1 j{dyjdx). 


tl have seen graphs with y — g(x) and also y=^g J (x). For me that is wrong: it has to be 
x — g~ l (y). If y = sin x then x = sin" l y+ 
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The model is good, but the real thing is better. The figure on the right shows the 
true exponential y = 2 X . At x = 0, 1,2, ... the heights yare the same as before. But 
now the height at x = j is the number 2 1/2 , which is Jz. The height at x = .10 is the 
tenth root 2 l/1 ° = 1,07,,,, The slope at x- 0 is no longer 1 — it is closer to Ay/ Ax = 
,07/. 10, The exact slope is a number c (near ,7) that we are not yet prepared to reveal. 

The special property of y = 2 X is that the slope at all points is cy. The slope is 
proportional to the function, The exponential solves dyjdx = cy. 

Now look at the inverse function — the logarithm , Its graph is the mirror image: 

If y = 2 X then x = log 2 y. // 2 1/10 s? 1.07 then log 2 1.07 ^ 1/10. 

What the exponential does, the logarithm undoes — and vice versa. The logarithm of 
2 X is the exponent x. Since the exponential starts with slope c, the logarithm must 
start with slope 1/c. Check that numerically. The logarithm of 1.07 is near 1/10. The 
slope is near .10/.07, The beautiful property is that dxjdy = 1 jcy. 



H 9 .4.7 

Piecewise linear models and smooth curves: y = 2* and x = log 2 > 1 * Base h = 2. 

1 have to mention that calculus avoids logarithms to base 2. The reason lies in that 
mysterious number c. It is the “natural logarithm” of 2, which is .693147... — and 
who wants that? Also 1/. 693147... enters the slope of log 2 y. Then (dxjdy^dyjdx) = 1. 
The right choice is to use “natural logarithms” throughout. In place of 2, they are 
based on the special number e\ 

y = e x is the inverse of x = In y. (2) 

The derivatives of those functions are sensational — they are saved for Chapter 6. 
Together with x n and sin x and cos x, they are the backbone of calculus. 

Note It is almost possible to go directly to Chapter 6. The inverse functions x - 
sin _1 y and x ~ tan _1 y can be done quickly. The reason for including integrals first 
(Chapter 5) is that they solve differential equations with no guesswork: 

dy dx\ f f dy 

-f- - y or — = - leads to \dx = I — or x = in y + C. 

dx ’ dy y J J y 

Integrals have applications of all kinds, spread through the rest of the book. But do 
not lose sight of 2 X and They solve dyjdx = cy — the key to applied calculus, 

THE INVERSE OF A CHAIN MffOr)) 

The functions g(x)-x- 2 and h(y)- 3 y were easy to invert. For £ _1 we added 2 , 
and for h~ 1 we divided by 3. Now the question is: If we create the composite function 
z - h(g(x)), or z = 3(x — 2), what is its inverse? 
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Virtually all known functions are created in this way, from chains of simpler 
functions. The problem is to invert a chain using the inverse of each piece. The answer 
is one of the fundamental rules of mathematics: 


4D The inverse of z = /i(g(jc)) is a chain of inverses in the opposite order : 

x = g“ 1 (ft“ 1 {z)). (3) 

h~* is applied first because h was applied last: g“ 1 (/i^ 1 (/i(^(x)))) = x. 


That last equation looks like a mess, hut it holds the key. In the middle you see 
h ~ 1 and h. That part of the chain does nothing! The inverse functions cancel, to leave 
# But that is x. The whole chain collapses, when g~ l and h~ l are in the 

correct order— which is opposite to the order of h(g(x )). 

EXAMPLE 7 z — h(g(x)) = 3(x - 2) and x = g l (h ~ \z)) = U + 2. 

First h~ 1 divides by 3. Then g 1 adds 2. The inverse of h°g is g ~ 1 c \ It can be 
found directly by solving z = 3(x - 2). A chain of inverses is like writing in prose — we 
do it without knowing it. 

EXAMPLE 8 Invert z = v "'"x — 2 by writing z 1 = x - 2 and then x = z 2 + 2, 

The inverse adds 2 and takes the square — but not in that order. That would give 
(z + 2) 2 , which is wrong. The correct order is z 2 + 2. 

The domains and ranges are explained by Figure 4.8. We start with x^2. 
Subtracting 2 gives y ^ 0. Taking the square root gives z ^ 0. Taking the square 
brings back y > 0. Adding 2 brings back r r ^ 2— which is in the original domain of g> 



i>(x) : = h(\) y = /r't:) .v = g ] {y) y=2.v z=y-l y - z + l ,v = |v 


Rg. 48 The chain g *(& 1 (/i(^(.\')))) = x is one-to-one at every step. 


EXAMPLE 9 Inverse matrices (AB ) " 1 = B 1 A 1 (this linear algebra is optional). 

Suppose a vector x is multiplied by a square matrix B: y = g(.\) = Bx. The inverse 
function multiplies by the inverse matrix: x = g~ ! (y) = B~ l y. It is like multiplication 
by B — 3 and B~ 1 = 1/3, except that x and y are vectors. 

Now suppose a second function multiplies by another matrix A: z = = ABx. 

The problem is to recover x from z. The first step is to invert A n because that came 
last: Bx = A~ i z< Then the second step multiplies by B~ l and brings back x = 
B~ l A~'z, The produet B 1 A 1 inverts the product AB. The rule for matrix inverses 
is like the rule for function inverses — in fact it is a special case. 

I had better not wander too far from calculus. The next section introduces the 
inverses of the sine and cosine and tangent, and finds their derivatives. Remember 
that the ultimate source is the chain rule. 
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4.3 EXERCISES 


Read-through questions 

The functions g(x) “ x — 4 and J{y) — y + 4 are o func- 
tions, because /Tj?fxti= b , Also jgf /T vti = c . The 
notation is /=£ -1 and g = d The composition ft is 
the identity function. By definition x^g~ 1 (y) if and only if 
y — 1 When y is in the range of g t it is in the g of 

g~K Similarly x is in the h of g when it is in the I 
of If g has an inverse then gfaJ J eix^) at any two 
points. The function g must be steadily k or steadily 

i 

The chain rule applied to /(g(x)) = x gives idfjdvM m \ = 

n . The slope of g~ 1 times the slope of g equals o . 
More directly dxjdy = 1/_P_. For y = 2x+l and x = 
i(y— 1 ), the slopes are dv/dx — q and dx/dv= r 
For y = x z and x = * the slopes are dyjdx = t and 

dx/dy = u . Substituting x 2 for y gives dxjdy = v . 
Then {dxjdy)(dyfdx) = w . 

The graph of y — g(x) is also the graph of x = * . but 

with x across and y up. For an ordinary graph of g ^ take 
the reflection in the line y If (3, 8 ) is on the graph of g> 
then its mirror image f * ) is on the graph of g" 1 . Those 

particular points satisfy 8 = 2 3 and 3 = A . 

The inverse of the chain z = h(g(x)) is the chain x = B 
If g(x) — 3x and h(y) = y 3 then z = C . Its inverse is x = 

d . which is the composition of e and f , 


Solve equations 1-10 for x, to find the inverse function x = 
g - 1 (y), When more than one x gives the same y, write 


“no inverse.” 


1 y = 3x - 6 

2 y = Ax + B 

3 y = x 2 - 1 

4 y = xj(x— 1 ) [solve xy 

5 y= l + x~ l 

II 

7 y = x J -I 

8 > = 2 x + |x| 

9 y = sin x 

10 y = x ,/s [draw graph] 


1 1 + ay 

II Solving y — ■ gives xy — ay — \ or x — . Now 

solve that equation for y. 


x 4" 1 y H“ 1 

12 Solving y = - ^ gives xy — y = x + 1 or x = -. Draw 


the graph to see why / and/ 1 are the same. Compute dyjdx 
and dxjdy . 


13 Suppose /is increasing andy(2) = 3 and_ft3) = 5. What can 
you say about / _ 1 (4)? 


14 Suppose J{2) = 3 and /(3) = 5 and ^5) = 5. What can you 
say about / -1 ? 


15 Suppose y{2) = 3 and.ft3) = 5 and_ft 5 ) = 0. How do you 
know that there is no function / -1 ? 

16 Vertical line test If no vertical line touches its graph twice 

then /x) is a function (one y for each x). Horizontal tine 
test. If no horizontal line touches its graph twice then^fx) is 
invertible because . 

17 Ifyfx) and g(x) are increasing, which two of these might 
not be increasing? 

yw+sw ytosw /(*(*)) / _l w * /tw 

18 If y = 1/x then x — 1/y. If y = 1 — x then x = 1 — y. The 
graphs are their own mirror images in the 45° line. Construct 
two more functions with this property /=/ _1 orf[J{x)) = x, 

19 For which numbers m are these functions invertible? 

(a) y = mx + b (b) y = mx + x 3 (c) y = mx + sin x 

20 From its graph show that y = ]x[ + cx is invertible if c > 1 

and also if c < — 1. The inverse of a piecewise linear function 
is piecewise 


In 21-26 find dyjdx in terms of x and dxjdy in terms of y. 

21 y = x 5 22 y = l/(x — 1 ) 

23 y = x 3 - 1 24 y = 1/x 3 


25 y = 


x- 1 


26 y = 


ax + b 

cx-\-d 


27 If dyjdx — 1 jy then dxjdy — and x — . 

28 If dxjdy = 1/y then dyjdx = (these functions are 

y = e* and x — In y, soon to be honored properly). 

29 The slopes of J{x) = 3X 3 and g(x) = - 1/x are x 2 and 1/x 2 . 
Why isn’t /= g“ *? What is *? Show that g , (g~ 1 )' = 1. 

30 At the points x^ x 2 , x$ a piecewise constant function 
jumps to y i, y 2 , y 3 . Draw its graph starting from y(0) = 0. 
The mirror image is piecewise constant with jumps at the 

points __ to the heights . Why isn’t this the 

inverse function? 


In 31-38 draw the graph of y = g(x). Separately draw its mirror 
image x=g~ l (y). 


31 y = 5x-10 
33 >=1 /(jc+1) 
35 y = I0 1 
37 y = 2~* 


32 y = cos 
34 3 ? = |x| — lx 
36 y = — x 2 , 0 ^ x ^ 1 

38 y = l/,/l —x\ 0 ^ x < 1 


In 39-42 find dxjdy at the given point. 

39 y = sin x at x = ji/6 40 y — tan x at x = jt/4 

41 y = sin x 2 at x = 3 42 y = x — sin x at x = 0 
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43 If y is a decreasing Function of x, then x is a 

function of y. Prove by graphs and by the chain rule. 


54 Newton’s method solves J[x*) = 0 by applying a linear 
approximation to/’ 1 : 


44 If f(x) > x for all jc* show that / *(y) < y. 


r 1 <o)*r I (y) + (4T‘/‘f>')(0->’). 


45 True or false, with example: 

(a) If/(x) is invertible so is fc(x) = t/W) 2 . 

(b) lfj[x) is invertible so is fc(x) =JJJ[x))> 

(c) /~ '(y) has a derivative at every y. 


For y—f[x) this is Newton’s equation jc* & x + . 

55 If the demand is l/(p+ 1) 2 when the price is p, then the 

demand is y when the price is . If the range of prices 

is p > 0, what is the range of demands? 


In the chains 46-51 write down g(x) and f(y) and their inverses. 
Then find * -£ -1 (/~ 1 {z))« 

46 z = 5(x- 4) 47 z^(x m f 

48 z = (6 +jc ) 3 49 z = 6 + x 3 

50 z - fyx + 4) + 4 51 z = log(10*) 

52 Solving /(x) = 0 is a large part of applied mathematics. 

Express the solution x* m terms of /’ l : x* ss . 

53 (a) Show by example that d 2 xjdy 2 is not l/(d 2 y/dx 2 ). 

(b) If y is in meters and x is in seconds, then d 2 y/dx 2 is in 
and d 2 xjdy 2 is in . 


56 If dFjdx—ffx) show that the derivative of G(y) = 

57 For each number y find the maximum value of yx — 2x 4 . 
This maximum is a function G(y), Verify that the derivatives 
of G{y) and 2x 4 are inverse functions. 

58 (for professors only) If <j(y) is the maximum value of 
yx — F(x), prove that F(x) is the maximum value of xy — G(y). 
Assume that J[x) *= dFjdx is increasing* like 8x 3 in 
Problem 57. 

59 Suppose the richest x peroent of people in the world have 

10^x peroent of the wealth. Then y percent of the wealth is 
held by percent of the people. 



4.4 Inverses of Trigonometric Functions 



Mathematics is built on hasic functions like the sine, and on basic ideas like the 
inverse. Therefore it is totally natural to invert the sine function. The graph of x = 
sin " l y is a mirror image of y — sin x. This is a case where we pay close attention to 
the domains, since the sine goes up and down infinitely often. We only want one piece 
of that curve, in Figure 4.9. 

For the hold line the domain is restricted. The angle x lies between —n/2 and 4- tt/ 2. 
On that interval the sine is increasing, so each y comes from exactly one angle x. If 
the whole sine curve is allowed, infinitely many angles would have sin x = 0. The sine 




Hfl, 4.9 Graphs of sin x and sin 1 y. Their slopes are cos x and l/^/l — y 2 . 
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function could not have an inverse. By restricting to an interval where sin x is increas- 
ing, we make the function invertible. 

The inverse function brings y back to x. It is x = sin" l y (the inverse sine): 

x = $in -1 y when y - sin x and |x| ^ n/2. (1) 


The inverse starts with a number y between -1 and 1. It produces an angle x = 
sin^*y — the angle whose sine is y. The angle x is between -n/2 and nj 2, with the 
required sine. Historically x was called the “arc sine” of y, and arcsin is used in 
computing. The mathematical notation is sin -1 . This has nothing to do with 1/sin x. 

The figure shows the 30° angle x = n/ 6. Its sine is y = 5 . The inverse sine of { is n/6. 
Again: The symbol sin ^ A (l) stands for the angle whose sine is 1 {this angle is x = 
n/2). We are seeing g~ J (g(x)) — x: 


sin' 1 (sin .x) = x for 



sin(sin l y) = y for - 1 ^ y ^ 1 , 


EXAMPLE 1 (important) If sin x = y find a formula for cos x. 

Solution We are given the sine, we want the cosine. The key to this problem must 
be cos 2 * “ 1 — sin 2 *. When the sine is y, the cosine is the square root of 1 - y 2 : 

cos * = cos(sin“ l y) = ^/l - y 2 . (2) 

This formula is crucial for computing derivatives. We use it immediately. 


THE DERIVATIVE OF THE INVERSE SINE 


The calculus problem is to find the slope of the inverse function J\y) = sin - 1 y. 
The chain rule gives ( slope of inverse function) = l /(slope of original function ). 
Certainly the slope of sin * is cos x. To switch from x to y, use equation (2): 

dy dx 11 

y = sin x gives — = cos * so that — = = - / (3) 

dx dy cos x - y 2 

This derivative l/^/l -*y 2 gives a new v-f pair that is extremely valuable in calculus: 

velocity u(f) = l/y/l ~ t 2 distance f(t) = sin 

Inverse functions will soon produce two more pairs, from the derivatives of tan^y 
and sec" l y. The table at the end lists all the essential facts. 

EXAMPLE 2 The slope of sin _1 y at y=l is infinite: \/yf\ ~ y 2 = 1/0. Explain. 

At y = l the graph of y = sin x is horizontal. The slope is zero. So its mirror image 
is vertical. The slope 1/0 is an extreme case of the chain rule. 

Question What is d\dx (sin" **)? Answer 1 fJT- * 2 . I just changed letters. 

THE INVERSE COSINE AND ITS DERIVATIVE 


Whatever is done for the sine can be done for the cosine, But the domain and range 
have to be watched. The graph cannot be allowed to go up and down. Each y from 
- 1 to 1 should be the cosine of only one angle x. That puts x between 0 and n. Then 
the cosine is steadily decreasing and y — cos * has an inverse: 

cos - ^cos *) = * and cos(cos ~ l y) ~ y. 


(4) 
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The cosine of the angle x = 0 is the number j? s L The inverse cosine of y — I is the 
angle x — 0* Those both express the same fact, that cos 0 - 1. 

For the slope of cos" we could copy the calculation that succeeded for sin ” l y . 
The chain rule could be applied as in (3). But there is a faster way, because of a 
special relation between cos" l y and sin” 1 y. Those angles always add to a right angle : 

cos -1 ^ + sin“ l y = n/2. (5) 

Figure 4.9c shows the angles and Figure 4TGc shows the graphs. The sum is njl (the 
dotted line), and its derivative is zero* So the derivatives of cos l y and sin" l y must 
add to zero. Those derivatives have opposite sign , There is a minus for the inverse 
cosine, and its graph goes downward: 



Fig. 4,10 The graphs of y — cos x and x = cos" l y* Notice the domain 0 ^ x rc. 


Question How can two functions x = sin _1 y and x = — cos _i y have the same 
derivative ? 

Answer sin ~ 1 y must be the same as cos “ 1 y + C. Equation (5) gives C = njl. 

THE INVERSE TANGENT AND ITS DERIVATIVE 

The tangent is sinx/eosx, The inverse tangent is not sin _1 y/cos _1 y, The inverse 
function produces the angle whose tangent is y. Figure 4.1 1 shows that angle, which 
is between — njl and njl. The tangent can be any number, but the inverse tangent 
is in the open interval - njl <x< njl. (The interval is “open" because its endpoints 
are not included.) The tangents of njl and -njl are not defined. 

The slope of y = tan x is dy/dx = sec 2 x* What is the slope of x = tan “ 1 yl 








174 


4 D»rtvattv8> by Hie Chain Rule 


* 



EXAMPLE 3 The tangent of x = rc/4 is y = L We check slopes. On the inverse tangent 
curve, dxjdy — 1/(1 + y 2 ) — On the tangent curve, dyjdx — sec 2 x. At tt/ 4 the secant 
squared equals 2. The slopes dxjdy — ? and dyjdx = 2 multiply to give 1, 

Important Soon will come the following question. What function has the derivative 
1/(1 + x 2 )? One reason for reading this section is to learn the answer. The function 
is in equation (8 ) — if we change letters . It isf{x) = tan” l x that has slope 1/(1 + x 2 ). 



cos x 1 cot X 


Rg. 4,12 cos 2 * + $in 2 x - 1 and I + tan 2 x = see 2 * and t + cot 2 x =csc 2 x. 


INVERSE COTANGENT, INVERSE SECANT, INVERSE COSECANT 


There is no way we can avoid completing this miserable list! But it can be painless. 
The idea is to use 1 /(dyjdx) for y = cot x and y = sec x and y — esc jc: 


dx 

dy 


— 7- and 

CSC 2 X 


dx 

dy 


— and 

sec x tan x 


dx 

dy 


-1 

CSC x cot X 


P) 


In the middle equation, replace sec x by y and tan x by + yfy 2 - 1. Choose the sign 
for positive slope (compare Figure 4,11), That gives the middle equation in (10): 


The derivatives of cot 1 y and sec 1 y and esc 1 y are 


d , . , -1 

j, (cot '»Tf7 


~{sec 1 y)=“ — / -7- (esc 1 y)= , 

\y\ s/T-l d y |j'lv / ? 3 T 


Note about the inverse secant When y is negative there is a choice for x = sec _1 y. 
We selected the angle in the second quadrant (between rc/2 and n). Its cosine is 
negative, so its secant is negative. This choice makes scc -1 y ~ cos _1 (l/y) ? which 
matches sec x= 1/cos x, It also makes sec ~ x y an increasing function, where cos _1 y 
is a decreasing function. So we needed the absolute value \y\ in the derivative. 
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Some mathematical tables make a different choice. The angle x could be in the 
third quadrant (between —n and -nj 2). Then the slope omits the absolute value. 
Summary For the six inverse functions it is only necessary to learn three derivatives. 
The other three just have minus signs, as we saw for sm _1 y and cos" V Each inverse 
function and its “cofunction” add to rr/2, so their derivatives add to zero. Here are 
the six functions for quick reference, with the three new derivatives. 


function f[y) 

inputs y 

sin _1 y, cos _1 y 


tan _1 y, cot^y 

all y 

sec _1 y, csc _1 y 



outputs x 



slope dxjdy 

± 7^T 2 

i 

± 1 +y 2 
± [j'l>/y 2 - 1 


If y = cos x or y = sin x then |y| ^ L For y = sec x and y = esc x the opposite is true; 
we must have |y| ^ 1. The graph of $ec _1 y misses all the points — 1 < y < 1. 

Also, that graph misses x = itjl — where the cosine is zero. The secant of nj 2 would 
be 1/0 (impossible). Similarly esc" ^misses x = 0, because y — esc 0 cannot be 1/sin 0. 
The asterisks in the table are to remove those points x — nj2 and x — 0. 

The column of derivatives is what we need and use in calculus. 


4.4 EXERCISES 


Read-through questions 

The relation x = $in -1 y means that o is the sine of 
b . Thus x is the angle whose sine is c . The number 
y lies between d and o . The angle x lies between 
f and g . (If we want the inverse to exist, there 
cannot be two ang les with the same sine.) The cosine of the 
angle $in -l y is + J h . The derivative of x = sin~ i y is 
dxjdy — * 

The relation x = cos" *y means that y equals I Again 
the number y lies between * and I This time the 

angle x lies between m and n (so that each y comes 

from only one angle x). The sum sin -L v + cos~ l v = o 
(The angles are called P , and they add to a Q angle.) 
Therefore the derivative of x — cos -1 y is dxjdy = r the 
same as for sin -1 y except for a » sign. 

The relation x = tan ~ l y means that y — The 
number y lies between u and v The angle x lies 
between w and * . The derivative is dxjdy = ¥ 

Since tan~* v + cot ~ 1 v = * . the derivative of cot -1 y is 

the same except for a A sign. 

The relation x = sec “ l y means that B . The number y 
never lies between C and D . The angle x lies between 
E and F . but never at x = & . The derivative of 

x = sec - l y is dxjdy = H 


In 1-4, find the angles sin~ l y and cos l y and tan -1 y in 
radians. 

I y-o ly=-t 3 37 = 1 = 

5 We know that sin n = 0. Why isn’t n = sin " 1 0? 

6 Suppose sin x = y. Under what restriction is x = sin - *y? 

7 Sketch the graph of x = sin" x y and locate the points with 
slope dxjdy = 2. 

8 Find dxjdy if x = sin" 1 £y. Draw the graph. 

9 If y = cos x find a formula for sin x. First draw a right 
triangle with angle x and near side y — what are the other 
two sides? 

10 If y = sin x find a formula for tan x. First draw a right 
triangle with angle x and far side y — what are the other sides? 

11 Take the x derivative of sin ~ l (sm x ) = x by the chain rule. 
Check that d(sin - l y)/dy = l/^/l — y 2 gives a correct result. 

12 Take the y derivative of cos(c os -1 y) — y by the chain rule. 
Check that d(cos - x y)jdy = — lj^fl — y 2 gives a correct result. 

13 At y = 0 and y = 1, find the slope dxjdy of x — sin - and 
x = cos -1 y and x = tan -1 y. 

14 At x = 0 and x = 1, find the slope dxjdy of x = sm _1 y and 
x — cos - *y and x = tan -1 y. 
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15 True or false, with reason: 

(a) (sin - l y } 2 + (cos - l y) 2 = 1 

(b) sin - l y = cos - 1 j has no solution 

(c) sin - ^ is an increasing (unction 

(d) sin - 1 y is an odd function 

(e) sin - *y and — cos -1 y have the same slope — so they are 
the same. 

(f ) stnfcos x) = cos(sin x) 

16 Find tan(cos -1 (sin x)) by drawing a triangle with sides 
sin x, cos x* F 

Compute the derivatives in 17-28 (using the letters as given). 

17 w = sin -1 x 18 u = tan -l 2x 

19 z = sin - ^sin 3x) 20 i = sin - *(cos x) 

21 z = (sin -1 x) 2 22 z-(sin ^x) -1 

23 2 -J\ -y l sin - l y 24 z = (l + x 2 )tan -1 x 

25 x = sec - l (y + 1) 26 u = sec - ^sec x 1 ) 

27 u = sin - 1 y/cos - 1 — y 1 

28 u = sin~ 1 y + cos“*y + tan -1 y 

29 Draw a right triangle to show why tan - *y + cot - ^ = 71 / 2 . 

30 Draw a right triangle to show why tan - L y = cot - i {\jy). 

31 If y = tan x find sec x in terms of y. 

32 Draw the graphs of y = cot x and x = cot - 'y. 

33 Find the slope dxjdy of x = tan - l y at 

(a) y = — 3 (b) x = 0 (c) x = — rc/4 


34 Find a function u(t) whose slope satisfies u' + tV = l. 

35 What is the second derivative d 2 xjdy 2 of x = sin -1 y? 

36 What is d 2 ujdy 2 for u = tan _I y? 

Find the derivatives in 37-44, 

37 y = sec |x 38 x = sec -1 2y 

39 u = sec" l (x n ) 40 u = sec -1 (tan x) 

41 tany = (x — l)/(x + 1) 42 z = (sin x)(sin " l x) 

43 y — sec - 1 ^fx 2 + \ 44 z — sin(cos' L x> — cos(sin - { x) 

45 Differentiate cos - 1 (ljy) to find the slope of sec - 1 y in a 
new way< 

46 The domain and range of x =csc -1 y are * 

47 Find a function u(y) such that dujdy = 4/^/l — y 2 . 

48 Solve the differential equation dujdx = 1/(1 + 4x 2 ) + 

49 If dujdx = IjJX-x 1 find u( 1 ) - u(0). 

50 (recommended) With u(x) = {x — l)/(x + 1), find the deriv- 
ative of tan - V*)’ This is also the derivative of . So 

the difference between the two functions is a . 

51 Find u(x) and tan ^(x) and tan -1 x at x = 0 and x = 00 , 

Conclusion based on Problem 50: tan - ^x) — tan - i x equals 
the number . 

52 Find u(x) and tan -1 u(x) and tan -1 x as x -* — 00. Now 

tan -1 u(x) — tan - l x equals > Something has hap- 

pened to tan” L u{x) + At what x do u(x) and tan - 'ufx) change 
instantly? 
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CHAPTER 5 


Integrals 



5.1 The Idea of the Integral 



This chapter is about the idea of integration, and also about the technique of integ- 
ration, Wc explain how it is done in principle, and then how it is done in practice. 
Integration is a problem of adding up infinitely many things, each of which is infini- 
tesimally small. Doing the addition is not recommended. The whole point of calculus 
is to offer a better way. 

The problem of integration is to find a limit of sums. The key is to work backward 
from a limit of differences (which is the derivative). We can integrate i h (x) if it turns 
up as the derivative of another function f(x), The integral of v = cos x is/- sin x. The 
integral of i ■ = x is/=^.v 2 . Basically, /(*) is an “antiderivative". The list of f ' s will 
grow much longer (Section 5.4 is crucial). A selection is inside the cover of this book, 
[f we don't find a suitable /!*), numerical integration can still give an excellent answer. 

1 could go directly to the formulas for integrals, which allow you to compute areas 
under the most amazing curves. (Area is the clearest example of adding up infinitely 
many infinitely thin rectangles, so it always comes first. It is certainly not the only 
problem that integral calculus can solve.) But I am really unwilling just to write down 
formulas, and skip over all the ideas, Newton and Leibniz had an absolutely brilliant 
intuition, and there is no reason why wc can't share it. 

They started with something simple. We will do the same. 


SUMS AND DIFFERENCES 


Integrals and derivatives can be mostly explained by working (very briefly) with sums 
and differences. Instead of functions, wc have n ordinary numbers. The key idea is 
nothing more than a basic fact of algebra. In the limit as n -> oc, it becomes the basic 
fact of calculus. The step of "going to the limit" is the essential difference between 
algebra and calculus! It has to be taken, in order to add up infinitely many 
infinitesimals — but we start out this side of it. 

To see what happens before the limiting step, we need two sets of n numbers. The 
first set will be v lf v 2 , where v suggests velocity. The second set of numbers 

will be j\J z /„, where / recalls the idea of distance. You might think d would 

be a better symbol for distance, but that is needed for the dx and dy of calculus. 
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A first example has n — 4: 

, t?2 * Vi = 1, 2, 3, 4 — 1> 3, 6, 10. 

The relation between the u’s and / ' s is seen in that example. When you are given 
1, 3, 6, 10* how do you produce 1, 2, 3, 4? By taking differences. The difference 
between 10 and 6 is 4. Subtracting 6 — 3 is 3. The difference f 2 — /i = 3—1 is v 2 = 2. 
Each v is the difference between two f f s: 

Vj is the difference f } — x . 

This is the discrete form of the derivative. I admit to a small difficulty at j = 1 , from 
the fact that there is no / 0 . The first v should be fi -/ 0 , and the natural idea is to 
agree that / 0 is zero. This need for a starting point will come back to haunt us (or 
help us) in calculus. 

Now look again at those same numbers — but start with u. From v - 1, 2, 3, 4 how 
do you produce f- 1, 3* 6, 10? By taking sums . The first two u*s add to 3, which is / 2 , 
The first three u’s add to / 3 = 6. The sum of all four v's is 1 + 2 + 3 + 4 = 10. Taking 
sums is the opposite of taking differences. 

That idea from algebra is the key to calculus. The sum/} involves all the numbers 
v x + v 2 + + Uj, The difference Vj involves only the two numbers/}— /}_ ; . The fact 

that one reverses the other is the “Fundamental Theorem.” Calculus will change sums 
to integrals and differences to derivatives — but why not let the key idea come through 
now? 


5A Fundament*} Theorem of Cakubts (before limits): 

If each vj =f } -/^ x , them i>, + v 2 + - + v n =/„ ~f Q . 


The differences of the / T s add up to /„ —/ 0 , All /’ s in between are canceled* leaving 
only the last/}, and the starting f Q The sum “telescopes”: 

Vi + v 2 + U 3 + "* + v n ~(f x -f Q ) + (f 2 -/i) + {h -/a)+ + (/„ 

The number f x is canceled by -f x . Similarly -f 2 cancels f 2 and -/ 3 cancels / 3 . 
Eventually/}, and -/ 0 are left. When / 0 is zero, the sum is the final/},. 

That completes the algebra. We add the v's by finding the /’ s. 

Question How do you add the odd numbers 1 + 3 + 5+ ■'• + 99 (the v'sfi 
Answer They are the differences between 0, 1, 4, 9, .... These /’s are squares. By the 
Fundamental Theorem, the sum of 50 odd numbers is (50) 2 . 

The tricky part is to discover the right /’ s! Their differences must produce the u T s. 
In calculus, the tricky part is to find the right fix). Its derivative must produce v(x). 
It is remarkable how often / can be found — more often for integrals than for sums. 
Our next step is to understand how the integral is a limit of sums . 

SUMS APPROACH INTEGRALS 

Suppose you start a successful company. The rate of income is increasing. After 
x years, the income per year is x million dollars. In the first four years you reach 
^/2, and million dollars. Those numbers are displayed in a bar graph 
(Figure 5.1a, for investors). I realize that most start-up companies make losses, but 
your company is an exception. If the example is too good to be true, please keep 
reading. 
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Income 
per year 




Hg.5.i Total income — total area of rectangles ~ 6.15. 


Year 


The graph shows four rectangles, of heights y/i, y/l, y/i, y/A. Since the base of 
each rectangle is one year, those numbers are also the areas of the rectangles. One 
investor, possibly weak in arithmetic, asks a simple question: What is the total income 
for all four yearsft There are two ways to answer, and I will give both. 

The first answer is y/l + y/l + y/i + y/A. Addition gives 6.15 million dollars. 
Figure 5.1b shows this total — which is reached at year 4. This is exactly like velocities 
and distances, but now v is the income per year and /is the total income. Algebraically, 
/ is still + ■■■ + v } . 

The second answer comes from geometry. The totai income is the total area of the 
rectangles. We are emphasizing the correspondence between addition and area. That 
point may seem obvious, but it becomes important when a second investor (smarter 
than the first) asks a harder question. 

Here is the problem. The incomes as stated are faise. The company did not make 
a million dollars the first year. After three months, when x was 1/4, the rate of income 
was only y/x = 1/2. The bar graph showed y / 1 = 1 for the whole year, but that was 
an overstatement. The income in three months was not more than 1/2 times 1/4, the 
rate multiplied by the time. 

All other quarters and years were also overstated. Figure 5.2a is closer to reality, 
with 4 years divided into 16 quarters. It gives a new estimate for total income 

Again there are two ways to find the totai. We add y/TjA 4- y/lj 4 + ■•• + ^/16/4, 
remembering to multiply them all by 1/4 (because each rate applies to 1/4 year). 
This is also the area of the 16 rectangles. The area approach is better because the 1/4 
is automatic. Each rectangle has base 1/4, so that factor enters each area. The total 
area is now 5.56 million dollars, closer to the truth. 

You see what is comin g. The next step divides time into weeks. After one week the 
rate y/x is only y/i/52. That is the height of the first rectangle— its base is Ax = 
1/52. There is a rectangle for every week. Then a hard-working investor divides time 
into days, and the base of each rectangle is Ax = 1/365. At that point there are 
4 x 365 = 1460 rectangles, or 1461 because of leap year, with a total area below 5^ 
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4 4 4 4 4 44444 

Fig. 5,2 Income = sum of areas (not heights) 

-U f~ /_ 

4 4 + \/ 4 + + V 4 / 


million dollars. The calculation is elementary but depressing — adding up thousands 
of square roots, each multiplied by Ax from the base. There has to be a better way. 

The better way, in fact the best way, is calculus. The whole idea is to allow for 
continuous change. The geometry problem is to find the area under the square root 
curve . That question cannot be answered by arithmetic, because it involves a limit. 

The rectangles have base Ax and heights V /Ax, 2Ax y^4. There are 4/Ax 

rectangles — more and more terms from thinner and thinner rectangles. The area is 
the limit of the sum as Ax — * (X 

This limiting area is the "integral.” We are looking for a number below 5^. 

Algebra (area of n rectangles): Compute v x + 1 r + v n by finding f s. 

Key idea: If Vj =/) — /}„ i then the sum is /„ — f Q , 

Calculus (area under curve): Compute the limit of Ax[u(Ax) + v(2Ax) + ■■ 

Key idea: If v(x) = djldx then area = integral to be explained next. 


5.1 EXERCISES 


Read-through questions 

The problem of summation is io add - r + v n . It is solved 
if we find /'s such that ij = a Then - v n equals 

b The cancellation in (J\ —f } ) 4 (f 2 —j\)+ JJ + 
(ftj—fn-i) leaves only c Taking sums is the d or 
taking differences. 

The differences between 0, 1,4,9 are i .'i . c s , v* = e 

For J]—j z the difference between f lt) and f is r ]0 = 

From this pattern M 3 f 5 + JL ■ 19 equals g 


For functions, finding the integral is the reverse of h 
If the derivative of f(x) is then the i of vix) is/(x). 
If rfx) = lQx then/f.v) = J This is the k of a triangle 
with base x and height I0x. 

Integrals begin with sums. The triangle under r= lOx oul 
to x = 4 has area I It is approximated by four rectangles 
of heights 10, 20, 30, 40 and area m . It is better approxi- 
mated by eight rectangles of heights n and area o 
For n rectangles covering the triangle the area is the sum of 
p As n -* x this sum should approach the number 

g That is the integral of v = 10 x from 0 4, 
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Problems 1-6 are about sums f and differences ij, 

1 With i? = L 2* 4, 8, the formula for vj is (not 2^). 

Find _/i J ' 2 , / 3 , _/ 4 starting from / 0 = 0, What is ffl 

2 The same r = 1, 2, 4. 8, ... are the differences between 

/= L 2, 4, 8, 16 Now / 0 = 1 and /} = 2 J . (a) Check that 

2 5 - 2 4 equals r 5 . (b) What is 1 + 2 + 4 + 8+16? 

3 The differences between j = l, 1/2. 1/4, 1/8 are v = 
1/2, - 1 4, — 1/8. These negative Fs do not add up to these 

positivc/'s. Verify that jq + r 2 + r 3 — / 4 — / 0 is still true. 

4 Any constant C can be added to the antiderivative f(x) 

because the of a constant is zero. Any C can be 

added to yy/, , ... because the „ between the f s is 

not changed. 

5 Show that = F, (r - l) has j) — /)_ i = r J 3 . Therefore the 

geometric scries I+r+ -+r J ' L adds up to 

(remember to subtract / 0 ). 

6 The sums _/ ; = (r J — I)/(r - I) also have f -J ) _ 1 = r J ~ F 

Now _/„ = . Therefore 1 + r + ■■■ + r J ~ 1 adds up to 

/’ . The sum I + r + ■ ■ + r" equals . 

7 Suppose r(x) = 3 for x < 1 and r(.x) = 7 for x > l. Find the 
area /tv) from 0 to x, under the graph of r(x). (Two pieces.) 

8 T T i 1 = 1 , — 2, 3, ”4 write down thc/’s starting from 

f 0 = 0. Find formulas for v } and j] when j is odd and j is even. 

Problems 9-16 are about the company earning v x per year. 

9 When time is divided into weeks there are 4 x 52 = 208 
rectangles. Write down the first area, the 208th area, and the 
jth area. 

10 How do you know that the sum over 208 weeks is smaller 
than the sum over 16 quarters? 

11 A pessimist would use v /.v at the beginning of each time 
period as the income rate for that period. Red raw Figure 5.1 
(both parts) using heights v 0, v ' 1, v 2, v 3. How much low r er 
is the estimate of total income? 

12 The same pessimist would redraw r Figure 5.2 with heights 
0, v M/4, .... What is the height of the last rectangle? How 
much does this change reduce the total rectangular area 5.56? 

13 At every step from years to weeks to days to hours, the 

pessimist's area goes and the optimist's area goes 

. The difference between them is the area of the last 


14 The optimist and pessimist arrive at the same limit as 
years are divided into weeks, days, hours, seconds. Draw the 
v /x curve between the rectangles to show why the pessimist 
is always too low and the optimist is too high. 

15 (Important) Let f(x) be the area under the x fx curve, above 
the interval from 0 to x, The area to x -h A* is f{x + Ax). The 

extra area is A /= . This is almost a rectangle with 

base ____ and height So A// Ax is close to . 

As Ax -► 0 we suspect that dfidx = , 

16 Draw the v /x curve from x = 0 to 4 and put triangles 
below to prove that the area under it is more than 5. Look 
left and right from the point where ^/T = L 

Problems 17-22 are about a company whose expense rate 

t(x) = 6 — v is decreasing. 

17 The expenses drop to zero at x = . The total 

expense during those years equals . This is the area 

of . 

18 The rectangles of heights 6, 5, 4, 3, 2, 1 give a total 

estimated expense of . Draw r them enclosing the 

triangle to show why this total is too high. 

19 How many rectangles (enclosing the triangle) would you 
need before their areas are within 1 of the correct triangular 
area? 

20 The accountant uses 2-year intervals and computes v = 
5, 3, 1 at the midpoints (the odd-numbered years). What is 
her estimate, how r accurate is it, and why? 

21 What is the area f(x) under the line t^v) = 6 — x above the 
interval from 2 to x? What is the derivative of this /(x)? 

22 What is the area/(x) under the line r(x) - 6 - x above the 
interval from x to 6? What is the derivative of this /(v)? 

23 With A.v = 1/3, find the area of the three rectangles that 
enclose the graph of F.v) = x 2 . 

24 Draw graphs of r = V F v and v = x 2 from 0 to I. Which 

areas add to 1 J> The same is true for r = x J and l- = . 

25 From x to x + Ax, the area under v = x 2 is A/ This 

is almost a rectangle with base A.v and height . So 

A/ Ax is close to . In the limit we find dfidx = x z 

and fix] = . 

26 Compute the area of 208 rectangles under l<x) = v x from 
x = 0 to x = 4. 
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5.2 Antiderivatives 



The symbol J was invented by Leibniz to represent the integral It is a stretched-out 
S, from the Latin word for sum. This symbol is a powerful reminder of the whole 
construction: Sum approaches integral, $ approaches j, and rectangular area 
approaches curved area: 

curved area = J i?(x) dx = j +J x dx. (1) 

The rectangles of base Ax lead to this limit — the integral of x . The “dx” indicates 
that Ax approaches zero. The heights Vj of the rectangles are the heights u(x) of the 
curve. The sum of Vj times Ax approaches “ the integral of v of x dx .” You can imagine 
an infinitely thin rectangle above every point, instead of ordinary rectangles above 
special points. 

We now find the area under the square root curve. The “ limits of integration ” are 
0 and 4. The lower limit is x = 0, where the area begins. ( The start could be any point 
x = a.) The upper limit is x — 4, since we stop after four years. (The finish could be 
any point x — b.) The area of the rectangles is a sum of base Ax times heights v /x. 
The curved area is the limit of this sum. That limit is the integral of v /x from 0 to 4: 

fon [(VaI)(Ax) + (v^K &x) + - + (v^(Ax)] = I dx. (2) 

The outstanding problem of integral calculus is still to be solved. What is this limiting 
area? We have a symbol for the answer, involving J and ^x and dx — but we don’t 
have a number. 


THE ANTIDERIVATIVE 

I wish I knew who discovered the area under the graph of J x . It may have been 
Newton. The answer was available earlier, but the key idea was shared by Newton 
and Leibniz. They understood the parallels between sums and integrals, and between 
differences and derivatives. I can give the answer, by following that analogy. I can’t 
give the proof (yet) — it is the Fundamental Theorem of Calculus. 

In algebra the difference f -fj-\ is v } . When we add, the sum of the v' s is/, -f 0 . 
In calculus the derivative of fix) is u(x). When we integrate, the area under the u(x) 
curve isf(x) minus f(0). Our problem asks for the area out to x — 4: 


5B (Discrete vs. continuous, rectangles vs. curved areas, addition vs. 
integration) The integral of u(x) is the difference in f(x): 

If dfidx = yfx them ana = f*I* yfx dx =^4) -f(0). (3) 

What isf(x)l Instead of the derivative of v /x, we need its “ antiderivative " We have 
to find a function fix) whose derivative is N fx . It is the opposite of Chapters 2-4, and 
requires us to work backwards. The derivative of x n is nx n_1 — now we need the 
antiderivative. The quick formula is/(x) = x" + 1 /{n + 1) — we aim to understand it. 

Solution Since the derivative lowers the exponent, the antiderivative raises it. We 
go from x 1/2 to x 3/3 . But then the derivative is (3/2)x w2 . It contains an unwanted 
factor 3/2. To cancel that factor , put 2/3 into the antiderivative: 

f[x) — fx 3/2 has the required derivative v(x) = x l 12 = ^fx. 
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12 3 4 1 2 3 4 

Fig. 5.3 The integral of i^x) = ^/x is the exact area 16/3 under the curve. 


There you see the key to integrals: Work backward from derivatives (and adjust). 

Now comes a number — the exact area. At x = 4 we find x 3/2 = 8. Multiply by 2/3 
to get 16/3. Then subtract /(0) = 0: 

P 4 V X dx = ?(4)« - ^<0) 3/2 = ?<8) = ^ (4) 

Jjc = 0 3 J J 3 

TA<? fofci/ inco/ite over yb«r _^eor5 is 16/3 = 5y imV/rbn dollars . This is/^4) — y^O). The 
sum from thousands of rectangles was slowly approaching this exact area 5y. 

Other areas The income in the first year, at x~ 1, is ^1) 3/2= 1 million dollars. 
(The false income was 1 million dollars.) The total income after x years is lx 3/2 , 
which is the antiderivative /(x). The square root curve covers 2/3 of the overall rectangle 
it sits in . The rectangle goes out to x and up to ^/x, with area x 3/2 , and 2/3 of that 
rectangle is below the curve. (1/3 is above.) 

Other antiderivatives The derivative of x 5 is 5x 4 . Therefore the antiderivative of x 4 
is x 5 /5. Divide by 5 (or n + 1) to cancel the 5 (or n + 1) from the derivative. And don’t 
allow n 4- 1 = 0: 

The derivative r{x) = x n has the antiderivative f(x) = x n+ 1 j(n + 1). 

EXAMPLE 1 The antiderivative of x 2 is $x 3 . This is the area under the parabola 
v(x) = x 2 . The area out to x = 1 is $(1) 3 - j(0) 3 , or 1/3. 

Remark on ^/x and x 2 The 2/3 from v /x and the 1/3 from x 2 add to 1. Those are 
the areas below and above the y/x curve, in the corner of Figure 5.3. If you turn the 
curve by 90 Q , it becomes the parabola. The functions y = v /x and x = y 2 are inverses! 
The areas for these inverse functions add to a square of area 1. 

AREA UNDER A STRAIGHT LINE 

You already know the area of a triangle. The region is below the diagonal line v = x 
in Figure 5.4. The base is 4, the height is 4, and the area is i<4)(4) — 8. Integration is 
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Fig. 5,4 Triangular area 8 as the limit of rectangular areas 10, 9, 8^ 


not required! But if you allow calculus to repeat that answer, and build up the integral 
J{x) = ?x 2 as the limiting area of many rectangles, you will have the beginning of 
something important. 

The four rectangles have area 1 + 2 + 3 + 4=10. That is greater than 8 , because 
the triangle is inside. 10 is a first approximation to the triangular area 8 , and to 
improve it we need more rectangles. 

The next rectangles will be thinner, of width &x= 1/2 instead of the original 
Ax = l. There will be eight rectangles instead of four. They extend above the line, 
so the answer is still too high. The new heights are 1/2, 1, 3/2, 2, 5/2, 3, 7/2, 4, The 
total area in Figure 5.4b is the sum of the base Ax — 1/2 times those heights: 

area = 4(i + 1 + f + 2 + + 4) = 9 {which is closer to 8 ). 

Question What is the area of 16 rectangles? Their heights are 3 4. 

Answer With base A,\ = i the area is i(i + 1 + ■" + 4) = 8 ^. 

The effort of doing the addition is increasing. A formula for the sums is needed, and 
will be established soon, (The next answer would be 8 £.) But more important than 
the formula is the idea. We are carrying out a limiting process, one step at a time. The 
area of the rectangles is approaching the area of the triangle, as A.v decreases. The 
same limiting process will apply to other areas, in which the region is much more 
complicated. Therefore we pause to comment on what is important. 

Area Under a Curve 

What requirements are imposed on those thinner and thinner rectangles? It is not 
essential that they all have the same width. And it is not required that they cover the 
triangle completely. The rectangles could lie below the curve. The limiting answer 
will still be 8 , even if the widths A.* are unequal and the rectangles fit inside the 
triangle or across it. We only impose two rules: 

1. The largest width Ax majt must approach zero. 

2. The top of each rectangle must touch or cross the curve. 

The area under the graph is defined to be the limit of these rectangular areas, if that 
limit exists. For the straight line, the limit does exist and equals 8 . That limit is 
independent of the particular widths and heights — as we absolutely insist it should 
be. 

Section 5.5 allows any continuous r(.\). The question will be the same — Does the 
iimit exist ? The answer will be the same — Yes, That limit will be the integral of t(x), 
and it will be the area under the curve. It will be f(x). 



EXAMPLE 2 The triangular area from 0 to x is ^(base)(height) = £(x)(x). That is 
y^jc) = jx 2 . Its derivative is v(x) = x. But notice that \x 2 + 1 has the same derivative . 
So does /= ^x 2 -I- C, for any constant C. There is a “constant of integration ” in/(x ), 
which is wiped out in its derivative u(x). 

EXAMPLE 3 Suppose the velocity is decreasing: v(x) = 4 - jc. If we sample v at x = 
1, 2, 3, 4, the rectangles lie under the graph. Because v is decreasing, the right end of 
each interval gives i? min . Then the rectangular area 3 + 2+ l + 0= 6is less than the 
exact area 8. The rectangles are inside the triangle, and eight rectangles with hase \ 
come closer: 

rectangular area = 7 ( 3 ^ + 3 H — + t + 0) = 7. 

Sixteen rectangles would have area 1\, We repeat that the rectangles need not have 
the same widths Ax, but it makes these calculations easier 
What is the area out to an arbitrary point (like x = 3 or x = 1)? We could insert 
rectangles, but the Fundamental Theorem offers a faster way. Any antiderivative of 
4 — x will give the area. We look for a function whose derivative is 4 — x. The derivative 
of 4x is 4, the derivative of \x 2 is x, so work backward: 

to achieve dfjdx = 4“x choose /(x) = 4x - ^x 2 . 

Calculus skips past the rectangles and computes ^3) = The area between x=l 
and x = 3 is the difference = 4, In Figure 5.5, this is the area of the trapezoid. 

The f-curve flattens out when the v-curve touches zero . No new area is being added , 



Fig. 5.5 The area is A/= — 3* = 4. Since r(x) decreases >/(*) bends down. 


INDEFINITE INTEGRALS AND DEFINITE INTEGRALS 


We have to distinguish two different kinds of integrals. They both use the antideriva- 
tive /(x). The definite one involves the limits 0 and 4, the indefinite one doesn’t: 

The indefinite integral is a function f(x) = 4x - \x 2 < 

The definite integral from x — 0 to x — 4 is the number f{ 4} — /(0). 

The definite integral is definitely 8. But the indefinite integral is not necessarily 
4x — jx 2 . We can change f(x) by a constant without changing its derivative (since the 
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derivative of a constant is zero). The following functions are also antiderivatives; 

j[x)= 4*-^* 2 + 1, /(*) = 4x-$x 2 -9 i /(x) = 4x- jx 2 + C 

The first two are particular examples. The last is the general case. The constant C 
can be anything (including zero), to give all functions with the required derivative. 
The theory of calculus will show that there are no others. The indefinite integral is 
the most general antiderivative (with no limits): 

indefinite integral fix ) = J r(x) dx = 4x — jx 2 + C, (5) 

By contrast, the definite integral is a number. It contains no arbitrary constant C. 
More that that, it contains no variable x, The definite integral is determined by the 
function u(x) and the limits of integration (also known as the endpoints ). It is the area 
under the graph between those endpoints. 

To see the relation of indefinite to definite, answer this question: What is the definite 
integral between x— 1 and x = 3? The indefinite integral gives /(3) = 7£+C and 
fi 1) = 3y 4- C. To find the area between the limits, subtract f at one limit from fat the 
other limit: 

j i x _ 1 Hx)(lx=jm -/(!) = (7i f C) - (3i + 0 = 4. (6) 

The constant cancels itself! The definite integral is the difference between the values 
of the indefinite integral. C disappears in the subtraction. 

The difference/^) - fit) is like /„ — / 0 . The sum of vj from 1 to n has become “ the 
integral of v{x) from 1 to 3.” Section 5.3 computes other areas from sums, and 5.4 
computes many more from antiderivatives. Then we come back to the definite integral 
and the Fundamental Theorem: 


L'(.X) (l.X = 
Jfl 


df 

a ilx 


dx =f{b) -/(fi). 


(7) 


5.2 EXERCISES 


Read -through questions 

Integration yields the a under a curve y = lj(x). It starts 
from rectangles with base b and heights u(x) and areas 
c As Ax -* 0 the area l i ,A.x + j + i\Ax becomes the 
d of t?(x). The symbol for the indefinite integral of v(x) is 
© 


The problem of integration is solved if we find fix) such 
that f Then / is the fl of i\ and t»(x) dx equals 
h minus ■ The limits of integration are j This 
is a *< integral, which is a i and not a function/(.x). 

The example u(x) = x has fix) = m . It also has fix ) — 
n The area under r(.x) from 2 to 6 is o The constant 
is canceled in computing the difference p minus q 

If i?(x) = V s then/(.x) = r 


Find an antiderivative fix) for in 1-14. Then compute the 
definite integral i<x) dx = /(l) -/(O), 


1 Sx^ + Ax 5 2 

3 l..' v /x (or .x“ I J ) 4 

5 v 1 3 + (2.v)‘ 3 6 

7 2 sin x + sin 2.x 8 

9 x cos x (by experiment) 10 

1 1 sin x cos .x 12 

13 0 (find all /) 14 

15 If dfidx = t.(x) then the defini 

b is . If fj-fj-i =Vj 


I: 3 + " v i ls 


x+ 1 2.x 2 
(^.x) 3 (or x 3, ' 2 t 

^ i 3 

sec 2 x + 1 

x sin x (by experiment) 

sin 2 .x cos x 

-I (find all/) 

e integral of r(x) from a to 
then the definite sum of 


The sum rj + - l + v„ =/„“/□ leads to the Fundamental 
Theorem j^r(x)^x= s The t integral is fix) and 
the v integral is fib) — fia ). Finding the v under the 
r-graph is the opposite of finding the w of the /-graph. 


16 The areas include a factor Ax, the base of each rectangle. 

So the sum of vs is multiplied by to approach the 

integral The difference of f's is divided by to 

approach the derivative. 
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17 The areas of 4, 8, and 16 rectangles were 10, 9, and 
containing the triangle out to x — 4. Find a formula for the 
area A s of N rectangles and test it for N = 3 and N = 6, 

IS Draw four Tectangles with base 1 below the y = x line, and 
find the total area. What is the area witb N rectangles? 

19 Draw y = sin x from 0 to it. Three rectangles (base rc/3) 
and six rectangles (base ir/6) contain an arch of the sine func- 
tion. Find the areas and guess the limit. 

20 Draw an example where three lower rectangles under a 
curve (heights mi, m 2 ,m 3 ) have less area than two rectangles. 

21 Draw y ~ 1 jx 1 for 0 < x < 1 with two rectangles under it 
(base 1/2). What is their area, and what is the area for four 
rectangles? Guess the limit. 

22 Repeat Problem 21 for y = 1/x. 

23 (with calculator) For u(x) — \j\fx take enough rectangles 
over 0 ^ x ^ 1 to convince any reasonable professor that the 
area is 2. Find_flx) and verify that/l)—/0) = 2. 

24 Find the area under the parabola u = x 2 from x = 0 to 
x = 4. Relate it to the area 16/3 below ^ fx . 

25 For [?! and v 1 in the figure estimate the areas J[ 2) and/4). 
Start witb/0) = G. 






26 Draw y = u(x) so that the area/x) increases until x= 1, 
stays constant to x = 2, and decreases to/3) = 1. 

27 Describe the indefinite integrals of ^ and Do the areas 
increase? Increase then decrease? ... 

28 For v 4 (x) find the area/4) -/ 1). Draw/ 4 (x). 

29 The graph of B(t) shows the birth rate: births per unit time 
at time t. D(t) is the death rate. In what way do these numbers 
appear on the graph? 

1. The change in population from t = 0 to t = 10. 

2. The time T when the population was largest. 

3. The time t* when the population increased fastest. 

30 Draw the graph of a function y 4 (x) whose area function 
is d 4 {x). 

31 If i? 2 (x) is an antiderivative of y a (x), draw y^fx). 

32 Suppose u(x) increases from u(0) — 0 to u(3) = 4. The area 

under y — u(x) plus the area on the left side of x — v~ *(y) 
equals . 

33 True or false , when/x) is an antiderivative of u(x). 

(a) 2/fjc) is an antiderivative of 2u(x) (fry examples) 

(b) /2x) is an antiderivative of u(2x) 

(c) /x) + i is an antiderivative of u(x) + 1 

(d) /x + 1) is an antiderivative of i^x + 1). 

(e) (/x)) a is an antiderivative of (v(x)) 2 . 


5.3 Summation versus Integration 

This section does integration the hard way. We find explicit formulas for f„ — 
Vi H — + v From areas of rectangles, the limits produce the area/x) under a curve. 
According to the Fundamental Theorem, dfjdx should return us to v(x ) — and we 
verify in each case that it does. 

May I recall that there is sometimes an easier way? If we can find an/x) whose 
derivative is u(x), then the integral of v is/. Sums and limits are not required, when / 
is spotted directly. The next section, which explains how to look for/x), will displace 
this one. (If we can't find an antiderivative we fall back on summation.) Given a 
successful/ adding any constant produces another / — since tbe derivative of tbe 
constant is zero. The right constant achieves /0) = 0, with no extra effort. 
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This section const ructs/(x) from sums. The next section searches for antiderivatives. 

THE SIGMA NOTATION 

In a section about sums, there has to be a decent way to express them. Consider 
1 2 + 2 2 + 3 2 + 4 2 . The individual terms are vj = j 2 . Their sum can be written in sum- 
mation notation , using the capital Greek letter L (pronounced sigma): 

l 2 + 2 2 + 3 2 + 4 2 is written £ 

Spoken aloud, that becomes "'the sum oj j 2 from j= 1 to 42* It equals 30. The limits 
on / (written below and above Z) indicate where to start and stop: 

n 9 

l 1 , + ■■■ + = Y, V j atK l V S + + = X V k- (1) 

; = ] k = 3 

The k at the end of (1) makes an additional point. There is nothing special about the 
letter /. That is a "dummy variable f no better and no worse than k (or/). Dummy 
variables are only on one side (the side with Z), and they have no effect on the sum. 
Ihe upper limit n is on both sides. Here are six sums: 


n 

4- 

V k = 1 + 2 + 3 + ■■■ + h 

y {■ i)' = - i + i - 1 + i = o 

*= i 

j - i 


u i ~\ 

X (2/- 1)= 1 + 3- 5+ 7 + 9 = 5 2 

7=1 

^ i'i = i 0 only one term 

L — 0 ' J 

^ t r l 

* i , i i „ r- t . 

> -r = 1 + + — ■" = 2 mhnite senes 

fcfo 2* 2 4 1 

^ j 2 — meaningless? j 

j - 1 


The numbers 1 and n or 1 and 4 (or 0 and x) arc the lower limit and upper limit. 
The dummy variable i or / or k is the index of summation. I hope it seems reasonable 
that the infinite scries 1 T- , + i + r " adds to 2. We will come back to it in Chapter lOT 
A sum like E"_ 3 6 looks meaningless, but it is actually 6-6+ j + 6 = 6m 
It follows the rules. In fact is not meaningless either. Every term is j 2 and 

by the same rules, that sum is 4 j 2 . However the / was probably intended to be /. 
Then the sum is 1+449+16 = 30. 

Question What happens to these sums when the upper limits are changed to jt? 
Answer T7ie sum depends on the stopping point n. A formula is required {when 
possible). Integrals stop at x. sums stop at n, and w r e now look for special cases when 
/(. y) or f n can be found. 


A SPECIAL SUMMATION FORMULA 

How do you add the first 100 whole numbers 1 .' The problem is to compute 

L 00 

X /= 1 + 2 + 3+ ■■■ +98 + 99+ 100 = ? 

j= i 


tZeno the Greek believed it was impossible to gel anywhere, since he would only go halfway 
and then half again and half again. Infinite series would have changed his whole life 
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If you were Gauss, you would see the answer at once, (He solved this problem at a 
ridiculous age, which gave his friends the idea of getting him into another class.) His 
solution was to combine 1 + 100, and 2 + 99, and 3 + 98, always adding to 101. There 
are fifty of those combinations. Thus the sum is (50)( 101) = 5050. 

The sum from 1 to n uses the same idea. The first and last terms add to n + 1. The 
next terms n - 1 and 2 also add to n + 1. If n is even (as 100 was) then there are jn 
parts. Therefore the sum is \n times n + 1; 

£j=l + 2+''+(fl-l) + tt = ]-n(n + 1)* (2) 

j= i ^ 

The Important term is \n 2 , but the exact sum is jn 2 + jn. 

What happens if n is an odd number (like n = 99)7 Formula (2) remains true. The 
combinations 1 + 99 and 2 + 98 still add to n + 1 = 100, There are ^99) = 49^ sueh 
pairs, beeause the middle term (which is 50) has nothing to combine with. Thus 
1 + 2 + ■* + 99 equals 49^ times 100, or 4950. 

Remark That sum had to be 4950, because it is 5050 minus 100. The sum up to 99 
equals the sum up to 100 with the last term removed. Our key formula /„ — t 
has turned up again! 

EXAMPLE Find the sum 101 + 102 + ■■■ + 200 of the second hundred numbers. 

First solution This is the sum from 1 to 200 minus the sum from 1 to 100: 

200 200 100 

£ / = £ ./ ~ £ }■ (3) 

101 1 1 

The middle sum is 4(20Q)(201) and the last is ?(100)(101) + Their difference is 15050. 
Note\ 1 left out “] = " in the limits. It is there, but not written. 

Second solution The answer 15050 is exactly the sum of the first hundred numbers 
(which was 5050) plus an additional 10000. Believing that a number like 10000 ean 
never turn up by accident, we look for a reason. It is found through changing the 
limits of summation. 

200 100 

Y j is the same sum as Y (k + 100). (4) 

y ~ i o 1 k = l 

This is important, to be able to shift limits around. Often the lower limit is moved 
to zero or one, for convenience. Both sums have 100 terms (that doesn't change). The 
dummy variable j is replaced by another dummy variable k. They are related by 
j — k~\- 100 or equivalently by k=j— 100, 

The variable must change everywhere — in the lower limit and the upper limit as 
well as inside the sum. If j starts at 101, then k=j- 100 starts at 1. Ifj ends at 200, 
k ends at 100. If j appears in the sum, it is replaced by k + 100 (and if j 2 appeared it 
would become (k + 100) 2 ). 

From equation (4) you see why the answer is 15050, The sum 1 + 2 + ■■■ + 100 is 
5050 as before. 100 is added to each of those 100 terms ♦ That gives 10000. 

EXAMPLES OF CHANGING THE VARIABLE (and the limits) 

2 4 

Y - equals Y 1 (here i=j~ 1). Both sums arc 1 + 2 + 4 + 8 

i = 0 j= 1 

rt n ■ 3 

Y ^equals Y v j + s {here / =j + 3 and j = i - 3). Both sums are + — + v n . 

i = 3 ; = o 



Why change n to n — 3? Because the upper limit is i = n. So j + 3 = n and j — n — 3* 
A final step is possible, and you will often see it. The new variable j can be changed 
back to i. Dummy variables have do meaning of their own, but at first the result 
looks surprising: 

^ 2 l equals ^ 2 J_1 equals ^ 2 i_1 , 

1 = 0 ]=l f=l 

With practice you might do that in one step, skipping the temporary letter j . Every 
i on the left becomes i — 1 on the right. Then i — 0, 5 changes to i — 1, .. . , 6- (At 
first two steps are safer.) This may seem a minor point, but soon we will be changing 
the limits on integrals instead of sums. Integration is parallel to summation, and it 
is better to see a “change of variable” here first. 

Note about 1 + 2 + ■■ ■■ + n. The good thing is that Gauss found the sum %n(n + 1). 
The bad thing is that his method looked too much like a trick. 1 would like to show 
how this fits the fundamental rule connecting sums and differences: 

j 'fu l + v 2 + - + u„=/ n then v a =f n - (5) 

Gauss says that/, is in(« + 1). Reducing n by 1, his formula for f n - x is i(n- l)n. The 
difference f n “■/„-! should be the last term n in the sum: 

fn -/«-!= i»(n + 1) - i(B ~ l)n = i (n 2 + n-n 2 + n) = n. (6) 

This is the one term v H = n that is included in f n but not in f„-\* 

There is a deeper point here. For any sum / B , there are two things to check. The 
/’$ must begin correctly and they must change correctly. The underlying idea is 
mathematical induction: Assume the statement is true below n. Prove it for n. 

Goal To prove that 1 + 2 + + n = in(n + 1). This is the guess f n . 

Proof by induction : Check f x (it equals 1). Check f n — i (if equals n). 

For n — 1 the answer 1) = ^*1*2 is correct. For n = 2 this formula ^*2*3 

agrees with 1 + 2. But that separate test is not necessary! If f x is right , and if the 
change f n ~f n ~i is right for every n, then f n must be right , Equation (6) was the key 
test, to show that the change in /’s agrees with v . 

That is the logic behind mathematical induction, but I am not happy with most 
of the exercises that use it. There is absolutely no excitement. The answer is given by 
some higher power (like Gauss), and it is proved correct by some lower power (like 
us). It is much better when we lower powers find the answer for ourselves.! Therefore 
I will try to do that for the second problem, which is the sum of squares. 

THE SUM OF / 3 AND THE INTEGRAL OF x 2 

An important calculation comes next. It is the area in Figure 5.6. One region is made 
up of rectangles, so its area is a sum of n pieces. The other region lies under the 
parabola v — x 2 . It cannot be divided into rectangles, and calculus is needed. 

The first problem is to find /„ — l 2 + 2 2 + 3 2 + + n 2 . This is a sum of squares, 

with /i = 1 and f 2 = 5 and / 3 = 14. The goal is to find the pattern in that sequence. 
By trying to guess /„ we are copying what will soon be done for integrals. 

Calculus looks for an /(x) whose derivative is u(x). There /is an antiderivative (or 


tThe goal of real teaching is for the student to find the answer. And also the problem. 
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Fig* 5*6 Rectangles enclosing v = x 2 have area (3^ + + ^/i)(A^c) 3 & j(rtAx) 3 = 3X 3 . 


an integral). Algebra looks for s whose differences produce u H * Here f H could be 
called an antidifference (better to call it a sum). 

The best start is a good guess. Copying directly from integrals, we might try 
/„ = 3 * To test if it is right, check whether 1 produces v n = n 2 : 

in 3 - i(n - l) 3 = in 3 - }{n 3 - 3n 2 + 3n - 1) = n 2 - n + i 

We see n 2 t but also — n + 3* The guess ^n 3 needs correction terms. To cancel 3 in the 
difference, T subtract from the sum. To put back n in the difference, I add 
1 + 2 + --■ + n = %n(n + 1) to the sum. The new guess (which should be right) is 

/„ = 3« 3 + 1 »(" + 1) - i« = + W + in. (7) 


To check this answer, verify first that f x = 1, Also f 2 = 5 and / 3 = 14* To be certain, 
verify that f n —f n ~i = n 2 - For calculus the important term is 


n t 1 

The sum j 2 of the first n squares is - n 3 plus corrections 

i-y 3 


2 ” 


and - n. 
o 


In practice ^n 3 is an excellent estimate. The sum of the first 100 squares is approxi- 
mately 3(100) 3 , or a third of a million. If we need the exact answer, equation (7) is 
available: the sum is 338,350. Many applications (example: the number of steps to 
solve 100 linear equations) can settle for ^n 3 . 

What is fascinating is the contrast with calculus. Calculus has no correction terms\ 
They get washed away in the limit of thin rectangles* When the sum is replaced by 
the integral (the area), we get an absolutely clean answer: 

The integral of v = x 2 from x = 0 to x~n is exactly 3fi 3 * 


The area under the parabola, out to the point x = 100, is precisely a third of a million. 
We have to explain why, with many rectangles. 

The idea is to approach an infinite number of infinitely thin rectangles* A hundred 
rectangles gave an area of 338,350. Now take a thousand rectangles. Their heights 
are (t^) 2 , *,, because the curve is v = x 2 . The base of every rectangle is 

Ax = and we add heights times base: 


area of rectangles = 




2l 

10 


1 \ /1000 
10 / + ' " + ( 10 




Factor out (i^) 3 . What you have left is l 2 + 2 2 + ■** + 1000 2 , which fits the sum of 
squares formula. The exact area of the thousand rectangles is 333,833*5* I could try 
to guess ten thousand rectangles but I won't. 

Main point: The area is approaching 333,333.333.... But the calculations are getting 
worse* It is time for algebra — which means that we keep “Ax” and avoid numbers* 
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The interval of length 100 is divided into n pieces of length Ax\ (Thus n = 100/ Ax.) 
The _/th rectangle meets the curve v = x 2 , so its height is (jAx) 2 . Its base is Ax, and 
we add areas: 

area = (Ax) 2 (Ax) + (2Ax) 2 (Ax) + - ■ + (nAx) 2 (Ax) = ^ (jAx) 2 {Ax), (8) 

j = i 

100 

Factor out (Ax)' 1 , leaving a sum of n squares. The area is (Ax) 3 times and n — — : 


(Ax) 3 


100 

Ax 


1/100Y | /too' 
2 \Ax ) + 6 ( A7 


\ 100 3 + f 100 2 {Ax) + ~ l00(Ax) 2 . 
3 2 6 


(9) 


This equation show's what is happening. The leading term is a third of a million, 
as predicted. The other terms arc approaching zero! They contain Ax, and as the 
rectangles get thinner they disappear. They only account for the small corners of 
rectangles that lie above the curve. The vanishing of those corners will eventually be 
proved for any continuous functions — the area from the correction terms goes to 
zero — but here in equation (9) you see it explicitly. 

The area under the curve came from the central idea of integration: 100/Ax rectan- 
gles of width Ax approach the limiting area = ^(100)^ The rectangular area is L vj Ax. 
The exact area is j r(x) dx. In the limit £ becomes j and Vj becomes t^x) txnd A x 
becomes dx. 


That completes the calculation for a parabola. It used the formula for a sum of 
squares, which was special. But the underlying idea is much more general. The limit 
of the sums agrees with the antiderivative: The antiderivative ofv(x) = x 2 is f(x) = ^x 2 > 
According to the Fundamental Theorem, the area under r(x) is f(x): 

CV*) dx=jnm-m = 3(ioo)\ 

That Fundamental Theorem is not yet proved! I mean it is not proved by us. Whether 
Leibniz or Newton managed to prove it, I am not quite sure. But it can be done. 
Starting from sums of differences, the difficulty is that wc have too many limits at 
once. The sums of r^Ax are approaching the integral. The differences A// Ax approach 
the derivative, A real proof has to separate those steps, and Section 5,7 will do it. 

Proved or not, you are seeing the main point. What was true for the numbers j) 
and rj is true in the limit for r(.\) and /(x). Now t(x) can vary continuously, but it is 
still the slope of /(x). The reverse of slope is area . 



(l+2 + 3 + 4) 2 = l 1 + 2" + 3 J + 4' 

Proof without words by Roger Nelson {Mathematics Magazine 1990). 


Finally wc review the area under r = x. The sum of 1 — 2 + — i- n is jn 2 — yn. This 
gives the area of n = 4/ Ax rectangles, going out to x = 4, The heights arc yAx, the 
bases are Ax, and wc add areas: 


4'i.t 

X 

i 


(_/Ax)(Ax) = (Ax) 2 


1 ~ 
2 \ Ax 


1 

H- — 

2 


Ax,, 


= 8 + 2Ax. 


(. 10 ) 
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With Ax = 1 the area is 1+2 + 3 + 4=10. With eight rectangles and Ax = y, the 
area was 8 + 2Ax — 9. Sixteen rectangles of width \ brought the correction 2Ax down 
to y. The exact area is 8. The error is proportion ol to Ax . 

Important note There you see a question in applied mathematics. If there is an error, 
what size is it? How does it behave as Ax 0? The Ax term disappears in the limit, 
and {Ax) 2 disappears faster. But to get an error of 10^ 6 we need eight million 
rectangles : 

2Ax = 2-4/8, 000,000= 10“ 6 . 

That is horrifying! The numbers 10, 9, 8y, 8^, ... seem to approach the area 8 in a 
satisfactory way, but the convergence is much too slow. It takes twice as much work 
to get one more binary digit in the answer — which is absolutely unacceptable. Some- 
how the Ax term must be removed. If the correction is (Ax) 2 instead of Ax, then a 
thousand rectangles will reach an accuracy of 10 ~ 6 . 

The problem is that the rectangles are unbalanced. Their right sides touch the graph 
of v, but their left sides are much too high. The best is to cross the graph in the middle 
of the interval — this is the midpoint rule. Then the rectangle sits halfway across the 
line v = x, and the error is zero. Section 5.8 conies back to this rule — and to Simpson’s 
rule that fits parabolas and removes the (Ax) 2 term and is built into many calculators. 

Finally we try the quick way. The area under v = x \sf= yx 2 , because dfjdx is v , 
The area out to x = 4 is y(4) 2 = 8. Done. 



4 9 12 3 


fig. 5.7 Endpoint rules: error ^ l/(work) ~ 1/n. Midpoint rule is belter: error ^ l/(work) 2 . 

Optional : pth powers Our sums are following a pattern. First, 1 + -** + n is jn 2 plus 
yn. The sum of squares is ^n 3 plus correction terms. The sum of pth powers is 

Ip + 2 P + — + n p = — + 1 plus correction terms . (1 1) 

P + 1 

The correction involves lower powers of n, and you know what is coming. Those 
corrections disappear in calculus. The area under v = x p from 0 to n is 

r a Ft /a x i 

x p dx = lim £ 0‘AxnAx)=— ; - n p+1 . (12) 

Jx = o ;=1 p+l 

Calculus doesn’t care if the upper limit n is an integer, and it doesn’t care if the power 
p is an integer. We only need p + 1 > 0 to be sure n p+ L is genuinely the leading term. 
The antiderivative of v = x p is f = x p+ l j(p + 1). 

We are close to interesting experiments. The correction terms disappear and the 
sum approaches the integral. Here are actual numbers for p = 1, when the sum and 
integral are easy: £ n = 1 + ••• + n and l n = Jx dx = jn 2 . The difference is D n = yn. The 
thing to watch is the relative error E n = DJI„: 


n 

s n 

/. 


E, = OJI, 

100 

5050 

5000 

50 

.010 

200 

20100 

20000 

100 

.005 
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The number 20100 is *(200)(201). Please write down the next line n = 400, and please 
find a formula for £ n . You can guess E n from the table, or you can derive it from 
knowing S n and The formula should show that E n goes to zero. More important, 
it should show how quick (or slow) that convergence will be. 

One more number — a third of a million — was mentioned earlier. It came from 
integrating x 2 from 0 to 100, which compares to the sum S 100 of 100 squares: 

n p S n /„ = *n 3 D = S-I E — D/I 

100 2 338350 333333* 5016* 01505 

200 2 2686700 2666666* 20033* 0075125 

These numbers suggest a new idea, to keep n fixed and change The computer can 
find sums without a formula! With its help we go to fourth powers and square roots: 

" P S«l*+ ■•• + »' / — n p+l /(p+ 1) D~S-I D/I 

100 4 2050333330 *<100) 5 50333330 0.0252 

100 * 671.4629 *<100) 3/2 4.7963 0.0072 

In this and future tables we don’t expect exact values. The last entries are rounded 
off, and the goal is to see the pattern. The errors E n p are sure to obey a systematic 
rule — they are proportional to 1/n and to an unknown number C(p) that depends 
on p . I hope you can push the experiments far enough to discover C(p). This is not 
an exercise with an answer in the back of the book — it is mathematics. 


5.3 EXERCISES 


Read-through questions 

The Greek letter o indicates summation. In Z* Vj the 
dummy variable is b . The limits are £ , so the first 

term is d and the last term is * When Vj = j this 
sum equals f . For n = 100 the leading term is 0 
The correction term is h The leading term equals the 
integral of v = x from 0 to 100, which is written * The 
sum is the total J of 100 rectangles, The correction term 
is the area between the * and the * 

The sum Z?_ 3 t 2 is the same as Z* =1 m and equals 
" . The sum Z? =4 v t is the same as o and equals 

p . For = Z* = t Vj the difference equals Q . 

The formula for l 2 + 2 2 + + n 2 is f H = 1 To prove 

it by mathematical induction, check f = » and check 

f n —f n - 1 = t The area under the parabola v = x 2 from 
x = 0 to x = 9 is « . This is dose to the area of y 

rectangles of base Ax. The correction terms approach zero 
very » . 

4 5 

1 Compute the numbers Y 1/rr and Y (2i — 3). 

f»= 1 i=2 

3 « 

2 Compute Y (j 2 — y) and £ 1/2 J . 


6 * 

3 Evaluate the sum Y 2 l and Y 2'. 

i = 0 i = 0 

4 Evaluate Y ( — l) 1 * and Y ( — 1) J J- 

i=l 

5 Write these sums in sigma notation and compute them: 

2 + 4 + 6 + — + 100 l + 3 + 5-h — 1-1 99 l-*+*-± 

6 Express these sums in sigma notation: 

— v 2 + 1^3 — V A V X W X + V 2 V>2 + + V„Yt>' Uj -h t?3 -h 

7 Convert these sums to sigma notation: 

. 271.471 

a 0 + a x x- h — I- j/ sin — + sm — H hsm 2 n 

ft ft 

8 The binomial formula uses coefficients 

(a + br = G) a ” + (i) a ”‘ 1 6 + ■■■ + (") 6 " = A — bK 

100 1000 

9 With electronic help compute Y 1 lj and Y I /J* 

i t 

10 10 

10 On a computer find ^ { — 1 )■'/_/ ! times Y 1 jj\ 
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11 Simplify X (a, 4- b,) 2 + £ ( 0i - b t f to X . 

(=1 1=1 1=1 

12 Show that ^ X af and ^ a » b i a ] Y, h- 

* io / 1 i\ 

13 "Telescope" the sums V (2 fc — 2 t_1 )and J] — - — :). 

i=i j= i y + l j / 

All but two terms cancel, 

14 Simplify the sums Y (/,-/)_ i) and X (/;+i -/;)• 

M j=* 

8 6 9 11 

15 Tree m false. (a) X r,= X t>i-a (b)[»i=y fi -2 

J =4 /= 2 1=1 

A 6 8 

16 £ », = x . “<* X * 3 = X ■ 

(=1 J =0 1=0 i = 2 

17 The antiderivative of d 2 fjdx 1 is dfjdx. What is the sum 
{h ~ 2 h +/o) + (A - 2/ 2 +A> + " + <A - Vi +/ 7 J? 

18 Verify that l 2 + 2 2 H — + n 2 is /, — 
n(n + l)(2n + 1)/6 by checking that /i is correct and 

19 Prove by induction: 1 + 3 + *'• + (2n— 1) = n\ 

20 Verify that l 3 + 2 3 + -+rt 3 isA = i« 2 (fl + l) 2 by check- 
ing/! and /*—/„-!■ The text has a proof without words . 

21 Suppose f n has the form an + bn 2 + cn 3 . If you know 
A — l, fi = 5, f* — 14, turn those into three equations for 
a, b, c. The solutions 0 = £, c-i give what formula? 

22 Find q in the formula l 8 H \-n 8 = qri 9 + correction- 

23 Add n = 400 to the table for S n = 1 + — + n and find the 
relative error E n . Guess and prove a formula for E n . 

24 Add n = 50 to the table for S a = l 2 + 4- n 2 and compute 

E 50 . Find an approximate formula for £„, 

25 Add p = i and p = 3 to the table for S i0f)tP = 
l p + — + 100 p . Guess an approximate formula for E 100tP . 

26 Guess C(p) in the formula E ntP & C(p)/n. 

27 Show that |1 - 5| < [1| + | — 5|, Always |i? t + v z \ < |i?i| + |u 2 | 

unless . 


28 Let S be the sum 1 + x + x 2 + ■ - of the (infinite) geometric 
series. Then xS = x + x 2 + x 3 H — is the same as S minus 

. Therefore S = . None of this makes sense 

if * — 2 because , 


29 The double sum V \ V (i+/)l is v x = V (1 +j) plus 

i=l U =i J t=1 

v 2 = X (2 4- j}. Compute Uj and v 2 and the double sum, 
i= 1 


30 The double sum 



The double sum 


(Wl.l +'*’j,l) + (Wl.2 + W 2.i} + 


(Wl,l+Wl.i+W,j) + 

Compare. 


31 Find the flaw in the proof that 2" = 1 for every 
n = 0, 1, 2, .... For n — 0 we have 2° = 1. If 2* = 1 for every 
n < V, then 2* = 2 s - 1 • 2 N ~ L /2*“ 2 = M/1 = 1. 


32 Write out all terms to see why the following are true: 

33 The average of 6, 11, 4 is £ = i(6F 11 +4). Then 

(6 — u) + {1 1 — iJ) + (4 — u) = , The average of 

Vi, is v = . Prove that Z(^ — i)) = 0. 


34 The Schwarz inequality is 




Compute both sides if a x = 2, a 2 = 3, hi = 1, b z =4. Then 
compute both sides for any fli, b 2 - The proof in 

Section 11.1 uses vectors. 


35 Suppose n rectangles with base Ax touch the graph of t^x) 
at the points x = Ax, 2Ax, ..., nAx. Express the total rectan- 
gular area in sigma notation. 

36 If 1/Ax rectangles with base Ax touch the graph of i<x) 
at the left end of each interval (thus at x = 0, Ax, 2Ax, 
express the total area in sigma notation. 


H^/UAx) —f[(j — l)Ax) 


37 The sum Ax y 

M Ax 

In the limit this becomes 


equals , 


dx = 



5.4 Indefinite Integrals and Substitutions 



This section integrates the easy way, by looking for antiderivatives. We leave aside 
sums of rectangular areas, and their limits as Ax 0. Instead we search for an f(x) 
with the required derivative v(x). In practice, this approach is more or less indepen- 
dent of the approach through sums— but it gives the same answer. And also, the 
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search for an antiderivative may not succeed * We may not find /. In that case we go 
back to rectangles, or on to something better in Section 5*8* 

A computer is ready to integrate v , but not by discovering /* It integrates between 
specified limits, to obtain a number (the definite integral). Here we hope to find a 
function (the indefinite integral)* That requires a symbolic integration code like 
MACSYMA or Mathematica or MAPLE, or a reasonably nice t?(x) ? or both* An 
expression for /(x) can have tremendous advantages over a list of numbers* 

Thus our goal is to find antiderivatives and use them* The techniques will be further 
developed in Chapter 7 — this section is short but good* First we write down what 
we know* On each line y f(x) is an antiderivative of t?(x) because dfjdx = n(x)* 


Known pairs 

Function v{x) 

Antiderivative f (x) 

Powers of x 

x" 

x" + l /(n + 1) + C 

- 1 is not included, because 

n-H 1 would be zero* 

v- x _1 will lead us 

Trigonometric functions 

cos X 

sin x + C 


sin x 

— cos x + C 


sec 2 x 

tan x + C 


esc 2 x 

- cot x + C 


sec x tan x 

sec x + C 


CSC x cot X 

— esc x + C 

Inverse functions 


sin” 1 x -1- C 


1/(1 + X 2 ) 

tan - 1 x -1- C 


vwy* 2 - 1 

sec - 1 x + C 


You recognize that each integration formula came directly from a differentiation 
formula* The integral of the cosine is the sine, because the derivative of the sine is 
the cosine* For emphasis we list three derivatives above three integrals: 

d 

— (constant) = 0 
ax 

| 0 dx — C 

There are two ways to make this list longer* One is to find the derivative of a new 
/( x). Then / goes in one column and v = df jdx goes in the other column*t The other 
possibility is to use rules for derivatives to find rules for integrals* That is the way to 
extend the list, enormously and easily, 

RULES FOR INTEGRALS 

Among the rules for derivatives, three were of supreme importance* They were linear- 
ity, the product rule , and the chain rule * Everything flowed from those three. In the 



fWe will soon meet e*, which goes in both columns. It isf[x) and also r(x)* 
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reverse direction (from v to /) this is still true. The three basic methods of differential 
calculus also dominate integral calculus: 

linearity of derivatives linearity of integrals 
product rule for derivatives integration by parts 
chain rule for derivatives — ► integrals by substitution 

The easiest is linearity, which comes first. Integration by parts will be left for 
Section 7.1. This section starts on substitutions, reversing the chain rule to make an 
integral simpler. 


LINEARITY OF INTEGRALS 

What is the integral of v(x)~ w(x)? Add the two separate integrals. The graph of 
v 4 vv has two regions below it, the area under v and the area from v to v + w. 
Adding areas gives the sum rule. Suppose / and g are antiderivatives of v and w; 

sum rule: f+ g is an antiderivative of i- + w 

constant rule: cf is an antiderivative of cv 

linearity : ti/4 hg is an antiderivative of av + bw 

This is a case of overkill. The first two rules are special cases of the third, so logically 
the last rule is enough. However it is so important to deal quickly with constants — 
just 'factor them out " — that the rule cv+-*cf is stated separately. The proofs eome 
from the linearity of derivatives: (aj + bg)' equals af + bg* which equals av + bw. 
The rules can be restated with integral signs: 

sum rule: J \ r(.v) + vv(.x) J dx = j t-(x) dx + j vv(.x) dx 

constant rule : j cr(.v) dx = c | r(x) dx 

linearity : j [ ttE'(x) — Mv(x) J dx = a j i(x) dx 4 h j w(x) tlx 

Note about the constant in f(x)~ C. All antiderivatives allow the addition of a con- 
stant. For a combination like ar(x)4 fm-'(x), the antiderivative is u/'(x) 4 b£(x) + C. 
The constants for each part combine into a single const ant. To give all possible antide- 
rivatives of a function, just remember to write " + after one of them. The real 
problem is to find that one antiderivative, 

EXAMPLE 1 The antiderivative of v = x 2 4 x ” 2 is / = x’\'3 4 (x ~ 1 )/(— 1) 4 C. 

EXAMPLE 2 The antiderivative of 6 cos t + 7 sin r is 6 sin t — 7 cos x 4 C. 

- * 1 1 - sin x 1 - sin x , 

EXAMPLE 3 Rewrite as ^ — = — — = — = sec x - sec x tan x, 

1 ^ sin x 1 — sm~ x cos x 

The antiderivative is tan x - sec x + C, That rewriting is done by a symbolic algebra 
code (or by you). Differentiation is often simple, so most people check that df;dx = r(xf 

Question How to integrate tan 2 x? 

Method Write it as sec 2 x - 1 , Answer tan x - x 4 C. 



198 


5 Integra It 


INTEGRALS BY SUBSTITUTION 

We now present the most valuable technique in this section substitution. To see the 
idea, you have to remember the chain rule: 

/(£(*)) has derivative f f {g{x)){dgjdx) 

sin x 2 has derivative (cos x 2 )(2x) 

(x 3 + l) 5 has derivative 5(x 3 + l) 4 (3x 2 ) 

If the function on the right is given, the function on the left is its antiderivative! There 
are two points to emphasize right away: 

L Constants are no problem — they can always be fixed . Divide by 2 or 15: 

J x cos(x 2 ) dx = - sin(x 2 ) + C J x 2 (x 3 + l) 4 dx = ^(* 3 + ^ + C 

Notice the 2 from x 2 , the 5 from the fifth power, and the 3 from x 3 . 

2. Choosing the inside function g (or u) commits us to its derivative : 

the integral of 2x cos x 2 is sin x 2 + C (g = x 2 , dgjdx = 2x) 

the integral of cos x 2 is {failure} (no dgjdx) 

the integral of x 2 cos x 2 is (failure) (wrong dgjdx) 

To substitute g for x 2 , we need its derivative- The trick is to spot an inside function 
whose derivative is present* We can fix constants like 2 or 15, but otherwise dgjdx 
has to be there* Very often the inside function g is written u. We use that letter to state 
the substitution rule , when / is the integral of v: 

| v{u(x))^dx=mx)) + C. (1) 

EXAMPLE 4 j sin x cos x dx — ^(sin x) 2 + C u = sin x (compare Example 6) 

EXAMPLE 5 J sin 2 x cos x dx = i(sin x) 3 + C u = sin x 

EXAMPLE 6 | cos x sin x dx= - ^(cos x) 2 + C u = cos x (compare Example 4) 

EXAMPLE 7 j tan 4 x sec 2 x dx — 3 (tan x) 5 + C u = tan x 

The next example has u = x 2 — 1 and dujdx = 2x. The key step is choosing u: 

EXAMPLE fl J x dxjjx 1 - 1 = ^x 2 - 1 + C j x^x 2 - 1 dx = ^(x 2 - l) 3 ' 2 + C 

A shift of x (to x + 2) or a multiple of x (rescaling to 2x) is particularly easy: 

EXAMPLES 9-10 J (x + 2) 3 dx = i(x + 2) 4 + C f cos 2x dx = i sin 2x + C 

You will soon be able to do those in your sleep. Officially the derivative of (x + 2) 4 
uses the chain rule. But the inside function u = x + 2 has dujdx — 1. The “1” is there 
automatically, and the graph shifts over — as in Figure 5.8b. 

For Example 10 the inside function is u- 2x. Its derivative is dujdx = 2. This 
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l>(.t + 1^^^ 
area 
/(a+1) 

-/(I) 

0 2 0 1 0 10 1 

Fig. 5.8 Substituting u = x + 1 and u = 2x and u = x 2 . The last graph has half of duldx = 2x. 

required factor 2 is missing in J cos 2x dx, but we put it there by multiplying and 
dividing by 2. Check the derivative of { sin 2x: the 2 from the chain rule cancels the 
The rule for any nonzero constant is similar: 

t>(x + c) dx =/(x + c) and J* t>(cx) dx = ^ f(cx). (2) 

Squeezing the graph by c divides the area by c. Now 3x + 7 rescales and shifts: 




EXAMPLE 11 f cos(3x + 7)dx = ±sin(3x+7)+C j (3x+7) 2 dx = H(3x + 7) 3 + C 

Remark on writing down the steps When the substitution is complicated, it is a good 
idea to get du/dx where you need it. Here 3x 2 + 1 needs 6x: 


7x(3x 2 + l) 4 dx 


4J 


(3x 2 +l) 4 6x dx 


-1 f 4^ 

6 J “ dx 1 


Now integrate: 


7u 5 7(3x 2 +l) 5 

6 5 + C 6 5 


+ C. 


(3) 


Check the derivative at the end. The exponent 5 cancels 5 in the denominator, 6x from 
the chain rule cancels 6, and lx is what we started with. 


Remark on differentials In place of (du/dx) dx , many people just write du : 

j (3x 2 + l) 4 6x dx = j u 4 du = \u 5 + C. (4) 

This really shows how substitution works. We switch from x to u, and we also switch 
from dx to du. The most common mistake is to confuse dx with du. The factor du/dx 
from the chain rule is absolutely needed, to reach du. The change of variables (dummy 
variables anyway!) leaves an easy integral, and then u turns back into 3x 2 4- 1. Here 
are the four steps to substitute u for x: 

1. Choose u(x) and compute du/dx 

2. Locate v(u) times du/dx times dx , or v(u) times du 

3. Integrate J r(u) du to find /(u) + C 

4. Substitute u(x) back into this antiderivative /. 


EXAMPLE 12 J(cos yjx)dx/ 2 v /x = J cos u du= sin u + C = sin ^x 4- C 

(put in u) (integrate) (put back x) 

The choice of u must be right, to change everything from x to u. With ingenuity, 
some remarkable integra ls are poss ible. But most will remain impossible forever. The 
functions cos x 2 and l/ x /4 - sin 2 x have no “elementary” antiderivative. Those integ- 
rals are well defined and they come up in applications — the latter gives the distance 
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around an ellipse. That can be computed to tremendous accuracy, but not to perfect 
accuracy. 

The exercises concentrate on substitutions, which need and deserve practice. We 
give a nonexample — J (x 2 + l) 2 dx does not equal y(x 2 -I- l) 3 — to emphasize the need 
for dujdx . Since lx is missing, u = x 2 + 1 does not work. But we can fix up n: 

r . J r . du i i 

sin nx dx — sin u — = cos u F C cos nx F C. 

[ Inn n 


5.4 EXCISES 


Read-through questions 

Finding integrals by substitution is the reverse of the g 
rule- The derivative of (sin x) 3 is b , Therefore the antide- 
rivative of c is d . To compute J (1 4- sin x) 2 cos x dx, 

substitute u= g . Then dujdx = * so substitute 

du — 0 . In terms of u the integral is J h = 1 . 

Returning to x gives the final answer. 

The best substitutions for j tan (x F 3) sec 2 (x F 3 )dx 
and j (x 2 F l) 10 x dx are u = I and u = k . Then 
du = 1 and m . The answers are n and o , 

The antiderivative of v dv/dx is P . [2xdx/(lFx 2 ) 
leads to J Q « which we don't yet know. The integral 
| dx/( 1 Fx 2 ) is known immediately as * . 


Find the indefinite integrals in 1-20. 

1 f Jl+x dx (add F C) 2 j J 3 — x dx (always F C) 


3 f (x+I )*dx 
5 \(x 2 + l) 5 xdx 
7 J cos 3 x sin x dx 
9 J cos 3 2x sin 2x dx 

ii f dt/yr^? 

13 j 

15 J(l+v^) 

17 | sec x tan x dx 
19 J cos x tan x dx 


4 j (i + ir<ix 
6 J ^/l — 3x dx 
8 J cos x dx/sin 3 x 
10 j cos 3 x sin 2x dx 
12 J t^/l-t 2 A 
14 J r 3 ^!^? A 
16 j (1 + x vl )yfx dx 
18 J sec 2 x tan 2 x dx 
20 J sin 3 x dx 


In 21-32 find a function y(x) that solves the differentia! 
equation. 

21 dyjdx = x 2 F yfx 22 dy/dx = y 2 (try y = ex' 1 ) 

23 dyjdx = J 1 - 2x 24 dyjdx = ljj\-2x 


25 dyjdx = 1/y 

26 dyjdx = x/y 

27 d 2 yjdx 2 = 1 

28 d 5 y/dx 5 — 1 

29 d 2 y/dx 2 = -y 

30 dyjdx — yjxy 

31 d 2 yjdx 2 = ^ 

32 (dyjdx) 2 = 

33 True or false , when / is an antiderivative of 

(a) { t*u(x)) dx 

-/(«•(*)) + C 


(b) \v 2 (x)dx = ±p(x) + C 

(c) | v(x)(dujdx) dx -/ (u(x)) F C 

(d) J^(x)(d^x)dx-i/ 2 (x)FC 

34 True or false , when / is an antiderivative of v: 

(a) l f(x)(dvfdx)dx = U 2 {x) + C 

(b) } v(v(x))(dvjdx) dx =f(v(x)) F C 

(c) Integral is inverse to derivative so /(u(x)) = x 

(d) Integral is inverse to derivative so J (df jdx) dx = /(x) 


35 If d/* /dx = i?(x) 

then } ufx — 1} dx = 

and 

\v(xjl) dx = 


36 If dfjdx = ufjc) 

| t>fjc 2 ljc dx « 

then [ v(2x — Udx = 

and 


37 




r SO 


It 


x"dx 


I Fx 2 ” 1 ~~ 1 Fx 2 

38 j (x 2 F \) 2 dx is not i(x 2 F l) a but 

39 [ 2x dx/(x 2 F 1) is J 


Fx 2 

,3 


du which will soon be In u. 


40 Show that J 2x 3 dxj(\ F x 2 ) 3 = [ (u — 1) du/u 3 = , 

41 The acceleration d 2 fjdt 2 = 9.8 gives f(t) = 

integration constants). 


(two 


42 The solution to d 4 yjdx* = 0 is (four constants). 


43 If f(t) is an antiderivative of t>(f), find antiderivatives of 
(a) tj(t + 3) (b) «(t) + 3 (c) 3t>(f) (d) v(3t). 




5.5 The Definite Integral 


The integral of v(x) is an antiderivative f(x) plus a constant C. This section takes 
two steps. First, we choose C. Second, we construct /( x). The object is to define the 
integral — in the most frequent case when a suitable f(x) is not directly known. 

The indefinite integral contains + C ” The constant is not settled because /(x) + C 
has the same slope for every C. When we care only about the derivative, C makes 
no difference. When the goal is a numher — a definite integral — C can be assigned a 
definite value at the starting point. 

For mileage traveled, we subtract the reading at the start . This section does the 
same for area. Distance is f{t) and area is f(x) — while the definite integral is 
f(b) ”/(a). Don’t pay attention to t or x, pay attention to the great formula of integral 
calculus: 

v(x)dx=m~m to 

Viewpoint 1 : When / is known, the equation gives the area from a to b . 
Viewpoint 2: When / is not known, the equation defines / from the area. 

For a typical v(x), we can’t find f(x) by guessing or substitution. But still v(x) has an 
“area” under its graph— and this yields the desired integral /(x). 

Most of this section is theoretical, leading to the definition of the integral. You 
may think we should have defined integrals before computing them — which is logi- 
cally true. But the idea of area (and the use of rectangles) was already pretty clear in 
our first examples. Now we go much further. Every continuous function u(x) has an 
integral (also some discontinuous functions). Then the Fundamental Theorem com- 
pletes the circle: The integral leads back to dfjdx = v(x). The area up to x is the 
antiderivative that we couldn’t otherwise discover. 

THE CONSTANT OF INTEGRATION 

Our goal is to turn f(x) + C into a definite integral — the area between a and The 
first requirement is to have area = zero at the start: 

f(a) + C = starting area = 0 so C = -f(a). (2) 

For the area up to x (moving endpoint, indefinite integral), use t as the dummy variable: 

the area from a to x is J* t?(f) dt = f{x) — f(a) [indefinite integral) 

the area from a to b is j b a u(x) dx = f(b) — f(a ) (definite integral) 

EXAMPLE 1 The area under the graph of 5(x + l) 4 from a to b has f(x) = (x + l) 5 : 

£ 5(* + l) 4 dx = (x + l) 5 J = {b+ l) s - [a + l) s . 

The calculation has two separate steps — first find /(x), then substitute b and a . After 
the first step, check that dfjdx is v. The upper limit in the second step gives plus f{b\ 
the lower limit gives minus /(a). Notice the brackets (or the vertical bar): 

/(*)]!=/(&)-/(<*) X 3 |? = 8-l [cos x]l' = COS 2f - 1. 

Changing the example to /(x) = (x + l) 5 — 1 gives an equally good antiderivative — 



and now /( 0) = 0. But f(b) —f(a) stays the same, because the - 1 disappears: 
[(* + l) s - 1 J = ((b + l) s - 1) - ((a + l) s -!) = {&+ l) s - (o + l) 3 . 


EXAMPLE 2 When v = 2x sin x 2 we recognize / » -cos x 2 . The area from 0 to 3 is 

H 2x sin x 2 dx = “ cos x 2 = “ cos 9 + cos 0, 

The upper limit copies the minus sign. The lower limit gives — (-cosO), which is 
+ cos 0. That example shows the right form for solving exercises on definite integrals. 


Example 2 jumped directly to /(x) = — cos x 2 . But most problems involving the 
chain rule go more slowly — by substitution. Set u = x 2 , with dujdx — lx: 

r , , p . du r . 

2x sin x z dx = sin u — dx = sin u du . (3) 

Jo Jo dx J ? 

We need new limits when u replaces x 2 , Those limits on u are a 2 and b 2 . (In this case 
a 2 — 0 2 and b 2 — 3 2 — 9,) If x goes from a to b, then u goes from u(a) to u(b). 


EXAMPLES 


(" ( x 2 + 5) 3 x dx= f* 

J* = 0 J* 


u(a) 

6 


v(u) d u =f(u(b))-f(u(a)). 

5 U 2 8 J 5 8 8’ 


(4) 


In this case u = x 2 + 5* Therefore dujdx = 2x (or du = 2x dx for differentials). We have 
to account for the missing 2. The integral is |u 4 . The limits on u=x 2 + 5 are 
u( 0) = 0 2 + 5 and u( 1)= l 2 + 5, That is why the u-integral goes from 5 to 6. The 
alternative is to find f(x) = £(x 2 + S) 4 in one jump (and check it). 


EXAMPLE 4 jo sin x 2 dx = ?? (no elementary function gives this integral}. 

If we try cos x 2 , the chain rule produces an extra 2x — no adjustment will work. Does 
sin x 2 still have an antiderivative? Yesl Every continuous v(x) has an /(x). Whether 
f{x) has an algebraic formula or not, we can write it as j v(x) dx . To define that 
integral, we now take the limit of rectangular areas. 


INTEGRALS AS LIMITS OF “RIEMANN SUMS" 


We have come to the definition of the integral . The chapter started with the integrals 
of x and x 2 , from formulas for 1 + **- + n and l 2 + *** + n 2 . We will not go hack to 
those formulas. But for other functions, too irregular to find exact sums, the rectangu- 
lar areas also approach a limit. 

That limit is the integral. This definition is a major step in the theory of calculus. 
It can be studied in detail, or understood in principle. The truth is that the definition 
is not so painful — you virtually know it already. 

Problem Integrate the continuous Junction v(x) over the interval [ a > b]. 

Step 1 Split [a, b] into n suhintervals [a, x^, [x u x 3 ], [x n _ !5 b]. 

The “meshpoints” x t , x 3 , ... divide up the interval from a to b. The endpoints are 
x 0 = a and x„ = b. The length of subinterval Jfc is Ax fc = x k — x k ^ x . In that smaller 
interval, the minimum of v(x) is m k . The maximum is M k , 
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Now construct rectangles. The “ lower rectangle” over interval k has height m k . The 
"upper rectangle ” reaches to M k . Since v is continuous, there are points x min and x m&x 
where v = m k and v = M k (extreme value theorem). The graph of v(x) is in between. 
Important: The area under u(x) contains the area "s” of the lower rectangles: 

J* u(x) dx ^ m x + m 2 Ax 2 + + fli n Ax„ = s. (5) 

The area under v(x) is contained in the area “ S " of the upper rectangles: 

J* u(x) dx ^ Mi Ax* + M 2 Ax 2 + + M„Ax„ = S. (6) 

The lower sum s and the upper sum S were computed earlier in special cases— when 

v was x or x 2 and the spacings Ax were equal. Figure 5.9a shows why s ^ area ^ S. 





Fig. 5.9 Area of lower rectangles = s. Upper sum S includes top pieces. Riemann sum 5* is in between. 


Notice an important fact. When a new dividing point x' is added, the lower sum 
increases . The minimum in one piece can be greater (see second figure) than the 
original m k . Similarly the upper sum decreases. The maximum in one piece can be 
below the overall maximum. As new points are added, s goes up and S comes down . 
So the sums come closer together: 

s < s' < S f ^ S. (7) 

I have left space in between for the curved area — the integral of u(x). 

Now add more and more meshpoints in such a way that Ax max -► 0. The lower 
sums increase and the upper sums decrease. They never pass each other, if u(x) is 
continuous, those sums close in on a single number A. That number is the definite 
integral- — the area under the graph. 

DEFINITION The area A is the common limit of the lower and upper sums: 

s -* A and S -* A as Ax m „ — > 0. (8) 

This limit A exists for all continuous v(x), and also for some discontinuous functions. 
When it exists, A is the "Riemann integral” of v(x) from a to b . 

REMARKS ON THE INTEGRAL 

As for derivatives, so for integrals: The definition involves a limit. Calculus is built 
on limits, and we always add “if the limit exists.” That is the delicate point. I hope 
the next five remarks (increasingly technical) will help to distinguish functions that 
are Riemann integrable from functions that are not. 

Remark 1 The sums s and S may fail to approach the same limit. A standard 
example has K(x) = 1 at all fractions x = pjq, and K(x) = 0 at all other points. Every 
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interval contains rational points (fractions) and irrational points (nonrepeating deci- 
mals). Therefore m k = 0 and M k = 1* The lower sum is always s ~ 0. The upper sum 
is always 5 = b — a (the sum of Fs times Ax’s)* The gap in equation (7) stays open . This 
function V(x) is not Riemann integrable. The area under its graph is not defined (at 
least by Riemann — see Remark 5). 

Remark 2 The step function f/(x) is discontinuous but still integrable* On every 
interval the minimum m k equals the maximum M k — except on the interval containing 
the jump* That jump interval has m k — 0 and M k = 1 . But when we multiply by Ax fc1 
and require Ax mas -» 0, the difference between s and S goes to zero. The area under 
a step function is clear— the rectangles fit exactly* 

Remark J With patience another key step could be proved: If s -+ A and S -*• A for 
one sequence of meshpoints, then this limit A is approached by every choice of mesh- 
points with Ax ma3l “+ 0, The integral is the lower bound of all upper sums S y and it is 
the upper bound of all possible s — provided those bounds are equal* The gap must 
close, to define the integral. 

The same limit A is approached by “in-between rectangles."" The height u(x*) can 
be computed at any point xjf in subinterval k. See Figures 5*9c and 5.10* Then the 
total rectangular area is a “ Riemann sum" between s and S: 

S * — v(x* )Ax! + ufx* )Ax 2 + + u(x*)Ax n . (9) 

We cannot tell whether the true area is above or below $*. Very often A is closer to 
5* than to s or 5* The midpoint rule takes x? in the middle of its interval (Figure 5.10), 
and Section 5*8 will establish its extra accuracy. The extreme sums s and S are used 
in the definition while S* is used in computation* 







right 


mid 


max 


Fig. 5.10 Various positions for xf in the base. The rectangles have height v(xf ). 



any a* 


Remark 4 Every continuous function is Riemann integrable . The proof is optional (in 
my class), but it belongs here for reference. It starts with continuity at x*: “For cmy 
e there is ad ,***” When the rectangles sit between x* ~ 6 and x* + 5 , the bounds M u 
and differ by less than 2s. Multiplying by the base Ax k , the areas differ by less 
than 2e(Ax*)* Combining all rectangles, the upper and lower sums differ by less than 
2e(Ax ! + Ax 2 + ■■■ + Ax n ) = 2t{b — a). 

As £ -^0 we conclude that S comes arbitrarily close to s. They squeeze in on a 
single number A. The Riemann sums approach the Riemann integral, if v is continuous . 

Two problems are hidden by that reasoning. One is at the end, where S and s come 
together. We have to know that the line of real numbers has no “holes,"' so there is 
a number A to which these sequences converge. That is true. 

Any increasing sequence , if it is bounded above, approaches a limit * 

The decreasing sequence S 7 bounded below, converges to the same limit. So A exists. 

The other problem is about continuity. We assumed without saying so that the 
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width 25 is the same around every point x*. We did not allow for the possibility that 
5 might approach zero where v(x) is rapidly changing — in which case an infinite 
number of rectangles could be needed. Our reasoning requires that 

v{x) is uniformly continuous: 6 depends on £ but not on the position of x*. 

For each e there is a 5 that works at all points in the interval. A continuous function 
on a closed interval is uniformly continuous. This fact (proof omitted) makes the 
reasoning correct, and v(x) is integrable. 

On an infinite interval, even v = x 2 is not uniformly continuous. It changes across 
a subinterval by (jc* + 5) 2 — (x* - 5 ) 2 — 4x*5. As x* gets larger, 5 must get smaller — 
to keep 4x*5 below e. No single <5 succeeds at all jc*. But on a finite interval [0, b], 
the choice <5 = cj4b works everywhere — so v = x 2 is uniformly continuous. 

Remark 5 If those four remarks were fairly optional, this one is totally at your 
discretion. Modem mathematics needs to integrate the zero-one function K(jc) in the 
first remark. Somehow V has more 0’s than l’s. The fractions (where V(x) - 1) can 
be put in a list, but the irrational numbers (where V(x) — 0) are “uncountable.” The 
integral ought to be zero, hut Riemann’s upper sums all involve Af k = 1. 

Lebesgue discovered a major improvement. He allowed infinitely many subintervals 
(smaller and smaller). Then all fractions can be covered with intervals of total width 
a. (Amazing, when the fractions are packed so densely.) The idea is to cover lfq, 2jq y 
q/q by narrow intervals of total width e/2*. Combining all q - 1, 2, 3, ... , the total 
width to cover all fractions is no more than £(i + i + i+ = Since K(jc) = 0 
everywhere else, the upper sum S is only e. And since e was arbitrary, the “ Lebesgue 
integral ” is zero as desired. 

That completes a fair amount of theory, possibly more than you want or need — 
but it is satisfying to get things straight. The definition of the integral is still being 
studied by experts (and so is the derivative, again to allow more functions). By 
contrast, the properties of the integral are used hy everybody. Therefore the next 
section turns from definition to properties, collecting the rules that are needed in 
applications. They are very straightforward. 


5.5 EXERCISES 


Read-through questions 

In J* u(f) dt = f(x) + C, the constant C equals q . Then 
at x = a the integral is b . At x = b the integral becomes 
c . The notation /(x)]* means _d_. Thus cos x]£ equals 
ft . Also [cos x + 3]p equals t which shows why 
the antiderivative includ es an a rbitrary q , Substituting 
u — 2x — 1 changes yj 2x — 1 dx into h (with limits 

on u). 

The integral J* i?(x) dx can be defined for any l func- 
tion t?(x) s even if we can*t find a simple I First the mesh- 
points x x ,x 2 ,~- divide [a, b] into subintervals of length 
Ajc* = k . The upper rectangle with base Ax* has height 
M* = \ The upper sum S is equal to m . The lower 

sum s is n The o is between s and S, As more 
meshpoints are added, S p and s q . If S and s 


approach the same r . that defines the integral. The inter- 
mediate sums S*, named after i . use rectangles of height 
u(x£). Here x£ is any point between t « and S* = u 
approaches the area. 

If v(x) = dfjdx, what constants C make 1-10 true? 

1 l b t v(x)dx=f(b) + C 

2 v(x)dx=m + C 

3 |t v(t)dt=-f(x) + C 

4 u(sin .x) cos x dx --/(sin b) + C 

5 |t '•'(') di — /{r) + C (careful) 

6 dfjdx = t?(x) + C 

7 |J (x 2 -l) 3 2xrfx = j^ 1 u 3 rfu. 
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8 J f r(f) A =/(**) + C 

9 J* — jc) dx — C (change — x to t; also dx and limits) 
10 J 2 v(x)dx = C J* v(2 t)dL 


Choose u(x) in 11-18 and change Omits, Compute the integral 
in 11-16. 


11 j l 0 {x 2 +]) l0 X<LT 
13 JJ /4 tan x 9ec z x dx 
15 J£ /4 sec 2 'x tan x dx 
17 \\dxjx (take u — 1/x) 


12 JJ /2 $in 8 x cos x dx 
14 x 2 " +1 dx (lake u = x 2 ) 

16 j‘ X dxjj TT 2 

18 JJ x 3 (l-x) 3 <fx(u=l-x) 


With A_x - ^ in 19-22, find the maximum M k and minimum 
m k and upper and lower stuns S and s. 

19 (x 2 + l) 4 dx 20 sin 2;ix dx 

21 f£x*dx 22 l x _ x xdx. 

23 Repeat 19 and 20 with Ax = £ and compare with the cor- 
rect answer, 

24 The difference S — s in 21 is the area 2 3 Ax of the far right 
rectangle. Find Ax so that S < 4.001, 

25 If u(x) is increasing fora^x^b, the difference S-sis the 

area of the rectangle minus the area of the 

rectangle. Those areas approach zero. So every increasing 
function on [a, b] is Riemann integrable. 


26 Find the Riemann sum 5* for V(x) in Remark 1, when 
Ax = 1/n and each x? is the midpoint. This S* is well-behaved 
but still V{x) is not Riemann integrable. 

27 JF(x) equals 1 at x = ±, i, , . . , and elsewhere JF(x) = 0, 

For Ax = ,01 find the upper sum 5. Is W(x) integrable? 

28 Suppose M(x) is a multistep function with jumps of i, i, 
i, ... at the points x J, .... Draw a rough graph with 
M(0) = 0 and Af(l)=*l. With Ax =i find Sand s. 

29 For M(x) in Problem 28 find the difference S - s (which 
approaches zero as Ax -►()). What is the area under the 
graph? 

30 If dfjdx = -v(x) and /( 1) = 0, explain /(*)- u(t) dt. 

31 (a) If dfjdx = + v{x) and /{ 0) = 3, find /(x). 

(b) If dfjdx = + v(x) and f(3) = 0, find /(x), 

32 In your own words define the integral of u(x) from a to b. 

33 True or false , with reason or example. 

(a) Every continuous u(x) has an antiderivative f(x). 

(b) If v(x) is not continuous, S and s approach different 
limits. 

(c) If S and $ approach A as Ax 0, then all Riemann 
sums S* in equation (9) also approach A . 

(d) If Vi (x) + v 2 (x) — tj 3 {x)> their upper sums satisfy 
Si 4 - S 2 — S3. 

(e) If [?!(x) + i? 2 M = i> 3 {x), their Riemann sums at the 
midpoints xjf satisfy Sf -h S J = S J . 

(f) The midpoint sum is the average of S and $. 

(g) One x? in Figure 5.10 gives the exact area. 


5.6 Properties of the Integral and Average Value 

The previous section reached the definition of J* u(x) dx. But the subject cannot stop 
there. The integral was defined in order to be used. Its properties are important, and 
its applications are even more important. The definition was chosen so that the 
integral has properties that make the applications possible. 

One direct application is to tbe average value of v(x). The average of n numbers is 
clear, and the integral extends that idea — it produces the average of a whole contin- 
uum of numbers u(x). This develops from the last rule in the following list (Property 
7), We now collect together seven basic properties of definite integrals. 

The addition rule for J [u(x) + w(x)] dx will not be repeated — even though this 
property of linearity is the most fundamental. We start instead with a different kind 
of addition. There is only one function v(x), but now there are two intervals. 

The integral from a to b is added to its neighbor from b to c. Their sum is the integral 
from a to c. That is the first (not surprising) property in the list. 
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Property 1 Areas over neighboring intervals add to the area over the combined 
interval: 

v(x) dx + f 6 v(x) dx = J* i>(x) dx. (1) 

This sum of areas is graphically obvious (Figure 5.1 la). It also comes from the formal 
definition of the integral. Rectangular areas obey (1) — with a meshpoint at x = b to 
make sure. When Ax max approaches zero, their limits also obey (1). All the normal 
rules for rectangular areas are obeyed in the limit by integrals. 

Property 1 is worth pursuing. It indicates how to define the integral when a = b. 
The integral “ from b to b” is the area over a point, which we expect to be zero. It is. 

Property 2 t>(x) dx = 0. 

That comes from Property 1 when c = b. Equation (1) has two identical integrals, so 
the one from b to b must be zero. Next we see what happens if c = a — which makes 
the second integral go from b to a. 

What happens when an integral goes backward ? The “lower limit” is now the larger 
number b. The “upper limit” a is smaller. Going backward reverses the sign: 

Property 3 j a b v(x) dx = - tf(x) dx =f(a) - /(b ). 

Proof When c = a the right side of (1) is zero. Then the integrals on the left side 
must cancel, which is Property 3. In going from b to a the steps Ax are negative. That 
justifies a minus sign on the rectangular areas, and a minus sign on the integral 
(Figure 5.1 lb). Conclusion: Property 1 holds for any ordering of a, b, c. 



r° , x 3 

r 

c 2 dt 

EXAMPLES 

t 2 dt = - — 

dt=- 1 

7 = 0 


)x 3 

ji 

J2 ' 


Property 4 For odd functions \ a _ a t;(x) dx- 0. ‘Odd" means that u(-x)= - t?(x). 
For even functions \ a _ a u(x) dx = 2 v(x) dx. “ Even ’ means that v(-x) = 4- v(x). 

The functions x, x\ x 5 , ... are odd. If x changes sign, these powers change sign. The 
functions sin x and tan x are also odd, together with their inverses. This is an impor- 
tant family of functions, and the integral of an odd function from -a to a equals zero. 
Areas cancel: 

r- 6 * 5 dx = * 6 ] a -a = a 6 -(- a) 6 = 0. 

If p(x) is odd then /(x) is even! All powers l,x 2 , x 4 , ... are even functions. Curious 
fact : Odd function times even function is odd , but odd number times even number is 
even. 

For even functions, areas add: cos x dx = sin a - sin(- a) = 2 sin a. 
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The next properties involve inequalities. If u(x) is positive, the area under its graph 
is positive (not surprising). Now we have a proof: The lower sums s are positive and 
they increase toward the area integral. So the integral is positive: 

Property 5 If u(x) > 0 for a < x < b then j 4 v(x) dx > 0. 

A positive velocity means a positive distance. A positive v lies above a positive area. 
A more general statement is true. Suppose u(x) stays between a lower function /(x) 
and an upper function u(x). Then the rectangles for v stay between the rectangles for l 

and u. In the limit, the area under v (Figure 5.12) is between the areas under J and u: 

Property 6 If /(x) ^ i'(x) ^ u(x) for u^x^b then 

fa j(x) dx 5$ J 4 u(x) dx < £ m(x) dx. (2) 

EXAMPLE 1 cos ( ^ 1 => Jq cos ( dt ^ 1 dt => sin x $ x 

EXAMPLE 2 1 ^ sec 2 f => 1 dr ^ sec 2 f dt => x ^ tan x 


EXAMPLE 3 Integrating z ^ 1 leads to tan~ 1 x ^ x. 

1 + x'* 


Ail those examples are for x > 0. You may remember that Section 2.4 used geometry 
to prove sin h<h< tan h. Examples 1-2 seem to give new and shorter proofs. But I 
think the reasoning is doubtful. The inequalities were needed to compute the deriva- 
tives (therefore the integrals) in the first place. 



c c 

Fig. 5.12 Properties 5-7: r above zero, v between / and u, average value (+ balances — ). 


Property 7 (Mean Value Theorem for Integrals) If u(x) is continuous, there is a 
point c between a and b where u(c) equals the average value of i>(x): 

1 f 4 

u(c) = t— — u(x) dx = "average value of i'(x).” (3) 

b~ a j a 

This is the same as the ordinary Mean Value Theorem (for the derivative of f(x)): 

f'(c) — = “average slope of /.’’ (4) 

b — a 


With f‘ = v, (3) and (4) are the same equation. But honesty makes me admit to a flaw 
in the logic. We need the Fundamental Theorem of Calculus to guarantee that 
f(x) = J* t>(f) dt really gives f = v. 

A direct proof of (3) places one rectangle across the interval trom a to b. Now raise 
the top of that rectangle, starting at u min (the bottom of the curve) and moving up to 
’’man (the top of the curve). At some height the area will be just right — equal to the 
area under the curve. Then the rectangular area, which is ( b - a ) times u(c), equals 
the curved area J 4 i>(x) dx. This is equation (3). 
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Fig. 5.13 Mean Value Theorem for integrals: area/(/? — a) — average height = i?(c) at some c\ 


That direct proof uses the intermediate value theorem : A continuous function u(x) 
takes on every height between u mjri and At some point {at two points in 

Figure 5.12c) the function u(x) equals its own average value. 

Figure 5 A3 shows equal areas above and below the average height t?{c) = v Avi . 


EXAMPLE 4 The average value of an odd function is zero {between — 1 and 1): 


1 

2 


xdx= — 

4 


1 1 A 

rr° 


note 


b-a 2 


For once we know c. It is the center point x = 0, where t;(c) = y avc = 0. 


EXAMPLE 5 The average value of x 2 is 3 (between 1 and - 1): 


1 r 1 

2 .- 




1 

6 


I 

3 


1 

note — 

b-a 


Where does this function x 2 equal its average value 3? That happens when c 1 = so 
c can be either of the points 1/^/5 and —ljy/i in Figure 5,13b, Those are the Gauss 
points, which are terrific for numerical integration as Section 5.8 will show. 


EXAMPLE 6 The average value of sin 2 x over a period (zero to 71) is j: 

( 1 1 
note = - 

\ b — a n 

The point c is ti/ 4 or 3^/4, where sin 2 c = j> The graph of sin 2 x oscillates around its 
average value \ . See the figure or the formula: 

sin 2 x = j — j cos 2x. (5) 

The steady term is the oscillation is - \ cos 2x. The integral is fix) = sin 2x, 

which is the same as ^x - j sin x cos x> This integral oj sin 2 x will be seen again , Please 
verify that df jdx — sin 2 x. 


sin 2 x dx = 


x — sin x cos x 


2n 


THE AVERAGE VALUE AND EXPECTED VALUE 


The "average value” from a to b is the integral divided by the length b — a< This 
was computed for x and x 2 and sin 2 x, but not explained. It is a major application 
of the integral, and it is guided by the ordinary average of n numbers: 


^ave 1 

b — a 


r(x) dx comes from 


v» v t = ~( v i + V 2 + ”■ + t'«)- 
n 


Integration is parallel to summation] Sums approach integrals. Discrete averages 
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approach continuous averages. The average of $ is The average of f , 3, 

$ i$ 3. The average of n numbers from 1/n to njn is 


1 / 1,2 ^ n \ n+1 

n\n n nj 2 n 


( 7 ) 


The middle term gives the average, when n is odd. Or we can do the addition. As 
it — ► 00 the sum approaches an integral (do you see the rectangles?). The ordinary 
average of numbers becomes the continuous average of v(x) = x: 


n+ 1 1 

2n 2 


and 



( 


1 

note ~ 

D — Q 


*) 


In ordinary language: "The average value of the numbers between 0 and 1 is 4 .” Since 
a whole continuum of numbers lies between 0 and 1, that statement is meaningless 
until we have integration* 

The average value of the squares of those numbers is {x 2 ) m = { x 2 dx/(b — a) — 

If you pick a number randomly between 0 and 1 , its expected value is \ and its expected 
square is 

To me that sentence is a puzzle* First* we don’t expect the number to be exactly 
i — so we need to define "expected value*” Second, if the expected value is why is 
the expected square equal to 4 instead of £? The ideas come from probability theory, 
and calculus is leading us to continuous probability. We introduce it briefly here, and 
come back to it in Chapter 8* 


PREDICTABLE AVERAGES ROM RANDOM EVENTS 


Suppose you throw a pair of dice* The outcome is not predictable. Otherwise why 
throw them? But the average over more and more throws is totally predictable. We 
don’t know what will happen, but we know its probability* 

For dice, we arc adding two numbers between 1 and 6* The outcome is between 2 
and 12* The probability of 2 is the chance of two ones: (l/6)(l/6) = 1/36* Beside each 
outcome we can write its probability: 


\36/ ^\36/ \36/ \36/ ”\36/ \36/ \36/ \36/ 1 °l v 36/ X1 ^36/ 12 ^36/ 


To repeat, one roll is unpredictable* Only the probabilities are known, and they add 
to 1* (Those fractions add to 36/36; all possibilities are covered.) The total from a 
million rolls is even more unpredictable — it can be anywhere between 2,000,000 and 
12,000,000. Nevertheless the average of those million outcomes is almost completely 
predictable* This expected value is found by adding the products in that line above: 

Expected value : multiply ( outcome ) times (probability of outcome ) and add: 

_2_ _6 12 20 30 42 40 36302212 . 

36 36 36 + 36 36 + 36 36 + 36 36 36 36 _ * 


IT you throw the dice 1000 times, and the average is not between 6*9 and 7*1, you get 
an A* Use the random number generator on a computer and round oft to integers. 

Now comes continuous probability * Suppose all numbers between 2 and 12 are 
equally probable* This means all numbers — not just integers. What is the probability 
of hitting the particular number x — n ^ It is zero\ By any reasonable measure, n has 
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no chance to occur. In the continuous case, every x has probability zero. But an 
interval of x’s has a nonzero probability: 


the probability of an outcome between 2 and 3 is 1/10 

the probability of an outcome between x and x + Ax is Ajc/10 


To find the average, add up each outcome times the probability of that outcome* 
First divide 2 to 12 into intervals of length Ax = 1 and probability p= 1/10. If we 
round off the average is 63 : 



Here all outcomes are integers (as with dice). It is more accurate to use 20 intervals 
of length 1/2 and probability 1/20. The average is and you see what is coming. 
These are rectangular areas (Riemann sums). As Ax -+ 0 they approach an integral. 
The probability of an outcome between x and x + dx is p(x) dx s and this problem has 
p(x) = 1/10. The average outcome in the continuous case is not a sum but an integral : 


* 12 

expected value E(jt) = 


xp(x) dx = 




X* 

20 


= 7 . 


That is a big jump. From the point of view of integration, it is a limit of sums. From 
the point of view of probability, the chance of each outcome is zero but the probability 
density at * is p(x) = 1/10. The integral of p(x) is 1, because some outcome must 
happen. The integral of xp(*) is the expected value. Each choice of x is 

random, but the average is predictable. 

This completes a first step in probability theory. The second step comes after more 
calculus. Decaying probabilities use e~ x and e ~* 2 — then the chance of a large x is 
very small. Here we end with the expected values of x n and Ify/x and l/x, for a 
random choice between 0 and 1 (so p(x) = 1 ): 



A CONFUSION ABOUT "EXPECTED" CLASS SIZE 


A college can advertise an average class size of 29, while most students are in large 
classes most of the time, I will show quickly how that happens. 

Suppose there are 95 classes of 20 students and 5 classes of 200 students. The total 
enrollment in 100 classes is 1900+ 1000 = 2900. A random professor has expected 
class size 29* But a random student sees it differently. The probability is 1900/2900 
of being in a small class and 1000/2900 of being in a large class. Adding class size 
times probability gives the expected class size for the student: 


( 20 ) 


1900 

2900 


+ ( 200 ) 


1000 

2900 


= 82 students in the class. 


Similarly, the average waiting time at a restaurant seems like 40 minutes (to the 
customer). To the hostess, who averages over the whole day, it is 10 minutes. If you 
came at a random time it would be 10, but if you are a random customer it is 40. 

Traffic problems could be eliminated by raising the average number of people per 
car to 2.5, or even 2. But that is virtually impossible. Part of the problem is the 
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difference between (a) the percentage of cars with one person and (b) the percentage 
of people alone in a car. Percentage (b) is smaller. In practice, most people would be 
in crowded cars. See Problems 37-38* 


5.6 EXERCISES 


Read-through questions 

The integrals J* u(x) dx and i?{*) dx add to g « The 
integral u(x) dx equals *> , The reason is c » If 

u(x) ^ x then u(x) dx ^ * The average value of u(x) on 

the interval 1 ^ x ^ 9 is defined by • . It is equal to u(c) 

at a point x = c which is * The rectangle across this 
interval with height u(c) has the same area as 0 . The 

average value of o(x) = x + 1 on the interval is 

h 

If x is chosen from 1, 3, 5, 7 with equal probabilities i, its 
expected value (average) is 1 The expected value of x l 
is I If x is chosen from 1, 2, ***, 8 with probabilities 
its expected value is fc . If x is chosen from 1 ^ x ^ 9, the 
chance of hitting an integer is * The chance of falling 
between x and x + dx is p(x) dx = m . The expected value 
£(x) is the integral n * It equals Q , 

In 1-6 find the average value of u(x) between a and b, and find 
all points c where u ave = i?(c). 

1 u = x 4 , a = — 1, b = [ 2 v — x 5 ) a — —K b = l 

3 v = cos 1 x, a = 0, b = 7T 4 v = ^/x, a — 0, b = 4 

5 v = l/x 2 , a = 1 T b = 2 6 v = (sin x) 9 , a = —tt, b = tt. 

7 At x = 8, F(x) = |* v(t) dt 4- J* u{t) dt is * 

8 Jj x dx + x dx — x dx = ^ 

Are 9-16 true or false? Give a reason or an example. 

9 The minimum of |* u(t) dt is at x = 4. 

10 The value of J* +3 u(f) dt does not depend on x. 

1 1 The average value from x = 0 to x — 3 equals 

i(u ave on G ^ x ^ 1) + on 1 < x <3). 

12 The ratio (f{b) - f(a))l(b - a) is the average value of /(x) 
on a ^ x ^ b. 

13 On the symmetric interval — 1 < x ^ 1, u(x) — u tve is an 
odd function. 

14 If t(x) ^ r(x) ^ u(x) then dl/dx < dvjdx < dujdx * 

15 The average of u(x) from 0 to 2 plus the average from 2 
to 4 equals the average from 0 to 4. 

16 (a) Antiderivatives of even functions are odd functions, 
(b) Squares of odd functions are odd functions* 


17 What number v gives || \ (y(x) — y) dx = 0? 

18 If /(2) = 6 and /(6) = 2 then the average of df/dx from 

x = 2 to x = 6 is 

19 (a) The averages of cos x and |cos x| from 0 to n are 


(b) The average of the numbers 1 ^ , ..., v n is than 

the average of \v^ |, ... , |u n |. 

20 (a) Which property of integrals proves i?(x) dx ^ 
|J |u(x)| dx? 

(b) Which property proves — |J u(x) dx \v{x)\ dx? 
Together these are Property 8: |JJ u(x) dx| [u(x)| dx. 

21 What function has u 4VC (from 0 to x) equal to £ u(x) at all 
x? What functions have = i?(x) at all x? 

22 (a) If u(x) is increasing, explain from Property 6 why 
|p u(f) dt ^ xi?(x) for x > 0. 

(b) Take derivatives of both sides for a second proof. 

23 The average of t>(x)= 1/(1 -\-x 2 } on the interval [0 , b] 

approaches as b -> 00 * The average of K(x) = 

x 2 /(l -h x 1 } approaches * 

24 If the positive numbers v n approach zero as n -+ 00 prove 
that their average (uj + — + u„)/n also approaches zero. 

25 Find the average distance from x = a to points in the 
interval 0 < x < 2. Is the formula different if a < 21 

26 (Computer experiment) Choose random numbers x 
between 0 and 1 until the average value of x 2 is between .333 
and .334* How many values of x 2 are above and below? If 
possible repeat ten times* 

27 A point P is chosen randomly along a semicircle (see 
figure: equal probability for equal arcs). What is the 
average distance y from the x axis? The radius is 1, 

28 A point Q is chosen randomly between —1 and 1. 

(a) What is the average distance Y up to the semicircle? 

(b) Why is this different from Problem 27? 


Button needle 
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29 (A classic way to compute n) A 2" needle is tossed onto 
a floor with boards 2" wide. Find the probability of falling 
across a crack. (This happens when cos 6 > y = distance from 
midpoint of needle to nearest crack. In the rectangle 
0 < 6 ^ ti / 2 , 0 ^ y < 1 , shade the pari where cos 9 > y and find 
the fraction of area that is shaded,) 

30 If Buffo n T s needle has length 2x instead of 2, find the 
probability P(x) of falling across the same cracks, 

31 If you roll three dice at once, what are the probabilities of 
each outcome between 3 and 18? What is the expected value? 

32 If you choose a random point in the square 0<x< 1, 
0 ^ y < 1, what is the chance that its coordinates have y 1 ^ x? 

33 The voltage K(t) = 220 cos 2nt(6 0 has frequency 60 hertz 
and amplitude 220 volts. Find K B¥C from 0 to L 

34 (a) Show that i? evcn (x) = i(u(x) + is always even, 

(b) Show that iwM = is always odd. 

35 By Problem 34 or otherwise, write (x + if and l/(x + 1) 
as an even function plus an odd function. 

36 Prove from the definition of df jdx that it is an odd func- 
tion if /(x) is even. 


37 Suppose four classes have 6, 8, 10, and 40 students, averag- 
ing . The chance of being in the first class is 

, The expected class size (for the student) is 

£M - 6 (s) +8 (s) +10 (s) + «(s)- 

38 With groups of sizes x u ..., x n adding to G, the average 

size is . The chance of an individual belonging to 

group 1 is . The expected size of his or her group is 

E{x) = x,{xJG)+ - + x n (x„/G),* Prove Zj xf/G>G/n. 


39 True or false, 15 seconds each: 

(a) If /(x) ^ g(x) then df/dx < dgjdx. 

(b) If dfldx dg/dx then f{x) £ g(x). 

(c) xu(x) is odd if i?(x) is even, 

(d) If < w ave on all intervals then i?(x) < w(x) at all 
points. 


40 


If u(x) = 


2x for x < 3 
— 2x for x > 3 


then f{x) = 


x 2 for x < 3 
- x 2 for x > 3 ' 


Thus J* u{x) dx = /(4) — /{ 0) — —16. Correct the mistake, 
41 If u(x) = \ x — 2] find /(x). Compute j* u(x) dx. 


42 Why are there equal areas above and below u lve ? 



5.7 The Fundamental Theorem and Its Applications 



When the endpoints are fixed at a and b , we have a definite integral. When the upper 
limit is a variable point x, we have an indefinite integral. More generally: When the 
endpoints depend in any way on x, the integral is a function of x. Therefore we can 
find its derivative. This requires the Fundamental Theorem of Calculus. 

The essence of the Theorem is: Derivative of integral of v equals u. We also compute 
the derivative when the integral goes from <j{x) to fr(x) — both limits variable. 

Part 2 of the Fundamental Theorem reverses the order: Integral of derivative of f 
equals /+ C. That will follow quickly from Part 1, with help from the Mean Value 
Theorem. It is Part 2 that we use most, since integrals are harder than derivatives. 
After the proofs we go to new applications, beyond the standard problem of area 
under a curve. Integrals can add up rings and triangles and shells — not just rectangles. 
The answer can be a volume or a probability — not just an area, 

THE FUNDAMENTAL THEOREM, PART 1 

Start with a continuous function u. Integrate it from a fixed point a to a variable 
point x. For each x, this integral /(x) is a number. We do not require or expect a 
formula for /(x) — it is the area out to the point x. It is a function of x! The Fundamen- 
tal Theorem says that this area function has a derivative (another limiting process). 
The derivative df/dx equals the original i?(x). 
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5C (Fundamental Theorem, Part 1) Suppose v{x) is a continuous function: 
If = then df!dx = v(x). 


The dummy variable is written as f, so we can concentrate on the limits* The value 
of the integral depends on the limits a and x, not on r* 

To find df jdx , start with Af=f(x + Ax) — /(x) = difference of areas: 

A f = dt - & ^(0 dt = i: +Ai »(t) (i) 

Officially, this is Property 1. The area out to x + Ax minus the area out to x equals 
the small part from x to x + Ax, Now divide by Ax: 


A£ 

Ax 



[?{f) dt = average value = u(c). 


( 2 ) 


This is Property 7, the Mean Value Theorem for integrals* The average value on this 
short interval equals u(c), This point c is somewhere between x and x + Ax (exact 
position not known), and we let Ax approach zero. That squeezes c toward x, so v{c) 
approaches u(x) — remember that v is continuous. The limit of equation (2) is the 
Fundamental Theorem: 


Ax 


— and u(c) u(x) so 
dx 



( 3 ) 


If Ax is negative the reasoning still holds. Why assume that u(x) is continuous? 
Because if v is a step function, then f(x) has a comer where dfjdx is not v(x). 

We could skip the Mean Value Theorem and simply bound v above and below: 

for r between x and x + Ax: u min < v{t) < u max 

integrate over that interval: o min Ax^ A f ^u majl Ax (4) 

divide by Ax: ^ Af /Ax r maj[ 

As Ax -► 0, v min and v m&% approach v(x). In the limit df jdx again equals v(x). 


area under u{x) 

= increase in f{.\) 


area ™ 
u(x)Ax 

X X + Ax 

Hg* 5*14 Fundamental Theorem, Part 1: (thin area A/)/(base length Ax) -► height u(x). 

Graphical meaning The /-graph gives the area under the u-graph* The thin strip in 
Figure 5*14 has area A/, That area h approximately u(x) times Ax , Dividing by its 
base, Af /Ax is close to the height v(x). When Ax -► 0 and the strip becomes infinitely 
thin, the expression “close to” converges to “equals.” Then df jdx is the height at u(x). 

DERIVATIVES WITH VARIABLE ENDPOINTS 

When the upper limit is x, the derivative is v(x)* Suppose the lower limit is x. The 
integral goes from x to b, instead of a to x. When x moves, the lower limit moves* 
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The change in area is on the left side of Figure 5.15. As x goes forward , area is removed. 
So there is a minus sign in the derivative of area: 

n d g 

The derivative of g(x) = t?(t) dt is — = - u(x). (5) 

Jx dx 

The quickest proof is to reverse b and x, which reverses the sign (Property 3): 

g{x) = - j r(t) dt so by Part 1 ~ = - r(x). 

Jb dx 



Rg*5 *15 Area from x to b has dgjdx = — l<x), Area v(b)db is added, area v{a)da is lost 


The general case is messier but not much harder (it is quite useful). Suppose both 
limits are changing. The upper limit 5(x) is not necessarily x, but it depends on x 
The lower limit a(x) can also depend on x (Figure 5.15b), The area A between those 
limits changes as * changes, and we want dAfdx: 


If 



v{t) dr 


then 


dA 

dx 


dh da 

= u(b(x))--v(a(x))-. 
dx dx 


( 6 ) 


The figure shows two thin strips, one added to the area and one subtracted. 

First check the two cases we know. When a = 0 and b = x, we have dajdx = 0 and 
db/dx = L The derivative according to (6) is u(x) times 1 — the Fundamental Theorem. 
The other case has a~ x and b = constant. Then the lower limit in (6) produces — r(x). 
When the integral goes from a = 2x to b = x 3 , its derivative is new: 


EXAMPLE 1 A = \* 2 X cos f dt = sin x 3 - sin 2x 
dAjdx = (cos x 3 ){3x 2 ) - (cos 2x)(2). 

That fits with (6), because dbjdx is 3x 2 and dajdx is 2 (with minus sign)* It also looks 
like the chain rule — which it is! To prove (6) we use the letters v and / : 

rnx) 

A = u(£) dt = f(b{x}) -f(a(x)) (by Part 2 below) 

JflU) 


dA dh da 

- r =f(b{x))--r{aix)) T 
dx dx ax 


(by the chain rule) 


Since /' = &, equation (6) is proved. In the next example the area turns out to be 
constant, although it seems to depend on x. Note that t'(f)= 1/f so y(3x)~ I/3x. 


EXAMPLE 2 A - I d , has £ - (£) (3) - (±) (2) = 0. 
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Question 


A = 


v 

v(t) dt has 

- X 


dA 

dx 


= t/x) 4- u(— x). Why does v(— x) have a plus sign ? 


THE FUNDAMENTAL THEOREM, PART 2 


We have used a hundred times the Theorem that is now to be proved. It is the key 
to integration. “ The integral of dfjdx is fix) C.” The application starts with v(x\. 
We search for an f[x) with this derivative. If dfjdx = t{x), the Theorem says that 


u(x) dx = 


'*L 

dx 


dx =f{x) + C. 


We can't rely on knowing formulas for v and / only the definitions of j and djdx. 

The proof rests on one extremely special case: dfjdx is the zero function t We easily 
find /(*) = constant. The problem is to prove that there are no other possibilities: / 
must be constant. When the slope is zero, the graph must be flat. Everybody knows 
this is true, but intuition is not the same as proof. 

Assume that dfjdx = 0 in an interval. If f(x) is not constant, there are points where 
f(a) =£f(&)+ By the Mean Value Theorem, there is a point c where 

/'(c) = (this is not zero because f(a}^f(b)). 


But /'(c) # 0 directly contradicts dfjdx = 0. Therefore f(x) must be constant. 

Note the crucial role of the Mean Value Theorem. A local hypothesis (dfjdx ™ 0 
at each point) yields a global conclusion {/= constant in the whole interval). The 
derivative narrows the field of view, the integral widens it. The Mean Value Theorem 
connects instantaneous to average, local to global, points to intervals. This special 
case (the zero function) applies when /t(x) and f(x) have the same derivative: 

If dAjdx = dfjdx on an interval then /!(*) =/(*) + C, (7) 

Reason: The derivative of /l(x) ~ fix) is zero. So j4(.v) - fix) must be constant. 

Now comes the big theorem. It assumes that y(x) is continuous, and integrates 
using fix): 


5D 


df 

(Fundamental Theorem, Part 2) If u(jc) — — then 

dx 


v(x) dx=m-f(a). 

a 


Proof The antiderivative is /(.*). But Part l gave another antiderivative for the same 
e--(.v). It was the integral — constructed from rectangles and now called 


A{x) = 




u(r ) dt 


also has 


dA 

dx 




Since A = v and /' = t\ the special case in equation (7} states that A(x] =/(x) 4 C. 
That is the essential point: The integral from rectangles equals f(x) 4 C\ 

At the lower limit the area integral is A — 0. So f(a) 4 C = 0. At the upper limit 
fib) + C = Aib}< Subtract to find /1(b), the definite integral: 

Mb) = j* L-(.v) dx =f{b) -f{a). 

Calculus is beautiful its Fundamental Theorem is also its most useful theorem. 
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Another proof of Part 2 starts with /' = v and looks at subintervals: 

f(xi)-f(a)~ i?(xf)(Xi — a ) (by the Mean Value Theorem) 

/(x 2 )-/(xi)= v{xf)(x 2 -x x ) (by the Mean Value Theorem) 

f(b) } - v(x* )(h - x n _ j ) (by the Mean Value Theorem). 

The left sides add to f(b ) -/(«}♦ The sum on the right, as Ax -► 0, is j* v(x) dx, 

APPLICATIONS OF INTEGRATION 

Up to now the integral has been the area under a curve. There are many other 
applications, quite different from areas. Whenever addition becomes ^continuous ^ we 
have integrals instead of sums. Chapter 8 has space to develop more applications, but 
four examples can be given immediately— which will make the point. 

We stay with geometric problems, rather than launching into physics or engineering 
or biology or economics. All those will come. The goal here is to take a first step 
away from rectangles. 

EXAMPLE 3 (for circles) The area A and circumference C are related by dAjdr— C. 

The question is why. The area is Trr 2 , Its derivative 2 nr is the circumference. By the 
Fundamental Theorem, the integral of C is A. What is missing is the geometrical 
reason. Certainly nr 2 is the integral of 2 nr, but what is the real explanation for A = 
J C(r) drl 

My point is that the pieces are not rectangles. We could squeeze rectangles under 
a circular curve, but their heights would have nothing to do with C. Our intuition 
has to take a completely different direction, and add up the thin rings in Figure 5,16. 



Fig. 5.16 Area of circle = integral over rings. Volume of sphere = integral over shells. 


Suppose the ring thickness is Ar. Then the ring area is close to C times Ar. This is 
precisely the kind of approximation we need, because its error is of higher order (Ar) 2 . 
The integral adds ring areas just as it added rectangular areas: 


A = 


C dr 


-r 


2nr dr — nr 2 . 


That is our first step toward freedom, away from rectangles to rings. 
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The ring area &.A can be checked exactly— it is the difference of circles: 

AA = K(r + Ar) 2 - nr 2 — 2nr A r + jt(Ar) 2 , 

This is CAr plus a correction. Dividing both sides by Ar 0 leaves dAjdr = C 
Finally there is a geometrical reason. The ring unwinds into a thin strip. Its width 
is Ar and its length is close to C. The inside and outside circles have different perime- 
ters, so this is not a true rectangle — but the area is near CAr, 

EXAMPLE 4 For a sphere, surface area and volume satisfy A = dV/dr. 

What worked for circles will work for spheres. The thin rings become thin shells, A 
shell goes from radius r to radius r + Ar, so its thickness is Ar. We want the volume 
of the shell, but we don't need it exactly. The surface area is 4jrr\ so the volume is 
about 4jrr 2 Ar, That is dose enough! 

Again we are correct except for (Ar) 2 . Infinitesimally speaking dV ^ A dr\ 

V=\ Adr=\ 4nr 2 dr = \ nr\ 

Jo Jo 3 

This is the volume of a sphere. The derivative of V is 4, and the shells explain why. 
Main point: Integration is not restricted to rectangles . 

EXAMPLE 5 The distance around a square is As, Why does the area have dAjds = 2s? 

The side is s and the area is s 1 . Its derivative 2s goes only half way around the square , 
I tried to understand that by drawing a figure. Normally this works, but in the figure 
dAjds looks like 4s. Something is wrong. The bell is ringing so I leave this as an 
exercise. 


EXAMPLE 6 Find the area under u(x) = cos 1 x from x = 0 to x = 1. 

That is a conventional problem, but we have no antiderivative for cos^ 1 x. We could 
look harder, and find one. However there is another solution — unconventional but 
correct. The region can be Jilted with horizontal rectangles (not vertical rectangles). 
Figure 5.17b shows a typical strip of length x = cos v (the curve has v = cos” 1 x). As 
the thickness Av approaches zero, the total area becomes j x dv. We are integrating 
upward, so the limits are on v not on x : 

area = 2 cos v dv^ sin t?]* 2 = L 

The exercises ask you to set up other integrals — not always with rectangles. Archi- 
medes used triangles instead of rings to find the area of a circle. 


r ^ 

; s 

r; i s s 

1 s 


AA « 4sAsl 




Fig. 5.17 Trouble with a square. Success with horizontal strips and triangles. 
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5.7 EXERCISES 


Read-through questions 

The area /(x) = £ v(t) dt is a function of a . By Part 1 of 
the Fundamental Theorem, its derivative is b . In the 
proof, a small change Ax produces the area of a thin c . 
This area A / is approximately d times e . So the 
derivative of t 2 dt is t 

The integral f b x t 2 dt has derivative q . The minus sign 
is because h . When both limits a(x) and b(x) depend on 
x, the formula for df/dx becomes i minus I In the 
example t dt , the derivative is * 

By Part 2 of the Fundamental Theorem, the integral of 
df/dx is i . In the special case when df/dx = 0, this says 
that m . From this special case we conclude: If dA/dx — 
dB/dx then A(x) = n . If an antiderivative of 1/x is In x 
(whatever that is), then automatically J* dx/x = o . 

The square O^x^s, 0 ^ y ^ s has area A = p . If s 
is increased by As, the extra area has the shape of g . 
That area A A is approximately r . So dA/ds = s 


Find the derivatives of the following functions F(x). 
1 Jj cos 2 1 dt 2 J* cos 3 1 dt 

3 Jo fdt 4 Jo x"dt 

5 J* 1 u 3 du 6 J* /3 u(u) du 

7ir 

.if 

* Jo 

l C x 1 C x + 2 

9 - sin 2 tdt 10 - t 3 dt 

x Jo 2j x 

ii jafo mdu]dt 

13 Jo v(t) dt+\\ v{t) dt 
15 J_ x sin t 2 dt 
17 J* u(t)v(t) dt 

fill 

19 J. 


i>(f) dt (a “ running average" of v) 

v(t) dt (the average of v\ use product rule) 


12 J* (df/dt) 2 dt 

14 f'o tK-0 dt 
16 J* x sin t dt 

•8 IS** 


sin 1 1 dt 


20 


ru 

Jo dt 


21 True or false 

(a) If df /dx = dg/dx then f(x) = g(x). 

(b) If d 2 f/dx 2 = d 2 g/dx 2 then f(x) = g(x) 4- C. 

(c) If 3 > x then the derivative of jj t>(f) dt is — t>(x). 

(d) The derivative of J ] r(x) dx is zero. 

22 For F(x) = \ 2 * sin t dt , locate F(n -f Ax) — F(7r) on a sine 
graph. Where is F( Ax) — F(0)? 

23 Find the function t>(x) whose average value between 0 and 
x is cos x. Start from v(t) dt = x cos x. 


24 Suppose df/dx = 2x. We know that d(x 2 )/dx = 2x. How 
do we prove that f(x ) = x 2 -f C? 

25 If \°_ x v(t) dt = \ x 0 v(t) dt (equal areas left and right of 

zero), then u(x) is an function. Take derivatives to 

prove it. 

26 Example 2 said that \\ x x dt/t does not really depend on x 
(or r!). Substitute xu for t and watch the limits on u. 

27 True or false, with reason: 

(a) All continuous functions have derivatives. 

(b) All continuous functions have antiderivatives. 

(c) All antiderivatives have derivatives. 

(d) A(x) = J 3 * dt/t 2 has dA/dx = 0. 

Find J, v(t) dt from the facts in 28-29. 

28 = v{x) 29 r 

dx J 0 x + 2 

30 What is wrong with Figure 5.17? It seems to show that 
dA = 4s ds, which would mean A = J 4s ds = 2s 2 . 

31 The cube 0^x, y, z^s has volume V — . The 

three square faces with x = s or y = s or z = s have total area 

A — . If s is increased by As, the extra volume has 

the shape of . That volume AF is approximately 

. So dV/ds = . 

32 The four-dimensional cube 0^x, y, z, f^s has hyper- 

volume H = . The face with x = s is really a 

. Its volume is V = . The total volume of 

the four faces with x = s, y = s, z = s, or t = s is . 

When s is increased by As, the extra hypervolume is 
A H * . So dH/ds = , 

33 The hypervolume of a four-dimensional sphere is H = 

i7r 2 r 4 . Therefore the area (volume?) of its three-dimensional 
surface x 2 + y 2 + z 2 -f t 2 = r 2 is . 

34 The area above the parabola y = x 2 from x = 0 to x = 1 
is $. Draw a figure with horizontal strips and integrate. 

35 The wedge in Figure (a) has area i r 2 d0 . One reason: It is 
a fraction d0/2n of the total area nr 2 . Another reason: It is 

close to a triangle with small base rdO and height . 

Integrating ^r 2 dO from 0 = 0 to 9 = gives the area 

of a quarter-circle. 

36 A = JJ, yjr 2 — x 2 dx is also the area of a quarter-circle. 
Show why, with a graph and thin rectangles. Calculate this 
integral by substituting x = r sin 0 and dx — r cos 0 dO. 
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37 The distance r in Figure (b) is related to 0 by r = 

Therefore the area of the thin triangle is h 2 dO = 
Integration to 0 = gives the total area j. 

38 The x and y coordinates in Figure (c) add to 

r cos 0 + r sin 6 = Without integrating explain why 

r * 2 do _ 

Jo (cos 0 -bsin 0) 2 

39 The horizontal strip at height y in Figure (d) has width dy 

and length x = * So the area up to y = 2 is 

What length are the vertical strips that give the same area? 


41 The length of the strip in Figure (e) is approximately 

. The width is , Therefore the triangle has 

area jj da (do you get £?), 

42 The area of the ellipse in Figure (f) is Inr 2 . Its derivative 
is 4 nr. But this is not the correct perimeter. Where does the 
usual reasoning go wrong? 

43 The derivative of the integral of u(x) is d(x), What is the 
corresponding statement for sums and differences of the num- 
bers Uj? Prove that statement. 


40 Use thin rings to find the area between the circles r = 2 
and r = 3. Draw a picture to show why thin rectangles would 
be extra difficult. 


2 

dy 

[ 


mmu 




S' 

= ^7 


(d) 


12 3 4 



44 The integral of the derivative of f(x) is /(x) + C What is 
the corresponding statement for sums of differences of _/}? 
Prove that statement. 

45 Does d 2 f/dx 2 = a(x) lead to (J£ a(f) dt) dx =/(l) — /( 0)? 

46 The mountain y= — x 2 + r has an area A(t) above the x 
axis. As t increases so does the area. Draw an xy graph of the 
mountain at t = 1. What line gives dAjdtl Show with words 
or derivatives that d 2 Aldt 2 > 0. 



5.8 Numerical Integration 



This section concentrates on definite integrals. The inputs are y(x) and two endpoints 
a and b , The output is the integral /. Our goal is to find that number 
fa yi x ) d* = U accurately and in a short time. Normally this goal is achievable — as 
soon as we have a good method for computing integrals. 

Our two approaches so far have weaknesses. The search for an antiderivative 
succeeds in important cases, and Chapter 7 extends that range — but generally /(x) 
is not available. The other approach (by rectangles) is in the right direction but too 
crude. The height is set by y(x) at the right and left end of each small interval. The 
right and left rectangle rules add the areas {Ax times y ): 

R tt = (Ax)(y i + y 2 + - + y a ) and L n = (Ax)(y 0 + y x + + y n -x). 

The value of y(x) at the end of interval / is y Jt The extreme left value y 0 = y{a) enters 
L„ . With n equal intervals of length A x = (b- a)jn, the extreme right value is y a = 
y(h). It enters Otherwise the sums are the same — simple to compute, easy to 
visualize, but very inaccurate. 

This section goes from slow methods (rectangles) to better methods (trapezoidal 
and midpoint) to good methods ( Simpson and Gauss). Each improvement cuts down 
the error. You could discover the formulas without the book, by integrating x and 
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x 2 and x 4 . The rule R n would come out on one side of the answer, and L„ would be 
on the other side. You would figure out what to do next, to come closer to the exact 
integral. The book can emphasize one key point: 


The quality of a formula depends on how many integrals 
| 1 dx , j x dx, J x 2 dx„ ..., it computes exactly . If J x p dx 
is the first to be wrong , the order of accuracy is p. 

By testing the integrals of 1, x, x 2 , ..., we decide how accurate the formulas are. 

Figure 5.18 shows the rectangle rules R n and L n . They are already wrong when 
y = x. These methods are first-order: p = 1. The errors involve the first power of 
Ax — where we would much prefer a higher power. A larger p in (Ax) p means a 
smaller error. 



Fig. 5.18 Errors E and e in R n and L n are the areas of triangles. 


When the graph of y(x) is a straight line, the integral / is known. The error triangles 
E and e have base Ax. Their heights are the differences y J+l — y } . The areas are 
3 (base)(height), and the only difference is a minus sign. (L is too low, so the error 
L - I is negative.) The total error in R n is the sum of the E’s: 

Kn-/ = M*(yi “>'o)+ — +iAjc(y,->'.-i) = iAx(>' ( ,-3;o)- (1) 

All y’s between y 0 and y n cancel. Similarly for the sum of the e’s: 

L n -I=- iAx(y„ - >>o) = - iAx[y(f») - y(a)]. (2) 


The greater the slope of y(x), the greater the error — since rectangles have zero slope. 

Formulas (1) and (2) are nice — but those errors are large. To integrate y = x from 
a = 0 to b = 1, the error is ?Ax(l - 0). It takes 500,000 rectangles to reduce this error 
to 1/1,000,000. This accuracy is reasonable, but that many rectangles is unacceptable. 

The beauty of the error formulas is that they are “ asymptotically correct" for all 
functions. When the graph is curved, the errors don't fit exactly into triangles. But 
the ratio of predicted error to actual error approaches 1. As Ax-»0, the graph is 
almost straight in each interval — this is linear approximation. 

The error prediction \Ax[y(b) - y(a)] is so simple that we test it on y(x) = ^/x: 



n = 

1 

10 

100 

1000 

error R n 

-/ = 

.33 

.044 

.0048 

.00049 

error L„ 

-/ = 

-.67 

-.056 

-.0052 

-.00051 


The error decreases along each row. So does Ax = .1, .01, .001, .0001. Multiplying n 
by 10 divides Ax by 10. The error is also divided by 10 (almost). The error is nearly 
proportional to Ax — typical of first-order methods. 

The predicted error is ^Ax, since here y(l) = 1 and y(0) = 0. The computed errors 
in the table come closer and closer to ^Ax = .5, .05, .005, .0005. The prediction is the 
‘leading term” in the actual error. 
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The table also shows a curious fact. Subtracting the last row from the row above 
gives exact numbers 1, .1, .01, and .001. This is (R n - I)-(L n - /), which is R n - L n . 
It comes from an extra rectangle at the right, included in R n but not L n . Its height is 
1 and its area is 1, .1, .01, .001. 

The errors in R n and L n almost cancel. The average T n = %(R n + L n ) has less error — 
it is the “trapezoidal rule." First we give the rectangle prediction two final tests: 

n = 1 n = 10 n = 100 n = 1000 

j(x 2 -x)dx: errors 1.7 • 10” 1 1.7 • 10" 3 1.7 • 10“ 5 1.7-10’ 7 

J dx/( 10 + cos 27 tx): errors — 1 • 10” 3 2-10" 14 “0” “0" 

Those errors are falling faster than Ax. For y = x 2 — x the prediction explains why: 
>•(0) equals y(l). The leading term, with y(b) minus y(a), is zero. The exact errors are 
£(Ax) 2 , dropping from 10" 1 to 10~ 3 to 10" 5 to 10“ 7 . In these examples L n is identical 
to R n (and also to 7J,), because the end rectangles are the same. We will see these 
i(Ax) 2 errors in the trapezoidal rule. 

The last row in the table is more unusual. It shows practically no error. Why do 
the rectangle rules suddenly achieve such an outstanding success? 

The reason is that y(x) = 1/(10 -l- cos 27rx) is periodic. The leading term in the error 
is zero, because y(0) = >(1). Also the next term will be zero, because y'( 0) = y'(l). Every 
power of Ax is multiplied by zero, when we integrate over a complete period. So the 
errors go to zero exponentially fast. 

Personal note I tried to integrate 1/(10 + cos 27ix) by hand and failed. Then I was 
embarrassed to discover the answer in my book on applied mathematics. The method 
was a special trick using complex numbers, which applies over an exact period. 
Finally I found the antiderivative (quite complicated) in a handbook of integrals, and 
verified the area 

THE TRAPEZOIDAL AND MIDPOINT RULES 


We move to integration formulas that are exact when y = x. They have second- 
order accuracy. The Ax error term disappears. The formulas give the correct area 
under straight lines. The predicted error is a multiple of (Ax) 2 . That multiple is found 
by testing y = x 2 — for which the answers are not exact. 

The first formula combines R n and L n . To cancel as much error as possible, take 
the average £(/?„ + L„). This yields the trapezoidal rule , which approximates 
f y(x) dx by T„: 

T H = ?R„ + ±L n = bx(±y 0 + y t + y 2 + ■■■ + y a -, + iyj. (3) 

Another way to find T n is from the area of the “trapezoid" below y = x in Figure 5. 1 9a. 



e=-L(Ax ) 2 o>; +1 -y;) f=-yi 



M 


A.v 


j+\ 


Fig. 5.19 Second-order accuracy: The error prediction is based on v = x 2 . 
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The base is Ax and the sides have heights yj-i and y } . Adding those areas gives 
i(L„ + R„) in formula (3) — the coefficients of y i combine into i + i = 1. Only the first 
and last intervals are missing a neighbor, so the rule has i y 0 and %y n . Because 
trapezoids (unlike rectangles) fit under a sloping line, T n is exact when y = x. 

What is the difference from rectangles? The trapezoidal rule gives weight £Ax to 
y 0 and y„. The rectangle rule R„ gives full weight Ax to y„ (and no weight to y 0 ). 

- T h is exactly the leading error £y„ - iy 0 . The change to T n knocks out that error. 

Another important formula is exact for y(x) = x. A rectangle has the same area as 
a trapezoid, if the height of the rectangle is halfway between y>_ 1 and y } . On a straight 
line graph that is achieved at the midpoint of the interval. By evaluating y(x) at the 

halfway points ^Ax, $Ax, f Ax we get much better rectangles. This leads to the 

midpoint rule M n : 

Af„ = Ax(y 1/2 + y 3/2 + + y„_ 1/2 ). . (4) 

For j* x dx, trapezoids give ^(0) + 1 + 2 + 3 + ^(4) = 8. The midpoint rule gives 
i + | + 4 + j = 8, again correct. The rules become different when y = x 2 , because y 1/2 
is no longer the average of y 0 and yj. Try both second-order rules on x 2 : 


I=p 0 x 2 dx n = 

1 

10 

100 

error T„ - I = 

1/6 

1/600 

1/60000 

error M„ - / = 

-1/12 

-1/1200 

-1/120000 


The errors fall by 100 when n is multiplied by 10. The midpoint rule is twice as good 
(- 1/12 vs. 1/6). Since all smooth functions are close to parabolas (quadratic approxi- 
mation in each interval), the leading errors come from Figure 5.19. The trapezoidal 
error is exactly &(Ax) 2 when y(x) is x 2 (the 12 in the formula divides the 2 in /): 

T B -l*^(Ax) 2 [(y' l -yi)+ - + (/„- yi-O] = “ >0 (5) 

<Ax) 2 [y; (Ax) 2 [y'(6) - y'(a)~] (6) 

For exact error formulas, change /(b)-y'(a) to (b - a)y"(c). The location of c is 
unknown (as in the Mean Value Theorem). In practice these formulas are not much 
used — they involve the pth derivative at an unknown location c. The main point 
about the error is the factor (Ax) p . 

One crucial fact is easy to overlook in our tests. Each value of y(x) can be extremely 
hard to compute. Every time a formula asks for y } , a computer calls a subroutine. The 
goal of numerical integration is to get below the error tolerance, while calling for a 
minimum number of values of y. Second-order rules need about a thousand values for 
a typical tolerance of 10 -6 . The next methods are better. 

FOURTH-ORDER RULE: SIMPSON 

The trapezoidal error is nearly twice the midpoint error (1/6 vs. -1/12). So a 
good combination will have twice as much of M„ as T„. That is Simpson ’s rule: 

S " = ^ T " + \ M * = \ + 4y ' 12 + 2l>l + 4l>312 + + + + >""]* ( ? ) 
Multiply the midpoint values by 2/3 = 4/6. The endpoint values are multiplied by 
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2/6, except at the far ends a and b (with heights y 0 and y„). This 1-4-2-4-2-4-1 
pattern has become famous* 

Simpson's rule goes deeper than a combination of T and AT It comes from a 
parabolic approximation to y(x) in each interval. When a parabola goes through y 0 , 
y lf2i y ls the area under it is £Ax(y 0 + 4y L/2 + y 2 )* This is S over the first interval. All 
our rules are constructed this way : Integrate correctly as many powers l, x, x 2 t ... as 
possible . Parabolas are better than straight tines, which are better than flat pieces* 
S beats M, which beats R . Check Simpson’s rule on powers of x, with Ax~ I/m: 



n = I 

n= 10 

n = 100 

error if , = x 2 

0 

0 

0 

error if , = x 3 

0 

0 

0 

error if , = x* 

8.33- I0 -3 

8.33- 10‘ 7 

8.33 • 10 -11 


Exact answers for x 2 are no surprise. S„ was selected to get parabolas right. But the 
zero errors for x 3 were not expected* The accuracy has jumped to fourth order , with 
errors proportional to (Ax) 4 . That explains the popularity of Simpson’s rule. 

To understand why x 3 is integrated exactly, look at the interval [-1, 1], The odd 
function x 3 has zero integral, and Simpson agrees by symmetry: 


J x 3 dx — ~ x 4 ] = 0 and 


"y 

-[(-l) 3 + 4(0) 3 + 1 3 ] = 0, 


( 8 ) 



Fig. 5,20 Simpson versus Gauss: E = c(Ax) 4 (yj+ y - yj") with c s = 1/2880 and c G = —1/4320. 


THE GAUSS RULE (OPTIONAL) 


We need a competitor for Simpson, and Gauss can compete with anybody. He 
calculated integrals in astronomy, and discovered that two points are enough for a 
fourth-order method . From —1 to l (a single interval) his rule is 

f -1 y(x) dx » y(- 1/73) + ,(1/73). (9) 

Those " Gauss points ” x = —1/73 and x = 1/^/3 can be found directly. By placing 
them symmetrically, all odd powers x, x\ ... are correctly integrated* The key is in 
y = x 2 , whose integral is 2/3* The Gauss points “x G and + x ti get this integral right: 

2 1 1 
3 = {“■ *g ) 2 + ( x g) 2 > so xj = - and x G = ± 

Figure 5.20c shifts to the interval from 0 to Ax. The Gauss points are 
(1 ± 1/^/3) Ax/2. They are not as convenient as Simpson’s (which hand calculators 
prefer)* Gauss is good for thousands of integrations over one interval. Simpson is 
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good when intervals go back to back — then Simpson also uses two y’s per interval. 
For y = x 4 , you see both errors drop by 10“ 4 in comparing n = 1 to n = 10: 

/ = jo x 4 dx Simpson error 8.33 *10” 3 8.33*10“' 

Gauss error -5.56* 10“ 3 - 5.56* 10“ 7 


DEFINITE INTEGRALS ON A CALCULATOR 


It is fascinating to know how numerical integration is actually done. The points are 
not equally spaced! For an integral from 0 to 1, Hewlett-Packard machines might 
internally replace x by 3 u 2 — 2 u 3 (the limits on u are also 0 and 1). The machine 
remembers to change dx. For example. 


Algebraically that looks worse — but the infinite value of 1/^/x at x = 0 disappears 
at h = 0. The differential 6 (u — u 2 ) du was chosen to vanish at u = 0 and u = 1 . We 
don't need y(x) at the endpoints — where infinity is most common. In the u variable 
the integration points are equally spaced — therefore in x they are not. 

When a difficult point is inside [a, h], break the interval in two pieces. And chop 
off integrals that go out to infinity. The integral of e should be stopped by 
x = 10, since the tail is so thin. (It is bad to go too far.) Rapid oscillations arc among 
the toughest — the answer depends on cancellation of highs and lows, and the calcula- 
tor requires many integration points. 

The change from x to u affects periodic functions. I thought equal spacing was 
good, since 1/(10 + cos 2nx) was integrated above to enormous accuracy. But there 
is a danger called aliasing. If sin 8 ttx is sampled with Ax = 1/8, it is always zero. A 
high frequency 8 is confused with a low frequency 0 (its ''alias' ' which agrees at the 
sample points). With unequal spacing the problem disappears. Notice how any integ- 
ration method can be deceived: 


dx 


becomes 


r 

Jo 


6(u - u 2 ) du 

Jlu 1 ~ 2 u 1 


n 


6(1 - u) du 
v /3 - 2u * 


Ask for the integral of y = 0 and specify the accuracy. The calculator 
samples y at x } , x h . (With a PAUSH key, the x's may be displayed.) 
Then integrate F(x) = (x — Xj ) 2 — (x — x ft ) 2 . That also returns the 
answer zero (now wrong), because the calculator follows the same steps. 


On the HP-28S you enter the function, the endpoints, and the accuracy. The 
variable x can be named or not (see the margin). The outputs 4.67077 and 4.7E-5 are 
the requested integral e x dx and the estimated error bound. Your input accuracy 
.00001 guarantees 


3 

2 

1 


1 EXP (X) 1 
fX 1 2 } 
-00001 


relative error in y = 


true y — computed y 
computed y 


^ . 00001 . 


3: C (EXP) ) 
2 : {1 2 > 

1 : -00001 


The machine estimates accuracy based on its experience in sampling y(x). If you 
guarantee e x within .00000000001 , it thinks you want high accuracy and takes longer. 

In consulting for HP, William Kahan chose formulas using 1, 3, 7, 15, ... sample 
points. Each new formula uses the samples in the previous formula. The calculator 
stops when answers are close. The last paragraphs are based on Kahan's work. 
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TI-81 Program to Test the Integration Methods L, R, T, M, S 


Prgm I : NUM INT 

: D/2-*H 

A + J D-* X 

: D i sp 

M 

:Disp " A= " 

: A-*X 

R+Yl-*R 

T, 

S 

: Input A 

: Yl-*L 

ISXJ,N) 

: D i sp 

L 

:Disp "B= " 

:1-*J 

Goto 1 

: Di sp 

R 

:Input B 

:0-*R 

(L + R“Yl ) D-*L 

: Di sp 

H 

: Lbl N 

:0-»H 

R D-> R 

: Di sp 

T 

:Disp "N= " 

: Lbl 1 

MD-*H 

: D i sp 

S 

:Input N 

: X + H->X 

(L + R) /2-*T 

: Pau se 

: (B-A) / N-> D 

:M + Yi -*M 

( 2M + T ) /3-cS 

: Go 1 0 

N 


Place the integrand y(x) in the Y 1 position on the Y = function edit screen. Execute 
this program, indicating the interval [^4, £] and the number of subintervals N. Rules 
L and R and Af use N evaluations of y(x). The trapezoidal rule uses JV+ 1 and 
Simpson’s rule uses 2 N + 1. The program pauses to display the results. Press ENTER 
to continue by choosing a different N, The program never terminates (only pauses). 
You break out by pressing ON, Don’t forget that IS, Goto, ... are on menus. 


5.8 EXERCISES 


Read-through questions 

To integrate y(x), divide [a, fc] into n pieces of length 
Ax = o , and L„ place a b over each piece, 
using the height at the right or c endpoint: 
R n = Ax(y 1 + **• + y fl ) and L r = d . These are a 
order methods, because they arc incorrect for y = i . The 
total error on [0,1] is approximately o . For y = cosnx 
this leading term is ft . For y = cos 2nx the error is very 
small because [0, 1] is a complete i , 

A much better method is 71 = jJC + I — 
Axfjv /1 + n Vi+ tti + t v n ]. This m rule is 
n -order because the error for y = x is o The error 
for y — x 2 from a to b is p . The g rule is twice as 

accurate, using = Ax[ r ] . 

Simpson’s method is S n = jAf* + > . It is t -order, 

because the powers m are integrated correctly. The 

coefficients of yo^ > 1 / 2^1 are v times Ax. Over three 
intervals the weights are Ax/6 times 1-4- w . Gauss uses 
a points in each interval, separated by Ax/^/3. For a 
method of order p the error is nearly proportional to r , 

1 What is the difference L n — T n l Compare with the leading 
error term in (2). 

2 If you cut Ax in half, by what factor is the trapezoidal 
error reduced (approximately)? By what factor is the error in 
Simpson's rule reduced? 

3 Compute R„ and L „ for j^x 3 dx and w= 1, 2, 10. Either 

verify (with computer) or use (without computer) the formula 
l 3 -h 2 3 + + n l = in 2 (n + l) 2 . 


4 One way to compute T n is by averaging ifL^ + R,,). 
Another way is to add iy 0 + >i + " + + iy n - Which is more 
efficient? Compare the number of operations. 

5 Test three different rules on / = J ^ x A dx for n = 2, 4, 8. 

6 Compute it to six places as 4 j* dxj( 1 +x 2 ), using any 
rule. 

7 Change Simpson’s rule to Ax{i> 0 + i>i /2 + i^i) in each 
interval and find the order of accuracy p, 

8 Demonstrate superdecay of the error when 1/(3 + sin x) is 
integrated from 0 to 2n. 

9 Check that (Ax) 2 (j^ + 1 — y})/12 is the correct error for 
y — 1 and y — x and y = x 2 from the first trapezoid (j — 0). 
Then it is correct for every parabola over every interval. 


10 Repeat Problem 9 for the midpoint error 
— (Ax) 2 (y} +1 — y})/24. Draw a figure to show why the rectan- 
gle M has the same area as any trapezoid through the mid- 
point (including the trapezoid tangent to >(x))l 

11 In principle sin 2 x dxjx 2 =^n. With a symbolic algfr- 
bra code or an HP-28S, how many decimal places do you 
get? Cut off the integral to and test large and small A , 


12 These four integrals all equal rc: 

f 1 dx f® sin x , 8 f* . d _ f- x” 1 ' 3 

f I ax - sm* x dx — — 

Jo yfx(l -x) x 3 J 0 Jo 1 + 


~ 1/2 dx 

X 


(a) Apply the midpoint rule to two of them until 
n* 3,1416. 

(b) Optional: Pick the other two and find n & 3. 
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13 To compute In 2 = dx/x = . 69315 with error less than 
,001, how many intervals should T n need? Its leading error is 
(Ax) 2 [y'(f>) - y{a)]/ll Test the actual error with y ^ 1/x. 

14 Compare T H with \f H for y/x dx and n = 1, 10, 100. The 
error prediction breaks down because y'(0) — co. 

15 Take f(x) = y(x) dx in error formula 3R to prove that 
iT y( x ) ^ x — y(0) Ax k exactly ^(Ax) 2 /(c) for some point c. 

16 For the periodic function y(x) = 1/(2 + cos 6nx) from — t 
to 1, compare T and S and G for n — 2, 

17 For / = >/l -x 2 dx, the leading error in the trapezoi- 
dal rule is . Try n = 2, 4, 8 to defy the prediction. 

18 Change to x = sin 0, ^/l — x 2 = cos 0, dx = cos $ dQ , and 
repeat T 4 on j* /2 cos 2 0 dO. What is the predicted error after 
the change to 0? 

19 Write down the three equations vty(0) + Z?y(£) + Cy(l) — / 
for the three integrals / = |^ 1 dx, x dx, j* x 2 dx. Solve for 
Ay By C and name the rule. 

20 Can you invent a rule using j4y 0 + By 1/4 + Cy 1!2 + 
Dym + Eyj to reach higher accuracy than Simpson’s? 

21 Show that T„ is the only combination of L„ and R n that 
has second-order accuracy. 


22 Calculate j e"* 1 dx with ten intervals from 0 to 5 and 0 
to 20 and 0 to 400. The integral from 0 to oo is What 
is the best point to chop off the infinite integral? 

23 The graph of y(x)=* l/(x 2 + 10“ l0 ) has a sharp spike and 
a long tail. Estimate J* y dx from T l0 and T 10G (don't expect 
much). Then substitute x = 1(T 5 tan 0, dx = 10 -5 sec 2 0 d0 
and integrate 10 3 from 0 to tt/4, 

24 Compute jj|x — n)dx from T 4 and compare with the 
divide and conquer method of separating — w| dx from 
J* |x - n\ dx. 

25 Find a, b t c so that y^ax 2 + bx + c equals 1,3,7 at 

x = 0, 1 (three equations). Check that i*l+f-3+£*7 

equals y dx . 

26 Find c in S-I = c(Ax)*[/"(l)- y"'(0)] by taking y = x 4 
and Ax = 1. 

27 Find c in G — I “c(Ax) 4 [y"'(l) — y*"[— 1)] by taking 

y = x\ Ax = 2. and G - (~ 1 + (t//*) 4 . 

28 What condition on y(x) makes = for the 

integral J* y(x) dx? 

29 Suppose y(x) is concave up. Show from a picture that the 
trapezoidal answer is too high and the midpoint answer is 
too low. How does y" > 0 make equation (5) positive and (6) 
negative? 
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CHAPTER 6 


Exponentials and Logarithms 


This chapter is devoted to exponentials like 2* and 10* and above all e*. The goal is 
to understand them, differentiate them, integrate them, solve equations with them, 
and invert them (to reach the logarithm). The overwhelming importance of e 1 makes 
this a crucial chapter in pure and applied mathematics. 

In the traditional order of calculus books, e* waits until other applications of the 
integral are complete. I would like to explain why it is placed earlier here. I believe 
that the equation dyjdx = y has to be emphasized above techniques of integration. 
The laws of nature are expressed by differentia I equations, and at the center is e*. Its 
applications are to life sciences and physical sciences and economics and engineering 
(and more — wherever change is influenced by the present state). The model produces 
a differential equation and I want to show what calculus can do. 

The key is always b m+n = (b m )(f? n ). Section 6.1 applies that rule in three ways: 

1. to understand the logarithm as the exponent ; 

2, to draw graphs on ordinary and semilog and log-log paper; 

3* to find derivatives . The slope of b x will use b**** = {& x )(& Ax ) T 



6.1 An Overview 



There is a good chance you have met logarithms. They turn multiplication into 
addition, which is a lot simpler. They are the basis for slide rules (not so important) 
and for graphs on log paper (very important). Logarithms are mirror images of 
exponentials — and those I know you have met. 

Start with exponentials. The numbers 10 and 10 2 and 10 3 are basic to the decimal 
system. For completeness I also include 10°, which is “ten to the zeroth power” or 
1. The logarithms of those numbers are the exponents . The logarithms of 1 and 10 and 
100 and 1000 are 0 and 1 and 2 and 3. These are logarithms “to base 10,” because 
the powers are powers of 10. 

Question When the base changes from 10 to b, what is the logarithm of 1? 

228 Answer Since b° = 1, log b l is always zero . To base h, the logarithm of if is n. 
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Negative powers are also needed. The number 10* is positive, but its exponent x can 
be negative. The first examples are 1/10 and 1/100, which are the same as 10“ 1 and 
10 2 . The logarithms are the exponents — l and — 2: 

1000 = IQ 3 and log 1000 = 3 

1/1000= 10" 3 and log 1/1000=* “3. 

Multiplying 1000 times 1/1000 gives 1 = 10°. Adding logarithms gives 3 + (— 3) — 0. 
Always 10 m times 10" equals 10 m + n . In particular 10 5 times 10“ produces five tens: 

(I0)fl0)(!0) limes (10)(10) equals (10)( 10)(10)(10)( 10) = 10 5 . 

The law for h m times h n extends to all exponents, as in 10 46 times 10*. Furthermore 
the law applies to all bases (we restrict the base to h> 0 and h ^ 1). In every case 
multiplication of numbers is addition of exponents, 

6A b m times h n equals h m+n , so logarithms (exponents) add 

h m divided by h n equals h m ~ n , so logarithms (exponents) subtract 

log b (yr) = log(,y + log t z and \o% b (yjz) = log b y - log b z. (1) 


Historical note In the days of slide rules, 1.2 and 1.3 were multiplied by sliding 
one edge across to 1,2 and reading the answer under 1.3, A slide rule made in 
Germany would give the third digit in 1,56, Its photograph shows the numbers on a 
log scale. The distance from 1 to 2 equals the distance from 2 to 4 and from 4 to 8, 
By sliding the edges, you add distances and multiply numbers. 

Division goes the other way. Notice how 1000/10 = \00 matches 3 - 1 = 2. To divide 
1,56 by 1,3, look back along line D for the answer 1.2. 

The second figure, though smaller, is the important one. When x increases hy 1, 2* 
is multiplied hy 2, Adding to x multiplies y. This rule easily gives y = 1, 2, 4, 8, but 
look ahead to calculus— which doesn't slay with whole numbers. 

Calculus will add Ax. Then y is multiplied by 2 Ajf , This number is near 1, If 
Ax — jts then ^ 1,07— the tenth root of 2. To find the slope ; we have to consider 

(2** 1)/Ax. The limit is near ( 1 .07 — l)Mi = -7, but the exact number will take time. 



Fig, 6,1 An ancient relic (the slide rule). When exponents x add, powers 2 X multiply. 
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Base Change Bases other than 10 and exponents other than 1, 2, 3, . are needed 
for applications. The population of the world x years from now is predicted to grow 
by a factor close to L02* + Certainly x does not need to be a whole number of years. 
And certainly the base 1 .02 should not be 10 (or we are in real trouble). This prediction 
will be refined as we study the differential equations for growth. It can be rewritten 
to base 10 if that is preferred (but look at the exponent): 

1.02* is the same as io' 10 * 1 - 02 * 

When the base changes from 1.02 to 10, the exponent is multiplied — as we now see. 

For practice, start with base b and change to base a , The logarithm to base a will 
be written “log, 1 ’ Everything comes from the rule that logarithm = exponent: 

base change for numbers: b = 

Now raise both sides to the power x, You see the change in the exponent: 
base change for exponentials: b x = 

Finally set y = b x . Its logarithm to base b is x. Us logarithm to base a is the exponent 
on the right hand side: log fl y ~ (log^bjx. Now replace x by \og b y: 

base change for logarithms : log^ v = (log^Klog,, y). 

We absolutely need this ability to change the base. An example with a = 2 is 

b = 8 = 2 3 8 2 = (2 3 ) 2 = 2 6 log 2 64 = 3 '2 « (log 2 8)(log B 64), 

The rule behind base changes is (a m ) x = a mx . When the mth power is raised to the 
xth power, the exponents multiply. The square of the cube is the sixth power: 

(a)(a)(a) times (a){a){fl) equals (a)(a)(tf){tf)(j)(tf): (a 3 ) 2 = a 6 . 

Another base will soon be more important than 10™here are the rules for base 
changes: 


6B To change numbers , power s, and logarithms from base b to base a , use 

b = a ia ^ b b x = log a y = {log a b)(log 6 y) (2) 


The first is the definition. The second is the xth power of the first. The third is the 
logarithm of the second (remember y is b x )♦ An important case is y = a: 

log a a = (log a ii)(log t a)= 1 so log u b= l/log^a. (3) 

EXAMPLE 8 = 2 3 means 8 1 ' 3 = 2. Then (log 2 8)(log 8 2) = (3)(l/3) = 1. 

This completes the algebra of logarithms. The addition rules 6A came from 
(6 m )(b n ) = b m + n . The multiplication rule 6B came from ( a m f = a mx * We still need to 
define b x and a x for all real numbers x , When x is a fraction, the definition is easy. 
The square root of a s is a* (m — 8 times x = 1/2). When x is not a fraction, as in 2 rt , 
the graph suggests one way to fill in the hole. 

We could define 2" as the limit of 2 3mo , 2 3 14/1 00 As the fractions r approach 

7i, the powers 2 r approach 2*. This makes y = 2 X into a continuous function, with the 
desired properties (2 ffl )(2 N ) = 2 m + 71 and (2 m ) x = 2 mx — whether m and n and x are inte- 
gers or not. But the e’s and <5’s of continuity are not attractive, and we eventually 
choose (in Section 6,4) a smoother approach based on integrals. 
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GRAPHS OF b* AND log b y 

It is time to draw graphs. In principle one graph should do the job for both functions, 
because y — h* means the same as x — log*y. These are inverse functions. What one 
function does, its inverse undoes. The logarithm of g(x ) = fr* is x: 

g ~ ‘(g(*)) = lo&(M) = X. (4) 

In the opposite direction, the exponential of the logarithm of y is y: 

gig~\y)) = b^ = y. (5) 

This holds for every base b t and it is valuable to see b = 2 and b = 4 on the same 
graph. Figure 6.2a shows y — 2 X and y — 4*. Their mirror images in the 45° line give 
the logarithms to base 2 and base 4, which are in the right graph. 

When x is negative* y=b x is still positive. If the first graph is extended to the left, 
it stays above the x axis. Sketch it in with your pencil. Also extend the second graph 
down, to be the mirror image. Don’t cross the vertical axis. 



Fig. 6.2 Exponentials and mirror images (logarithms). Different scales for x and y. 

There are interesting relations within the left figure. All exponentials start at 1, 
because 6° is always 1. At the height y= 16, ooe graph is above x — 2 (because A 2 — 
16). The other graph is above x = 4 (because 2 4 = 16). Why does 4' in one graph equal 
2 2x in the other ? This is the base change for powers, since 4 — 2 2 . 

The figure on the right shows the mirror image — the logarithm. All logarithms 
start from zero at y — 1. The graphs go down to - co at y = 0. (Roughly speaking 
2” 30 is zero.) Again x in one graph corresponds to 2x in the other (base change for 
logarithms). Both logarithms climb slowly, since the exponentials climb so fast. 

The number log 2 10 is between 3 and 4, because 10 is between 2 3 and 2 4 . The slope 
of 2* is proportional to 2 X — which never happened for x\ But there are two practical 
difficulties with those graphs: 

1. 2 X and 4* increase too fast. The curves turn virtually straight up. 

2 * The most important fact about Ab x is the value of b — and the base 
doesn’t stand out in the graph. 

There is also another point. In many problems we don’t know the function y = 
/(x). We are looking for it! All we have are measured values of y (with errors mixed 
in). When the values are plotted on a graph, we want to discover f{x). 

Fortunately there is a solution. Scale the y axis differently . On ordinary graphs, 
each unit upward adds a fixed amount to y. On a log scale each unit multiplies y by 
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a fixed amount . The step from y = 1 to y = 2 is the same length as the step from 3 to 
6 or 10 to 20. 

On a log scale, y = 1 1 is not halfway between 10 and 12. And y = 0 is not there at 
all. Each step down divides by a fixed amount — we never reach zero . This is com- 
pletely satisfactory for Ab x , which also never reaches zero. 

Figure 6.3 is on semilog paper (also known as log-linear ), with an ordinary x axis. 
The graph of y— Ab x is a straight line. To see why, take logarithms of that equation: 

log y - log A + x log b. (6) 

The relation between x and log y is linear . It is really log y that is plotted, so the graph 
is straight. The markings on the y axis allow you to enter y without looking up its 
logarithm — you get an ordinary graph of log y against x. 

Figure 6.3 shows two examples. One graph is an exact plot of y = 2* 10*. It goes 
upward with slope 1, because a unit across has the same length as multiplication by 
10 going up. 10* has slope 1 and 10 (log6) * (which is If) will have slope log b. The 
crucial number log b can he measured directly as the slope. 



Rg. 6.3 2- 10* and 4* 10 " xt2 on semilog paper. 


Fig. 6.4 Graphs of Ax k on log-log paper. 


The second graph in Figure 6.3 is more typical of actual practice, in which we start 
with measurements and look for fix). Here are the data points: 


x = 0.0 0.2 0.4 0.6 0.8 1.0 

y = 4.0 3.2 2.4 2.0 1.6 1.3 


We don’t know in advance whether these values fit the model y = Ab x . The graph is 
strong evidence that they do. The points lie close to a line with negative slope — 
indicating log b < 0 and b < 1. The slope down is half of the earlier slope up, so the 
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model is consistent with 


v = A * 10 xl2 or log y = log A - ^x* (7) 

When x reaches 2, y drops by a factor of 10, At x = 0 we see A ^ 4* 

Another model — a power y = Ax k instead of an exponential — also stands out with 
logarithmic scaling* This time we use log-tog paper , with both axes scaled. The 
logarithm of y = Ax k gives a linear relation between log y and log x: 

log y = log A + k log x. (8) 

The exponent k becomes the slope on tog-log paper. The base b makes no difference. 
We just measure the slope, and a straight line is a Jot more attractive than a power 
curve. 

The graphs in Figure 6,4 have slopes 3 and \ and -1. They represent Ax* and 
A^fx and Ajx, To find the A's, look at one point on the line* At x = 4 the height is 
8, so adjust the A's to make this happen: The functions are x 3 /8 and 4 ^Jx and 32/x* 
On semilog paper those graphs would not be straight! 

You can buy log paper or create it with computer graphics, 

THE DERIVATIVES OF y = & AND x= log b y 

This is a calculus book. We have to ask about slopes. The algebra of exponents is 
done, the rules are set, and on tog paper the graphs are straight. Now come limits. 

The central question is the derivative. What is dyjdx when y= b x ? What is dx/dy 
when x is the logarithm log^y? Thpse questions are closely related, because b x and 
log^y are inverse functions. If one slope can be found, the other is known from 
dx/dy = \j{dy/dx ), The problem is to find one of them, and the exponential comes 
first. 

You will now see that those questions have quick (and beautiful) answers, except 
for a mysterious constant. There is a multiplying factor c which needs more time* I 
think it is worth separating out the part that can be done immediately, leaving c in 
dyjdx and 1 jc in dx/dy. Then Section 6*2 discovers c by studying the special number 
called e {but c ^ e ). 


6C The derivative of b* is a multiple cb x . The number c depends on the 
base b , 


The product and power and chain rules do not yield this derivative* We are pushed 
all the way back to the original definition, the limit of Ay/Ax: 


dy y(x -b /i) — y(x) 

= l im — 

dx h >o h 


lim 

*-►0 


b x + h - b x 


h 


(9) 


Key idea : Split b x ~ h into b x times b h . Then the crucial quantity b x factors out* More 
than that, b x comes outside the limit because it does not depend on h. The remaining 
limit, inside the brackets, is the number c that we don't yet know: 


dy ,* b x b h -b x 

— = Inn : = b 

dx fi-o h 


lim 

fc -0 


b h ~ 1 


= cb x 


( 10 ) 


This equation is central to the whole chapter: dy/dx equals cb x which equals cy. The 
rate of change of y is proportional to y. The slope increases in the same way that b x 
increases (except for the factor c). A typical example is money in a bank, where 
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interest is proportional to the principal. The rich get richer, and the poor get slightly 
richer. We will come back to compound interest, and identify b and c. 

The inverse function is x = log^y. Now the unknown factor is 1 jc\ 

60 The slope of Iog*y is 1 jcy with the same c (depending on b ). 

Proof If dyjdx = cb x then dxjdy - 1 jch x = 1 fey. (1 1) 

That proof was like a Russian toast, powerful but too quick! We go more carefully: 
f(b*) = x (logarithm of exponential) 

/'(&*)(£&*)= 1 (x derivative by chain rule) 
f*(h x ) = 1 jch x (divide by cf> x ) 
f\y)= 1 jcy (identify b x as y) 

The logarithm gives another way to find c. From its slope we can discover 1 jc. This 
is the way that finally works (next section). 




Fig. 6.5 The slope of 2* is about .7* 2*. The slope of log 3 y is about l/.7y. 

Final remark It is extremely satisfying to meet an J\y) whose derivative is I jcy. 
At last the “ — 1 power” has an antiderivative. Remember that = x fl + 1 /( n + 1) 
is a failure when n= — 1. The derivative of x° (a constant) does not produce x~ 
We had no integral for x" 1 , and the logarithm Jilts that gap . If y is replaced by x or t 
(all dummy variables) then 

d ( 1 , d , 1 

— log A .v=— and -log^ = --. (12) 

ax cx at ct 

The base b can be chosen so that c = 1. Then the derivative is 1/x. This final touch 
comes from the magic choice b = e — the highlight of Section 6.2. 


6.1 EXERCISES 


Read-through questions 

In \Q* = 10,000, the exponent 4 is the q of 10,000. The 
base is b = b . The logarithm of 10™ times 10" is c , 
The logarithm of 10 m /10" is d The logarithm of 10,000* 
is e If y = b x then x — t Here x is any number, 
and y is always a 

A base change gives b = a — ■ — and b x = a — ' — . Then 
8 s is 2 15 . In other words log 2 y is i times log s y, When 
y — 2 it follows that log : 8 times !og s 2 equals * 


On ordinary paper the graph of y = 1 is a straight line. 

Its slope is m . On semilog paper the graph of y = n 
is a straight line. Its slope is o On log-log paper the 
graph of y = p is a straight line. Its slope is Q 

The slope of y = b x is dv/dx — r where c depends on 

b. The number c is the limit as h of s . Since x = 

log(,y is the inverse, f dx/dy)(dyfdx) = i Knowing 

dy/dx = cb* yields dx/dv = u Substituting b x for y, the 
slope of log ft v is v With a change of letters, the slope of 
log fr x is w 
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Problems 1-10 me the rales for logarithms. 


14 Draw semiJog graphs of y = 10 1 1 and y = if^/lO)** 


1 Find these logarithms (or exponents): 

(a) log 2 32 (b) log,(l/32) (c) log 32 (l/32) 

(d) log 32 2 (e) log IQ (10yi0) (f) log 2 {log z 16) 

2 Without a calculator find the values of 

(a) (b) 3 2IoS3S 

(c) log 10 5 + log 10 2 (d) (log^Jflog^) 

(e) 1 G 5 1 0 " 4 1 0* (f ) log 2 56 - log 2 7 

3 Sketch y = 2~ x and y — i(4*) from —1 to 1 on the same 
graph* Put their mirror images x= “log 2 y and x = log 42 y 
on a second graph. 

4 Following Figure 6.2 sketch the graphs of y — and x = 
logi^y* What are log 1/2 2 and log 1/2 4? 

5 Compute without a computer: 

(a) log 2 3 + log 2 i (b) log 2 (i) 10 

(c)Iog I0 10(r (d)Oog I0 e)(log t 10) 

(e)2 21 /(2 J ) 3 (f) logjl/e) 

6 Solve the following equations for x: 

(a) logiodO 1 ) = 7 (b) log 4x - log 4 = log 3 

(c)log*10 = 2 (d) log 2 (l/jt) = 2 

(e) log x + log x = log 8 (f) logjfx*) = 5 

7 The logarithm of y = x* is log^ = * 

*8 Prove that (1og*0}(logjc) = (logjjJflog b c). 

9 2 10 is close to 10* (1024 versus 1000). If they were equal 

then 1og 2 10 would be . Also log J0 2 would be 

instead of 0.301. 

10 The number 2 1000 has approximately how many (decimal) 
digits? 

Questions 11-19 are about the graphs of y — b* and x = log^y* 

11 By hand draw the axes for semilog paper and the graphs 
of y — 1.1* and y = 10(1,1)*. 

12 Display a set of axes on which the graph of y = log to x is 
a straight line* What other equations give straight lines on 
those axes? 

13 When noise is measured in decibels, amplifying by a factor 

A increases the decibel level by 10 log A. If a whisper is 20db 
and a shout is 70db then 10 log A = 50 and A — . 


15 The Richter scale measures earthquakes by logi 0 (J//o) = 

R. What is R for the standard earthquake of intensity / 0 ? If 
the 1989 San Francisco earthquake measured R — 7, how did 
its intensity / compare to / 0 ? The 1906 San Francisco quake 
had R = 8*3* The record quake was four times as interne with 
R = , 

16 The frequency of A above middle C is 440/second* The 

frequency of the next higher A is . Since 2 7/12 a 1.5* 

the note with frequency 660/sec is * 

17 Draw your own semiJog paper and plot the data 

y = 7, 11, 16, 28, 44 at x = 0, 1/2, 1, 3/2, 1 
Estimate A and b in y = Ah*. 

18 Sketch log-log graphs of y = x 2 and y — 

19 On log-log paper, printed or homemade, plot y = 4, 11, 
21, 32, 45 at x = 1, 2, 3, 4, 5* Estimate A and k in y = Ax*. 

Questions 20-29 are about the derivative dy/dx = cfr\ 

20 g(x) = b* has slope £ = cg. Apply the chain rule to 
gfj(y)) = y to prove that dfjdy = 1/ey. 

21 If the slope of log x is 1/cx, find the slopes of log (2x) and 
log (x 2 ) and log (2 1 )* 

22 What is the equation (including c) for the tangent line to 
y = 10“ at x =0? Find also the equation at x = 1. 

23 What is the equation for the tangent line to x — log 10 y at 
y = 1? Find also the equation at y = 10. 

24 With b = 10, the slope of 10“ is clO \ Use a calculator for 
small h to estimate c — lim (10* — l}/h. 

25 The unknown constant in the slope of y = (.l)* is 
L = iim (.1*— l)/h. (a) Estimate L by choosing a small h. 
(b) Change h to — h to show that L— —c from Problem 24. 

26 Find a base b for which (b h — 1 )jh a? 1. Use h = 1/4 by hand 
or h = 1/10 and 1/100 by calculator. 

27 Find the second derivative of y = b x and also of x = log^y. 

28 Show that C — lim (100* — l)/h is twice as large as c = 
lim (10* — l)jh. (Replace the last h'$ by 2 h*) 

29 In 28, the limit for b = 100 is twice as large as for b = 10, 

So c probably involves the of b. 
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6 Exponentials and Logarithms 


6.2 The Exponential e* 


The last section discussed b* and logh>. The base b was arbitrary— it could be 2 or 6 
or 9,3 or any positive number except 1, But in practice, only a few bases are used. 
I have never met a logarithm to base 6 or 9.3. Realistically there are two leading 
candidates for b, and 10 is one of them. This section is about the other one, which is 
an extremely remarkable number, Tbis number is not seen in arithmetic or algebra 
or geometry, where it looks totally clumsy and out of place. In calculus it comes into 
its own. 

The number is e. That symbol was chosen by Euler (initially in a fit of selfishness, 
but he was a wonderful mathematician). It is the base of the natural logarithm , 
It also controls the exponential c 1 , which is much more important than In x, 
Euler also chose n to stand for perimeter — anyway, our first goal is to find e . 

Remember that the derivatives of b x and log^y include a constant c that depends 
on b. Equations (10) and (11) in the previous section were 


-j~b x = cb x and 
dx 


d i 1 

-Hog fcy s - 

dy cy 


0 ) 


At x - 0, the graph of b x starts from b° = 1. The slope is c, At y = 1, the graph of 
logi,y starts from log b l = 0. The logarithm has slope 1/c. With the right choice of the 
base b those slopes will equal 1 (because c will equal 1), 

For y = 2 X the slope c is near .7. We already tried Ax = . 1 and found Ay % ,07, The 
base has to be larger than 2, for a starting slope of c = 1. 

We begin with a direct computation of the slope of log^y at y = 1: 


~ = slope at 1 = lim ^[log b (l + ft) - log^l] = lim log t [(l + ft) 1 '*]. 


( 2 ) 


Always log^l — 0. The fraction in the middle is logrfl + h) times the number 1 /ft. This 
number can go up into the exponent, and it did. 

The quantity (1 + hf !k is unusual, to put it mildly. As h -► 0, the number 1 + ft is 
approaching 1, At the same time, 1/ft is approaching infinity. In the limit we have 
l 00 . But that expression is meaningless (like 0/0). Everything depends on the 
balance between “nearly 1” and “nearly go.” This balance produces the extraordinary 
number e: 


DEFINITION 


The number e is equal to lim (1 + ft) 1; \ Equivalently e = lim 

h-»0 n-** 




Before computing e, look again at the slope 1/c. At the end of equation (2) is the 
logarithm of e: 

1/c = log fc e. (3) 

When the base is b = e f the slope is log e c = 1. That base e has c= 1 as desired : 


The derivative of c* is 1 • ef and the derivative of\og e y is 


1 


( 4 ) 


This is why the base e is all-important in calculus. It makes c = 1. 

To compute the actual number e from (1 + ft) 1/fc , choose ft = 1, 1/10, 1/100, ,,,, Then 
the exponents 1/ft are n= 1, 10, 100, .... (All limits and derivatives will become official 
in Section 6,4,) The table shows (1 + h) 1}h approaching e as ft -* 0 and n -> oc: 
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n 


h 


1 

n 


1 / 1 \* 

1 + /i = 1 + - (1 + h) llh = 1 + - 

n \ n 


1 

1.0 

2.0 

2 

0.5 

1.5 

10 

0.1 

U 

100 

0.01 

1.01 

1000 

0.001 

1.001 

10000 

0.0001 

1.0001 


2,0 

2,25 

2,593742 

2.704814 

2,716924 

2.718146 


The last column is converging to e {not quickly). There is an infinite series that 
converges much faster. We know 125,000 digits of e (and a billion digits of tt). There 
are no definite patterns, although you might think so from the first sixteen digits: 


e = 2.7 1828 1828 45 90 45 - (and !/<?*. 37). 


The powers of e produce y — e x . At x — 2.3 and 5, we are dose to y = 10 and 1 50, 

The logarithm is the inverse function. The logarithms of 150 and 10, to the base e , 
are dose to x = 5 and x = 2,3, There is a special name for this logarithm — the natural 
logarithm , There is also a special notation “In” to show that the base is e: 

In y means the same as log^y. The natural logarithm is the exponent in e x = y. 

The notation In y (or In x — it is the function that matters, not the variable) is standard 
in calculus courses. After calculus, the base is generally assumed to be e. In most of 
science and engineering, the natural logarithm is the automatic choice. The symbol 
"exp(xC means e x , and the truth is that the symbol "log x" generally means In x. 
Base e is understood even without the letters In, But in any case of doubt — on a 
calculator key for example — the symbol "In x' 1 emphasizes that the base is e. 


THE DERIVATIVES OF e* AND In jr 


Come back to derivatives and slopes. The derivative of b x is cb x t and the derivative 
of log,,; 1 is 1 icy. If h = e then c=l. For all bases, equation (3) is ljc = log b e. 
This gives c — the slope of b x at x = 0: 


6E The number c is l/log fr e = log^b, Thus c equals In b , 


( 5 ) 


e = In h is the mysterious constant that was not available earlier . The slope of 2 X is 
In 2 times 2 X , The slope of e x is In e times e x (but In e — 1). We have the derivatives 
on which this chapter depends: 


6F The derivatives of e x and In y are e x and 1/y. For other bases 


" 7 - b x = (In b)b x and 
ax 


d_ 

dy 


Jog b y = 


1 

(In b)y 


( 6 ) 


To make clear that those derivatives come from the functions (and not at all from 
the dummy variables), wc rewrite them using t and x: 



d . 1 

— In x — - . 

ax x 


and 


V) 
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Remark on slopes at x = 0: It would be satisfying to see directly that the slope of 2* 
is below 1, and the slope of 4* is above L Quick proof: e is between 2 and 4, 
But the idea is to see the slopes graphically. This is a small puzzle, which is fun to 
solve but can be skipped. 

2 X rises from 1 at x~0 to 2 at x = 1. On that interval its average slope is 1. Its 
slope at the beginning is smaller than average, so it must be less than 1 — as desired. 
On the other hand 4 X rises from ^ at x = — ^ to 1 at x = 0, Again the average slope 
is j!j = 1* Since x = 0 comes at the end of this new interval, the slope of 4 X at that 
point exceeds 1. Somewhere between 2 J and 4 X is e x t which starts out with slope 1. 

This is the graphical approach to e . There is also the infinite series, and a fifth 
definition through integrals which is written here for the record: 


1, e is the number such that e x has slope 1 at x = 0 

2. e is the base for which In y = log^y has slope 1 at y = 1 


/ lV 

3. e is the limit of 1 1 + -1 as n 


oc 


, 1 1,1 1 , 

4. e — — + — + — 4 — 4 
0! 1! 2! 3! 


, , , 1 1 
=, + i +I+ - 


5* the area x 1 dx equals 1. 


The connections between 1, 2, and 3 have been made. The slopes are 1 when e is the 
limit of (1 4- l/n) fl , Multiplying this out will lead to 4, the infinite series in Section 6.6. 
The official definition of lnx comes from [dx/x, and then 5 says that In e = 1. This 
approach to e (Section 6.4) seems less intuitive than the others. 

Figure 6.6b shows the graph of e~ x . It is the mirror image of e x across the vertical 
axis. Their product is e x e~ x =\ . Where e x grows exponentially, e~ x decays 
exponentially — or it grows as * approaches - oo. Their growth and decay are faster 
than any power of x . Exponential growth is more rapid than polynomial growth, so 
that e x jx n goes to infinity (Problem 59). It is the fact that e x has slope e x which keeps 
the function climbing so fast. 




Fig. 6.6 e x grows between 2 X and 4 X . Decay of e x , faster decay of e * 2/2 . 


The other curve is y = This is the famous “bell-shaped curve" of probability 

theory. After dividing by it gives the normal distribution , which applies to so 

many averages and so many experiments. The Gallup Poll will be an example in 
Section 8.4. The curve is symmetric around its mean value x = 0, since changing x to 
— x has no effect on x 2 . 

About two thirds of the area under this curve is between x = — 1 and x = 1 . If you 
pick points at random below the graph, 2/3 of all samples are expected in that 
interval. The points x = - 2 and x “ 2 are “two standard deviations'" from the center. 
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enclosing 95% of the area. There is only a 5% chance of landing beyond. The decay 
is even faster than an ordinary exponential, because \x l has replaced jc. 

THE DERIVATIVES OF 4* AND B ** 

The slope of e x is e x . This opens up a whole world of functions that calculus can deal 
with. The chain rule gives the slope of e 2x and e siax and every e u(x) : 


6 G The derivative of e u{x} is e* ix) times dujdx. 

Special case u = cx\ The derivative of e cx is ctf x . 


(8) 

( 9 ) 


EXAMPLE 1 The derivative of e 3x is 3e 3 * (here c = 3). The derivative of ^ inx is 
e* inx co$x (here w = sinx). The derivative of /(w{;c)) is dfjdu times dujdx. Here 
/= ^ so dfjdu = e u . The chain rule demands that second factor dujdx . 


EXAMPLE 2 e an2>x is the same as 2 X . Its derivative is In 2 times 2 X . The chain rule 
rediscovers our constant c- In 2. In the slope of b x it rediscovers the factor c = In ft. 

Generally is preferred to the original ft*. The derivative just brings down the 
constant c. It is better to agree on e as the base , and put ail complications (like c - 
In ft) up in the exponent. The second derivative of e* x is c 2 e cx . 

EXAMPLE 3 The derivative of e~ x2n is -xe~ x2jl (here u~ - x 2 jl so dujdx - - jc). 

EXAMPLE 4 The second derivative of /= by the chain rule and product rule, 

is 

f" = (- l)*c”* 2/2 + (— x) 2 e~ xlfl = (x 2 — \)e ~ x2!1 . (10) 

Notice how (he exponential survives. With every derivative it is multiplied by more 
factors, but it is still there to dominate growth or decay. The points of inflection , 
where the bell-shaped curve has f* = 0 in equation (10), are x = t and x = - L 


EXAMPLE 5 (u — n In x). Since e nlnx is in disguise, its slope must be nx H ~ l : 


slope = e Blnx — (n In x) 
dx 


l = .v"(^) = nx"“ 1 . 


( 11 ) 


This slope is correct for all n, integer or not. Chapter 2 produced 3x 2 and 4x 3 from 
the binomial theorem. Now nx" _1 comes from In and exp and the chain rule. 


EXAMPLE 6 An extreme case is x* = (e ln *)*■ Here w = x In x and we need du/dx: 

d 


dx 


(x*) = e* lni ^ln x + xT) = x*(ln x + 1). 


INTEGRALS OF AND e u du/dx 


The integral of e x is e*. The integral of e cx is not e cx . The derivative multiplies by c so 
the integral divides by c. The integral of e cx is e cx /c (plus a constant). 

* i C h x 

e lx dx = -e 2x + C h x dx = - — - + C 

2 J inh 


EXAMPLES 
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I 


^3<, + l> dxss ^3£x+l) +C 


e x * i2 dx -* failure 


The first one has c = 2. The second has c = In b — remember again that b x — e (lnfr)Jc . 
The integral divides by In b. In the third one, e 3tx+ 1) is e 3x times the number e 3 and 
that number is carried along. Or more likely we see e 3tJf + n as e u , The missing dujdx = 
3 is fixed by dividing by 3, The last example fails because dujdx is not there. We 
comet integrate without dujdx: 


6H The indefinite integral 


d*—dx equals e -4 ** + C, 
ax 


Here are three examples with dujdx and one without it: 


I 

J 


e? inx cos x dx - e s,nx 


e^dx 


= 2e + C 


+ C 


x^ 2/2 dx = e* lfl + C 

‘ edx = ^L + C 

(1+e*) 3 l+«* 


The first is a pure e*du. So is the second. The third has u = and dujdx = 1/2^/x, 
so only the factor 2 had to be fixed. The fourth example does not belong with the 
others. It is the integral of duju 1 2 , not the integral of e u du. I don't know any way to 
tell you which substitution is best — except that the complicated part is 1 -f e x and it 
is natural to substitute u. If it works, good. 

Without an extra e* for dujdx , the integral \dxj{ 1 + e x ) 2 looks bad. But u=\+ e* 
is still worth trying. It has du = e x dx = (u — 1 }dx: 


dx 

f du 

J(l+e *) 2 J 

(u-l)u 2 J 


du 


( 


1 

ii- 1 


1 

ti 



(12) 


That last step is “partial fractions The integral splits into simpler pieces (explained 
in Section 7.4) and we integrate each piece. Here are three other integrals: 


e ijx dx 


e x {4 + e*)dx 


e *(4 + e*) dx 


The first can change to — J e^duju 2 , which is not much better. (It is just as impossible.) 
The second is actually J u du , but I prefer a split: J 4e x and [ e 2x are safer to do 
separately. The third is j(4e~ x + 1 )Jx, which also separates. The exercises offer prac- 
tice in reaching e** dujdx ~ ready to be integrated. 


Warning about definite integrals When the lower limit is x = 0, there is a natural 
tendency to expect /(0) = 0— in which case the lower limit contributes nothing. For 
a power/= x 3 that is true. For an exponential/ = e 3x it is definitely not true , because 

yt0)= 1: 


I 


1 1 
e 3x dx = -e 3x 


= ^ 3 -l) 

0 J 


xe x dx= - e x 
o 2 


1 1 

= =(«-«■ 
0 L 
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6.2 EXERCISES 


Read-through questions 

The number e is approximately o . It is the limit of (1 4- h) 
to the power b . This gives 1.01 100 when h = c . An 
equivalent form is e = lim f d y. 

When the base is b = e< the constant c in Section 6.1 is 
Therefore the derivative of y = e* is dyjdx = 1 . The deriv- 
ative of x = log is dxjdy = 0 . The slopes at x = 0 and 

y = I are both h . The notation for log e y is I which 
is the J logarithm of y. 

The constant c in the slope of b x is c — k . The function 
b x can be rewritten as J Its derivative is m . The 
derivative of e“ (x > is n , The derivative of e*™* is o , 

The derivative of brings down a factor p . 

The integral of e* is g . The integral of is i 

The integral of e“ (x) du/<ix is < . In general the integral of 

e mx) ^ jtself | s t to find. 


Find the derivatives of the functions in 1-18. 


1 le lx 

3M 8 
5 3 X 
7 (2/3)* 


2 -le~ lx 
4 (c-*)-® 
6 ^ ,ft3 
8 4 4x 


9 1/(1 + f) 

10 e l ' (1 + x) 

11 e lnl + x ln< 

12 xe lfx 

13 xe 1 - e 1 

14 xV- 2x^ + 2^ 

e* — e x 

15 

16 e ln(x3) 4- ln(e xi ) 

e* + e~* 


17 ' + sin 

18 x~ llx (which is e - — -] 


19 The difference between e and (1 + l/n) fl is approximately 
Ce/n. Subtract the calculated values for n = 10, 100, 1000 from 
2.7183 to discover the number C. 

20 By algebra or a calculator find the limits of (1 + 1/n) 2 " and 
(1 + l/n)A 


... is e. So the limit of 
So the limit of 
The last sequence is 


21 The limit of (11/10) 10 , (101/100) 1 

{10/1 1) 10 , (lOO/lOl) 100 , ... is 

(10/11) 11 , (100/101) 101 , ... is 

(i-W 

22 Compare the number of correct decimals of e for 
(l.OOl) 1000 and (l.OOOl) 10000 and if possible (l.OOOOl) 100000 . 
Which power n would give all the decimals in 2.71828? 

23 The function y = e x solves dyjdx = y, Approximate this 
equation by AY/ Ax = Y, which is Y[x 4- h) — Y(x) = h 7(x), 
With /t = t^ find Y(h) after one step starting from Y{ 0) = 1. 
What is Y( 1) after ten steps? 


24 The function that solves dyjdx = — y starting from y = 1 

at x = 0 is . Approximate by Y(x + h) - T(x) = 

- h T(x). If h = j what is Y(h) after one step and what is Y( 1) 
after four steps? 

25 Invent three functions f> g , h such that for x > 10 

(1 4- l/x) x <f(x) <e x <g(x)< e 2x < h(x) < x*. 

26 Graph f and JP at x - - 2, —1, 0, 1, 2. Another form 

of y/P is . 


Find antiderivatives for the functions in 27-36. 


27 e 3x + e* x 


28 (e 3x ){e 7x ) 


29 l x + 2 X + 3 X 
31 (leY + lf 
33 xe x * + xe~* 2 
35 y/P+if) 1 


30 2~ x 

32 (l/e*) + (l/jO 
34 (sin x)e CMI + (cos 
36 xe 1 (trial and error) 


37 Compare e~* with e~ x *. Which one decreases faster near 
i = 0? Where do the graphs meet again? When is the ratio of 
e~* 2 to e~ x less than 1/100? 


38 Compare e? with x* : Where do the graphs meet? What 
are their slopes at that point? Divide x x by e 1 and show that 
the ratio approaches infinity. 

39 Find the tangent line to y = e 1 at x = a. From which point 
on the graph does the tangent line pass through the origin? 

40 By comparing slopes, prove that if x > 0 then 

(a)e*>l+x (b)*"* > 1 - x. 

41 Find the minimum value of y = x* for x > 0. Show from 
d 2 yjdx 2 that the curve is concave upward. 

42 Find the slope of y — x 1 ^ and the point where dyjdx — 0. 

Check d 2 yjdx 2 to show that the maximum of x i/x is . 

43 If dyjdx = y find the derivative of e^ x y by the product 
rule. Deduce that y(x ) = Ce* for some constant C. 

44 Prove that x e = e* has only one positive solution. 


Evaluate the integrals in 45-54. With infinite limits, 49-50 are 
"improper.” 


45 

"i 

e 2x dx 

0 

46 

"it 

sin x e*°* x t 

0 

47 

M 

2 X dx 

- 1 

48 

2~ x dx 

- 1 

49 

e~ x dx 

0 

50 

xe~ x2 dx 

0 

51 

* i 

e l+x dx 

0 

52 

"i 

e x dx 

O 
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53 |* 2 s '”* cos xdx 54 J* (1 — e*) 10 dx 

55 Integrate the integrals that can be integrated: 

Half* 

56 Find a function that solves /(x) = 5 y(x) with yO) — 2. 

57 Find a function that solves /(x) = l/y(x) with y(0) = 2. 

58 With electronic help graph the function (1 + 1 jxY. What 
are its asymptotes? Why? 


59 This exercise shows that F(x) = x*/e* 0 as x cx>. 

(a) Find dF/dx. Notice that F(x) decreases for x > n >0, 
The maximum of x"/**, at x = n, is n"/e". 

(b) F( 2x) = (2 xT/e u = 2 V/e* • e x < 2 n rf/S • 

Deduce that F(2x) -► 0 as x -► oo, Thus F(x) -+ 0. 

60 With n = 6, graph F(x) = x 6 je* on a calculator or com- 
puter. Estimate its maximum. Estimate x when you reach 
F(x) = 1. Estimate x when you reach F(x) = 

61 Stirling’s formula says that n\ « Jinn n n fe*. Use it to esti- 
mate 6 6 /e 6 to the nearest whole number. Is it correct? How 
many decimal digits in 10!? 

62 x 6 /e* -> 0 is also proved by I’Hdpital’s rule (at x = oo): 

lim x 6 le* = lim 6X 5 /* 1 = fill this in - 0. 


6.3 Growth and Decay in Science and Economics 

The derivative of y = tf x has taken time and effort. The result was y' = ctf x > which 
means that y — cy. That computation brought others with it, virtually for free — the 
derivatives of b * and x* and e u(x K But I want to stay with / = cy — which is the most 
important differential equati 6 n in applied mathematics. 

Compare y' — x with y = y. The first only asks for an antiderivative of x. We quickly 
find y = ^x 2 + C. The second has dyjdx equal to y itself— which we rewrite as dy/y = 
dx. The integral is In y = x + C. Then y itself is e*e c > Notice that the first solution is 
7 X 2 plus a constant, and the second solution is e x times a constant. 

There is a way to graph slope x versus slope y. Figure 6.7 shows “ tangent arrows" 
which give the slope at each x and y. For parabolas, the arrows grow steeper as x 
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Rg.6.7 The slopes are y* = x and y' = y. The solution curves fit those slopes. 
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grows — because y' = slope = x. For exponentials, the arrows grow steeper as y 
grows — the equation is / = slope = y. Now the arrows are connected by y = Ae x . 
A differential equation gives a field of arrows (slopes). Its solution is a curve that stays 
tangent to the arrows — then the curve has the right slope. 

A field of arrows can show many solutions at once (this comes in a differential 
equations course). Usually a single y 0 is not sacred. To understand the equation we 
start from many y 0 — on the left the parabolas stay parallel, on the right the heights 
stay proportional. For / = - y all solution curves go to zero. 

From / = y it is a short step to / = cy. To make c appear in the derivative, put c 
into the exponent. The derivative of y = e cx is ce cx , which is c times y. We have reached 
the key equation, which comes with an initial condition — a starting value y 0 : 

dy/dt = cy with y = y 0 at t = 0. (1) 

A small change: x has switched to t. In most applications time is the natural variable, 
rather than space. The factor c becomes the "growth rate” or "decay rate” — and e cx 
converts to e ct . 

The last step is to match the initial condition. The problem requires y = y 0 at 
t = 0. Our e ct starts from e c0 = 1. The constant of integration is needed now — the 
solutions are y = Ae ct . By choosing A = y 0 , we match the initial condition and solve 
equation (1). The formula to remember is y 0 e ct . 


61 The exponential law y = y 0 e ct solves y' = cy starting from y 0 . 


The rate of growth or decay is c. May I call your attention to a basic fact? The 
formula y 0 e ct contains three quantities y 0 >c,t. If two of them are given, plus one 
additional piece of information, the third is determined. Many applications have one 
of these three forms: find i. find c\find y 0 . 

1. Find the doubling time T if c= 1/10. At that time y 0 e cT equals 2 y 0 : 

e cT = 2 yields cT= In 2 so that T = ^ (2) 

c .1 

The question asks for an exponent T. The answer involves logarithms. If a cell grows 
at a continuous rate of c= 10% per day, it takes about .7/.1 = 7 days to double in 
size. (Note that .7 is close to In 2.) If a savings account earns 10% continuous interest, 
it doubles in 7 years. 

In this problem we knew c. In the next problem we know T. 

2. Find the decay constant c for carbon- 14 if y = %y 0 in T= 5568 years. 

e cT = y yields cT = In 4 so that c ^ (In £)/5568. (3) 

After the half-life T= 5568, the factor e cT equals Now c is negative (In j = - In 2). 

Question 1 was about growth. Question 2 was about decay. Both answers found 
e^ T as the ratio y\T)/y(0). Then cT is its logarithm. Note how c sticks to T. 
T has the units of time, c has the units of "1/time.” 

Main point : The doubling time is (In 2 )/c, because cT= In 2. The time to multiply 
by e is 1/c. The time to multiply by 10 is (In 10)/c. The time to divide by e is — 1/c, 
when a negative c brings decay. 

3. Find the initial value y 0 if c = 2 and y(l) = 5: 

y(t)=*yae? yields y 0 = y(t)e~ a = 5e~ 2 . 
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cT = In 2 5 10 15 20 yea re 


Hg.4.8 Growth (c > 0) and decay ( c < 0). Doubling time T = (In 2)/c, Future value at 5%, 

Alt we do is run the process backward. Start from 5 and go back to y 0 , With time 
reversed, becomes e~ c \ The product of e 2 and e~ 2 is 1 — growth forward and 
decay backward. 

Equally important is T+ t. Go forward to time T and go on to T + t: 

y(T+t) is y 0 ^ tr+0 which is (4) 

Every step t , at the start or later, multiplies by the same \ This uses the fundamental 
property of exponentials, that e T+t = e T e t . 

EXAMPLE 1 Population growth from birth rate b and death rate d (both constant): 

dyjdt = by — dy= cy (the net rate is c = b — d). 

The population in this model is y^tf* = y 0 e bI e~ dt . It grows when b > d (which makes 
c > 0). One estimate of the growth rate is c = 0.02/year: 

In 2 7 

The earth's population doubles in about T~ —— ^ — = 35 years. 

First comment: We predict the future based on c . We count the past population 
to find c. Changes in c are a serious problem for this model. 

Second comment: is not a whole number. You may prefer to think of bacteria 

instead of people, {This section begins a major application of mathematics to economics 
and the life sciences.) Malthus based his theory of human population on this equation 
f - cy — and with large numbers a fraction of a person doesn’t matter so much. To 
use calculus we go from discrete to continuous. The theory must fail when r is very 
large, since populations cannot grow exponentially forever. Section 6.5 introduces the 
logistic equation y* = cy — by 1 , with a competition term - by 2 to slow the growth. 

Third comment: The dimensions of b , c , d are “l/time.” The dictionary gives birth 
rate = number of births per person in a unit of time. It is a relative rate — people 
divided by people and time. The product ct is dimensionless and makes sense (also 
dimensionless). Some texts replace c by X (lambda). Then 1 jX is the growth time or 
decay time or drug elimination time or diffusion time. 

EXAMPLE 2 Radioactive dating A gram of charcoal from the cave paintings in 
France gives 0.97 disintegrations per minute. A gram of living wood gives 6.68 disin- 
tegrations per minute. Find the age of those Lascaux paintings. 

The charcoal stopped adding radiocarbon when it was burned (at r = 0). The 
amount has decayed to y 0 ^*. In living wood this amount is still y 0 , because cosmic 
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rays maintain the balance. Their ratio is e" = 0.97/6.68. Knowing the decay rate c 
from Question 2 above, we know the present time t : 

, /°.97\ 5568, /0.97\ 

yi " dS '’T7 1 “( l 6^j" l4 ’* X)i,earS ' 

Here is a related problem — the age of uranium. Right now there is 140 times as much 
U-238 as TJ-235. Nearly equal amounts were created, with half-lives of (4.5)10 9 and 
(0.7)10® years. Question: How long since uranium was created? Answer: Find t by 
sy bstituting c = (In })/(4.5) 1 0 9 and C = (In ^)/{0.7) 1 0 9 : 

In 140 

=140 =* ct — Ct = In 140 => t= — — — 6(10 9 ) years. 

c 

EXAMPLE 3 Calculus in Economics: price inflation and the value of money 

We begin with two inflation rates — a continuous rate and an annual rate. For the 
price change Ay over a year, use the annual rate: 

Ay = ( annual rate) times (y) times (At). (5) 

Calculus applies the continuous rate to each instant dt. The price change is dy : 

dy = (continuous rate ) times (y) times (dt). (6) 

Dividing by dt, this is a differential equation for the price: 

dyjdt - (continuous rate) times (y) = .05 y. 

The solution is y 0 e O5t . Set t- 1, Then e 05 % 1.0513 and the annual rate is 5.13%, 

When you ask a bank what interest they pay, they give both rates: 8% and 8.33%. 
The higher one they call the “effective rate.” It comes from compounding (and depends 
how often they do it). If the compounding is continuous, every dt brings an increase 
of dy — and e 0% is near 1.0833. 

Section 6.6 returns to compound interest. The interval drops from a month to a 
day to a second. That leads to (1 4- l/n)\ and in the limit to e . Here we compute the 
effect of 5% continuous interest: 

Future value A dollar now has the same value as e ,05T dollars in T years. 

Present value A dollar in T years has the same value as e~ 05T dollars now. 
Doubling lime Prices double (e 05T ” 2) in T= In 2/.05 ^ 14 years. 

With no compounding, the doubling time is 20 years. Simple interest adds on 20 
times 5% = 100%. With continuous compounding the time is reduced by the factor 
In 2 ^ .7, regardless of the interest rate. 

EXAMPLE 4 In 1626 the Indians sold Manhattan for $24. Our calculations indicate 
that they knew what they were doing. Assuming 8% compound interest, the original 
$24 is multiplied by e 0St . After t = 365 years the multiplier is e 292 and the $24 has 
grown to 115 trillion dollars. With that much money they could buy back the land 
and pay off the national debt. 

This seems farfetched. Possibly there is a big flaw in the model. It is absolutely 
true that Ben Franklin left money to Boston and Philadelphia, to be invested for 200 
years. In 1990 it yielded millions (not trillions, that takes longer). Our next step is a 
new model. 
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Question How can you estimate e 19 2 with a S24 calculator (log but not In)? 
Answer Multiply 29.2 by log 10 e = .434 to get 12.7. This is the exponent to base 10. 
After that base change, we have 10 12 7 or more than a trillion. 

GROWTH OR DECAY WITH A SOURCE TERM 

The equation / = y will be given a new term. Up to now, all growth or decay has 
started from y 0 . No deposit or withdrawal was made later. The investment grew by 
itself — a pure exponential. The new term s allows you to add or subtract from the 
account . It is a “source”— or a “sink” if s is negative. The source s = 5 adds 5dt , 
proportional to it but not to y: 

Constant source: dyjdt = y + 5 starting from y = y 0 . 

Notice y on both sides! My first guess y = failed completely. Its derivative is + 5 
again, which is not y -f 5. The class suggested y = e* + 5t, But its derivative & + 5 is 
still not y+ 5. We tried other ways to produce 5 in dyjdt. This idea is doomed to 
failure. Finally we thought of y = Ae f - 5. That has / = Ae x “ y 4- 5 as required. 
Important: A is not y 0 . Set t — 0 to find y 0 — A - 5. The source contributes 5c* - 5: 

The solution is (y 0 + 5)J - 5. That is the same as y Q e l + 5(e l - 1). 

s= 5 multiplies the growth term e 1 - 1 that starts at zero. y Q e t grows as hefore. 

EXAMPLE 5 dyjdt = - y + 5 has y - (y 0 - 5)c" r + 5, This is y 0 e" r + 5(1 - O* 

That final term from the sotftce is still positive. The other term y 0 e _( decays to zero. 
The limit as t -> oo is y^ — 5. A negative c leads to a steady state y * . 

Based on these examples with c — 1 and c = — 1, we can find y for any c and s, 

EQUATION WITH SOURCE ^ = cy + s starts from y - y 0 at t = 0. (7) 

dt 

The source could be a deposit of s = $1000/year 5 after an initial investment of y 0 = 
$8000. Or we can withdraw funds at s = — $200/year. The units are “dollars per year” 
to match dyjdu The equation feeds in S1000 or removes S200 continuously — not all 
at once. 

Note again that y = e (c +s)r is not a solution. Its derivative is (c + s)y. The combina- 
tion y = + s is also not a solution (but closer). The analysis of / = cy + s will be 

our main achievement for differential equations (in this section). The equation is not 
restricted to finance — far from it — but that produces excellent examples. 

I propose to find y in four ways. You may feel that one way is enough. t The first 
way is the fastest — only three lines — but please give the others a chance. There is no 
point in preparing for real problems if we don’t solve them. 

Solution by Method 1 (fast way) Substitute the combination y = Af* + B. The solu- 
tion has this form — exponential plus constant. From two facts we find A and B: 

the equation / = cy -F s gives cAtf' — c(A + B) + s 

the initial value at t = 0 gives A + B = y 0 . 

tMy class says one way is more than enough. They just want the answer. Sometimes I cave 
in and write down the formula: y is y 0 plus — l)/c from the source term. 



Hg.6.9 





The first line has cAtf' on both sides. Subtraction leaves cB + s = 0, or B = — s/c. 
Then the second line becomes A = y 0 - B = y 0 + (s/c): 

KEY FORMULA y=(v 0 + -V-- or y = y 0 e" + 1). (8) 

\ c) c c 

With s = 0 this is the old solution (no source). The example with c = 1 and 
s = 5 produced (y 0 -I- 5)^ - 5. Separating the source term gives y 0 tf + 5(^ — 1). 

Solution by Method 2 (slow way) The input y 0 produces the output y 0 eF. After t 
years any deposit is multiplied by eF. That also applies to deposits made after the 
account is opened. If the deposit enters at time T, the growing time is only t — T 
Therefore the multiplying factor is only e* (t ~ T K This growth factor applies to the small 
deposit (amount sdT) made between time Tand T + dT. 

Now add up all outputs at time t. The output from y 0 is y 0 eF. The small deposit 
sdTnear time T grows to e^ l ~ T) sdT The total is an integral: 

y(t)=yo<f'+ P <f {t ~ T) sdT (9) 

Jr=o 


This principle of Duhamel would still apply when the source s varies with time. 
Here $ is constant, and the integral divides by c: 


s 



T) dT - 


“ c _ o 


s . s 

— H — ef 1 . 
c c 


( 10 ) 


That agrees with the source term from Method 1, at the end of equation (8). There 
we looked for “exponential plus constant/' here we added up outputs. 

Method 1 was easier. It succeeded because we knew the form Ae ft + B — with 
“undetermined coefficients.” Method 2 is more complete. The form for y is part of 
the output, not the input. The source 5 is a continuous supply of new deposits, all 
growing separately. Section 6.5 starts from scratch, by directly integrating / — cy + s . 


Remark Method 2 is often described in terms of an integrating factor. First write 
the equation as y f - cy — s. Then multiply by a magic factor that makes integration 
possible: 

{/ - cy)e~ ct = se~ ct multiply by the factor e~ ct 


j.-*-*.- -V*-„ 

C 


integrate both sides 
substitute 0 and t 


y = e c, yo + -(e a - 1) 
c 


isolate y to reach formula (8) 


The integrating factor produced a perfect derivative in line 1. 1 prefer Duhamel’s idea, 
that all inputs y 0 and s grow the same way. Either method gives formula (8) for y . 


THE MATHEMATICS OF FINANCE (AT A CONTINUOUS RATE) 


The question from finance is this: What inputs give w hat outputs ? The inputs can 
come at the start by y 0 , or continuously by s , The output can be paid at the end or 
continuously. There are six basic questions, two of which are already answered. 

The future value is y 0 f from a deposit of y 0 To produce y in the future, deposit 
the present value ye~ ci . Questions 3-6 involve the source term s . We fix the continuous 
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rate at 5% per year (c = .05), and start the account from y 0 = 0. The answers come 
fast from equation (8). 

Question 3 With deposits of s = $ 1000/year, how large is y after 20 years? 

y = - 1) = - l) = 20,000(e - 1) » $34,400. 

c .05 

One big deposit yields 20,000e # $54,000, The same 20,000 via s yields $34,400. 

Notice a small by-product (for mathematicians). When the interest rate is c = 0, 
our formula )/c turns into 0/0. We are absolutely sure that depositing 

$ 1000/year with no interest produces $20,000 after 20 years. But this is not obvious 
from 0/0. By FHopitaTs rule we take c-derivatives in the fraction: 

lim ~ = lim — = st. This is (1000)(20) = 20,000. (11) 

c-* 0 C 1 

Question 4 What continuous deposit of s per year yields $20,000 after 20 years? 

20,000 = (e ( O5)<20) - 1) requires s = 155? * 58 2. 

.05 e — 1 

Deposits of $582 over 20 years total $11,640. A single deposit of y 0 = 20,000/e = 
$7,360 produces the same $20,000 at the end. Better to be rich at t = 0. 

Questions 1 and 2 had s~ 0 (no source). Questions 3 and 4 had y 0 = 0 (no initial 
deposit). Now we come to y — 0. In 5, everything is paid out by an annuity. In 6, 
everything is paid up on a loan. 

Question 5 What deposit y G provides $ 1000/year for 20 years? End with y = 0. 

y - yo “ 1) = 0 requires y 0 = — -(1 - e ~ ct ), 
c c 

Substituting s = — 1000, c = .05, t — 20 gives y 0 & 12,640. If you win $20,000 in a 
lottery, and it is paid over 20 years, the lottery only has to put in $12,640. Even less 
if the interest rate is above 5%. 

Question 6 What payments s will clear a loan of y 0 — $20,000 in 20 years? 

Unfortunately, s exceeds $1000 per year. The bank gives up more than the $20,000 
to buy your car (and pay tuition). It also gives up the interest on that money . You pay 
that back too, but you don’t have to stay even at every moment. Instead you repay 
at a constant rate for 20 years. Your payments mostly cover interest at the start and 
principal at the end. After t - 20 years you are even and your debt is y = 0. 

This is like Question 5 {also y = 0), but now we know y 0 and we want s: 

y = y o^ r + -(£" - 1) = 0 requires s = - cy^tf 7(tf" - 1). 
c 

The Joan is y 0 = $20,000, the rate is c = .05/year, the time is t = 20 years. Substituting 
in the formula for s, your payments are $1582 per year. 

Puzzle How is s = $1582 for loan payments related to s = $582 for deposits? 

0 $582 per year -► $20,000 and $20,000 ^ - $1582 per year ^ 0. 
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That difference of exactly 1000 cannot be an accident. 1582 and 582 came from 

€ 1 C — 1 

1000- and 1000 — — with difference 1000 — — - 1000. 

1 e — 1 e — 1 

Whyl Here is the real reason. Instead of repaying 1582 we can pay only 1000 (to 
keep even with the interest on 20,000), The other 582 goes into a separate account. 
After 20 years the continuous 582 has built up to 20,000 (including interest as in 
Question 4), From that account we pay back the loan. 

Section 6*6 deals with daily compounding— which differs from continuous com- 
pounding by only a few cents. Yearly compounding differs by a few dollars. 





34400 

20000 


20000 

12640 


- -1582 


s = -1000 20 


_ 6_2 

" 3” I 


Hg, 6.10 Questions 3-4 deposit s. Questions 5-6 repay loan or annuity. Steady state —s/e. 


TRANSIENTS VS. STEADY STATE 

Suppose there is decay instead of growth. The constant c is negative and y 0 e CI dies 
out. That is the transient ” term, which disappears as t -*■ oo. What is left is the 
“steady state” We denote that limit by y m . 

Without a source, y is zero (total decay). When s is present, y*. = — s/c: 



At this steady state, the source s exactly balances the decay cy. In other words 
cy + s — 0. From the left side of the differential equation, this means dy/dt = 0. There 
is no change , That is why y, x is steady. 

Notice that y x depends on the source and on c — but not on y 0 . 

EXAMPLE 6 Suppose Bermuda has a birth rate h - .02 and death rate d = .03. The 
net decay rate is c — - .01. There is also immigration from outside, of s ~ 1200/year. 
The initial population might be y 0 = 5 thousand or y 0 - 5 million, but that number 
has no effect on y^. The steady state is independent of y 0 . 

In this case y, x = - s/c = 1200/.01 - 120,000. The population grows to 120,000 if 
y 0 is smaller. It decays to 120,000 if y 0 is larger. 

EXAMPLE 7 Newton's Law of Cooling: dy/dt = c(y - y^), (12) 

This is back to physics. The temperature of a body is y. The temperature around it 
is y x . Then y starts at y 0 and approaches y^, following Newton’s rule: 77ie rate is 
proportional to y — y^, The bigger the difference, the faster heat hows. 

The equation has - cy fJC , where before we had s. That fits with y m - - s/c . For the 
solution, replace s by -cy K in formula (8). Or use this new' method: 
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Solution by Method 3 The new idea is to look at the difference y — y^. Its derivative 
is dy/dt, since y ^ is constant. But dy/dt is c(y — yj — this is our equation. The differ- 
ence starts from y 0 - , and grows or decays as a pure exponential: 


j t (y^ y<*) = c(y- y«>) 


has the solution (y - y<J = {y 0 - y ao )^ 1 . 


(13). 


This solves the law of cooling. We repeat Method 3 using the letters s and c: 

+ c) ” + c) * as ^ e solution ^y + ^ = ^y 0 + (14) 


Moving s/c to the right side recovers formula (8). There is a constant term and an 
exponential term. In a differential equations course, those are the tl particular solution" 
and the 4 ‘ homogeneous solution” In a calculus course, it's time to stop. 


EXAMPLE 8 In a 70° room, Newton's corpse is found with a temperature of 90°. A 
day later the body registers 80°. When did he stop integrating (at 98.6°)? 

Solution Here y ^ = 70 and y 0 = 90. Newton’s equation (13) is y = 20^ + 70. Then 
y — 80 at t — 1 gives 20e* — 10. The rate of cooling is c = In Death occurred when 
20^ + 70 = 98.6 or e* = 1.43. The time was t = In 1 . 43 /In £ = half a day earlier. 


6.3 EXERCISES 


Read-through exercises 

If y “ cy then y(t) = o If dy/dt = ly and y ft = 4 then 
y(f) — , b » This solution reaches 8 at i = c , if the dou- 
bling time is Tthen c= <* . If / = 3y and y(l) = 9 then y D 

was e When c is negative, the solution approaches 
t as t oo. 

The constant solution to dy/dt = y + 6 is y — q . The 
general solution is y = A& — 6. If ya = 4 then A = tj i M . The 
solution of dy/dt = cy + s starting from y 0 is y = A# 1 + B = 
i The output from the source s is f An input at 
time T grows by the factor k at time f. 

At 10%, the interest in time dt is dv= > This 
equation yields y(t) = m , with a source term instead of 
y 0> a continuous deposit of s = 4000/year yields y — n 
after 10 years. The deposit required to produce 10,000 in 10 
years is s = o (exactly or approximately). An income of 
4000/year forever (!) comes from y 0 = p . The deposit to 
give 4000/year for 20 years is y 0 = q « The payment rate 
s to clear a loan of 10,000 in 10 years is r 

The solution to / = —3 y + s approaches y« = * 


Solve 1-4 starting from y 0 = 1 and from y 0 = — 1. Draw both 
solutions on the same graph. 


l? = 2t 

dt 


2 ~ = —t 

dt 




, dy 

4 — = — y 
dt * 


Solve 5-8 starting from y 0 = 10. At what time does y increase 
to 100 or drop to 1? 




dt 


„ dy , 

8 * =e 


— 4i 


9 Draw a field of “tangent arrows" for y f = — y, with the 
solution curves y = e~ x and y = — e ~ x . 


10 Draw a direction field of arrows for / = y — 1, with solu- 
tion curves y = e* + 1 and y = 1, 


Problems 11-27 involve y 0 e*. They ask for c or t or y 0 . 

11 If a culture of bacteria doubles in two hours, how many 
hours to multiply by 10? First find c. 

12 If bacteria increase by factor of ten in ten hours, how 
many hours to increase by 100? What is c? 

13 How old is a skull that contains i as much radiocarbon 
as a modern skull? 

14 If a relic contains 90% as much radiocarbon as new mate- 
rial, could it come from the time of Christ? 

15 The population of Cairo grew from 5 million to 10 million 
in 20 years. From j/ — cy find c. When was y = 8 million? 

16 The populations of New York and Los Angeles are grow- 
ing at 1% and 1.4% a year. Starling from 8 million (NY) and 
6 million (LA), when will they be equal? 
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17 Suppose the value of SI in Japanese yen decreases at 2% 
per year. Starling from SI = Y240, when will 1 dollar equal 1 
yen? 

18 The effect of advertising decays exponentially. If 40% 
remember a new product after three days, fin d c , How long 
will 20% remember it? 

19 If y = 1000 at t = 3 and y — 3000 at £ = 4 (exponential 
growth), what was y 0 at f = 0? 

20 If y — 100 at t = 4 and y = 10 at t = 8 (exponential decay) 
when will y = 1? What was y 0 ? 

21 Atmospheric pressure decreases with height according to 
dpjdh “ cp. The pressures at h = 0 (sea level) and h — 20 km 
ar e 1013 and 50 millibars. Find c. Explain why p = 
^1013-50 halfway up at h = 10. 

22 For exponential decay show that y(£) is the square root of 
y( 0) times y(2£). How could you find y(3f) from y(r) and y(2£)? 

23 Most drugs in the bloodstream decay by y' = cy {first- 
order kinetics), (a) The half-life of morphine is 3 hours. Find 
its decay constant c (with units), (b) The half-life of nicotine 
is 2 hours. After a six-hour flight what fraction remains? 

24 How often should a drug be taken if its dose is 3 mg, it is 
cleared at c =. 01/hour, and 1 mg is required in the blood- 
stream at all times? (The doctor decides this level based on 
body size.) 

25 The antiseizure drug dilantin has constant clearance rate 
/ = — a until y = yi . Then / — — ay/yi . Solve for >^£) in two 
pieces from y 0 . When does y reach yi? 

26 The actual elimination of nicotine is multiexponential; y = 
Ae a + Be c *. The first-order equation {djdt — c)y = 0 changes 
to the second-order equation {djdt — c){djdt — Qy = 0, Write 
out this equation starting with y ", and show that it is satisfied 
by the given y. 

27 True or false. If false, say what’s true. 

(a) The time for y = e ci to double is (In 2)/(ln c ). 

(b) If y 1 = cy and z ' = cz then (y + z)' - 2c(y + z\ 

(c) If y = cy and z r = cz then ( yjz )' = 0. 

(d) If / = cy and z ' = Cz then ( yz) f = (c + C)yz. 

28 A rocket has velocity v . Burnt fuel of mass Am leaves at 
velocity v — 7. Total momentum is constant: 

mi? = (m — Am)(r + Au) + Am(r — 7). 

What differential equation connects m to vl Solve for u(m) not 
u(0> starting from v 0 = 20 and m 0 = 4. 


Problems 29-36 are about solutions of y f = cy + s. 

29 Solve y' = 3y+l with y 0 = 0 by assuming y = Ae 3 * + B 
and determining A and B. 


30 Solve / = 8 — y starting from y 0 and y = Ae f + B. 


Solve 31-34 with y 0 = 0 and graph the solution. 


dy 

31 i ^ +1 

dy 


„ d y , 

32^-j-l 

34 — = “V- 1 

* y 


35 (a) What value y = constant solves dyjdt = — 2y + 12? 

(b) Find the solution with an arbitrary constant A , 

(c) What solutions start from y 0 = 0 and y 0 = 10? 

(d) What is the steady state y^? 

36 Choose ± signs in dyjdt = ±3y±6 to achieve the 
following results starting from y 0 = L Draw graphs. 

(a) y increases to oo (b) y increases to 2 

(c) y decreases to — 2 (d) y decreases to — oo 


37 What value y “ constant solves dyjdt = 4 — y? Show that 
y(£) = Ae~* + 4 is also a solution. Find y(l) and y M if y 0 = 3. 

38 Solve y = y + e l from y 0 =0 by Method 2, where the 
deposit e T at time Tis multiplied by d~ T . The total output 

at time t is Xt) = y o e r d~ T dT= . Substitute back to 

check y = y + d. 


39 Rewrite y' = y + d as / - y = e\ Multiplying by e~\ the 

left side is the derivative of . Integrate both sides 

from y 0 = 0 to find y(£). 


40 Solve y = — y + 1 from y 0 = 0 by rewriting as y* + y = 1, 
multiplying by e*, and integrating both sides. 

41 Solve y' = y + 1 from y 0 = 0 by assuming y = Ad + Bt + C. 


Problems 42-57 are about the mathematics of finance. 

42 Dollar bills decrease in value at c = — ,04 per year because 
of inflation. If you hold 51 000, what is the decrease in dt 
years? At what rate s should you print money to keep even? 

43 If a bank offers annual interest of or continuous 
interest of 7i%, which is better? 

44 What continuous interest rate is equivalent to an annual 
rate of 9%? Extra credit; Telephone a bank for both rates 
and check their calculation. 

45 At 100% interest (c — 1) how much is a continuous deposit 
of $ per year worth after one year? What initial deposit y 0 
would have produced the same output? 

46 To have 550,000 for college tuition in 20 years, what gift 
y 0 should a grandparent make now? Assume c = 10%. What 
continuous deposit should a parent make during 20 years? If 
the parent saves s = 51000 per year, when does he or she reach 
550,000 arid retire? 
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47 1 ncome per person grows 3%, the population grows 2%, 

the total income grows . Answer if these are (a) 

annual rates (b) continuous rates, 

48 When dyjdi = cy + 4, how much is the deposit of 4dT at 
time T worth at the later time f? What is the value at r = 2 of 
deposits 4^Tfrom T= 0 to T= 1? 

49 Depositing s = $1000 per year leads to £34,400 after 20 
years (Question 3), To reach the same result, when should you 
deposit $20,000 all at once? 

50 For how long can you withdraw — $5G0/year after 
depositing = $5000 at 8%, before you run dry? 

51 What continuous payment s clears a £1000 loan in 60 
days, if a loan shark charges 1% per day continuously? 

52 You arc the loan shark. What is £1 worth after a year of 
continuous compounding at 1% per day? 

53 You can afford payments of s = $100 per month for 48 
months. If the dealer charges c = 6%, how much can you 
borrow? 

54 Your income is t Q e 2ci per year. Your expenses are E {} e a 
per year, (a) At what future time are they equal? (b) If you 
borrow the difference until then, how much money have you 
borrowed? 

55 If a student loan in your freshman year is repaid plus 20% 
four years later, what was the effective interest rate? 

56 Is a variable rate mortgage with t = .09 +,001r for 20 
years better or worse than a fixed rate of 10%? 

57 At 10% instead of 8%, the S24 paid for Manhattan is 
worth ____ after 365 years. 


Problems 58-65 approach a steady state v r as t x. 

58 If dyidt = — y + 7 what is y /■ What is the derivative of 

y - y ? Then y — y , equals y 0 — v y times . 

59 Graph y{t) when y r = 3y — 12 and y 0 is 

(a) below 4 (b) equal to 4 (c) above 4 

60 The solutions to dyidt = c[y — 12) converge to = 

provided c is 

61 Suppose the time unit in dyidt = cy changes from minutes 
to hours, I low does the equation change? How does dyidt — 
— y + 5 change? How does \\ change? 

62 True or false, when y, and both satisfy y 1 = cy + s. 

(a) The sum y= y 3 -t y 2 also satisfies this equation. 

(b) The average y = ■?( y, + v 2 ) satisfies the same equation, 

(c) The derivative y = y\ satisfies the same equation. 

63 If Newton's coffee cools from 80 to 60 in 12 minutes 

(room temperature 20 ), find c\ When was the coffee at 100 ? 

64 If y n = 100 and rf 1 ) = 90 and y(2) = 84, w hat is y , ? 

65 If y i} = 100 and y(l) = 90 and y(2) = 81, what is y\ ? 

66 To cool down coffee, should you add milk now or later? 
The coffee is at 70 J C, the milk is at 10 , the room is at 20\ 

(a) Adding 1 part milk to 5 parts coffee makes it 60 . With 

_r, = 20 , the white coffee cools to y(f j = . 

(b) Thc black coffee cools to y,(0 = . The milk 

warms to yj/) - , Mixing at time i gives 

tfve + yj 6 = , 



6.4 Logarithms 



We have given first place to e x and a lower place to In x. In applications that is 
absolutely correct. But logarithms have one important theoretical advantage (plus 
many applications of their own). The advantage is that the derivative of In x is 1/x, 
whereas the derivative of e x is e x . We can't define e x as its own integral, without 
circular reasoning. But we can and do define In x (the natural logarithm) as the 
integral of the hL — 1 power" which is 1 lx: 



Note the dummy variables, first x then u. Note also the live variables, first x then v. 
Especially note the lower limit of integration, which is 1 and not 0. The logarithm is 
the area measured from 1. Therefore In 1 = 0 at that starling point — as required. 
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Earlier chapters integrated all powers except this — 1 power/' The logarithm is 
that missing integral. The curve in Figure 6.1 1 has height y = 1/x — it is a hyperbola* 
At x — Q the height goes to infinity and the area becomes infinite: log 0“ - oo. 
The minus sign is because the integral goes backward from 1 to 0. The integral 
does not extend past zero to negative x* We are defining In x only for x > Q.t 





Fig. 6.1 1 Logarithm as area . Neighbors In a + In b = In ab. Equal areas: — In I = In 2 = 4 In 4. 


With this new approach, In x has a direct definition. It is an integral (or an area). 
Its two key properties must follow from this definition. That step is a beautiful 
application of the theory behind integrals. 


Property l : In ab = In a + In b * The areas from 1 to a and from a to ab combine into 
a single area (1 to ab in the middle figure): 


Neighboring areas: 


Ji 


- dx + 
x 



m ah i 

« - dx. 

Jt * 


( 2 ) 


The right side is In ab, from definition (1). The first term on the left is In a , The 
problem is to show that the second integral (a to ab) is In b: 

m ab i rt j 

i -dx = -du - In b. (3) 

J- * Ji w 


We need u = 1 when x = a (the lower limit) and u = b when x = ab (the upper limit)* 
The choice u~xja satisfies these requirements* Substituting x = au and dx = a du 
yields dx/x = dtt/u * Equation (3) gives In b , and equation (2) is In a + In b = In ab * 

Property 2: In b n — n In b. These are the left and right sides of 



— n 


rb 


du. 


(4) 


This comes from the substitution x = if. The lower limit x = f corresponds to u - 1, 
and x = b n corresponds to u = b. The differential dx is nu n ~ l du * Dividing by x = if 
leaves dxjx = n duju * Then equation (4) becomes In b n — n In b. 

Everything comes logically from the definition as an area* Also definite integrals: 


EXAMPLE 1 

EXAMPLE 2 


Compute 

Compute 


*3* 

J * 



t 


* i 

.i 



Solution: In 3x - In x — In — = In 3. 

x 

Solution: In 1 - In .1 = In 10. (Why?) 


tThe logarithm of —1 is ni (an imaginary number). That is because e m = — 1. The logarithm 
of i is also imaginary — it is In general, logarithms are complex numbers* 
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re* j 

EXAMPLE 3 Compute - du. Solution: In e 2 - 2, The area from 1 to e 1 is 2. 

Ji « 

Remark While working on the theory this is a chance to straighten out old debts. 
The book has discussed and computed (and even differentiated) the functions e x and 
b x and x\ without defining them properly. When the exponent is an irrational number 
like jt, how do we multiply e by itself n times'! One approach (not taken) is to come 
closer and closer to n by rational exponents like 22/7. Another approach (taken now) 
is to determine the number e n = 23.1 ... by its logarithm. f Start with e itself: 

e is (by definition) the number whose logarithm is 1 

e n is (by definition) the number whose logarithm is n. 

When the area in Figure 6.12 reaches 1, the basepoint is e . When the area reaches n, 
the basepoint is e K . We are constructing the inverse function (which is e x ). But how 
do we know that the area reaches n or 1000 or -1000 at exactly one point? (The 
area is 1000 far out at e 1000 . The area is - 1000 very near zero at e~ loo °.) To define 
e we have to know that somewhere the area equals If 

For a proof in two steps, go back to Figure 6.1 Ic, The area from 1 to 2 is more 
than \ (because 1/x is more than \ on that interval of length one). The combined area 
from 1 to 4 is more than 1. We come to area = 1 before reaching 4. (Actually at 
£> = 2,718.,..) Since 1/x is positive, the area is increasing and never comes back to 1. 
To double the area we have to square the distance. The logarithm creeps upwards: 

* , In x 

In x -* oo but ► 0. (5) 

x 

The logarithm grows slowly because e x grows so fast (and vice versa— they are 
inverses). Remember that e* goes past every power x w . Therefore In x is passed by 
every root x 1/n , Problems 60 and 61 give two proofs that (In x)/x 1/n approaches zero. 
We might compare In x with x /x, At x - 10 they are dose (2.3 versus 3.2), But out 
at x = e 10 the comparison is 10 against e and In x loses to ^/x. 





Fig. 6.12 Area is logarithm of basepoint. 


Fig, 6,13 In x grows more slowly 
than x. 


t Chapter 9 goes on to imaginary exponents, and proves the remarkable formula e 71 *— — 1. 
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APPROXIMATION OF LOGARITHMS 

The limiting cases In 0 = - oo and In co = + oo are important. More important are 
logarithms near the starting point In 1 = 0. Our question is: What is In (1 + x) for x 
near zero' ? The exact answer is an area. The approximate answer is much simpler. 
If x (positive or negative) is small, then 

In (1 + x) ss x and e * « 1 + x. (6) 

The calculator gives In 1.01 = .0099503. This is close to x = ,01. Between 1 and 1 + x 
the area under the graph of 1/x is nearly a rectangle. Its base is x and its height is 1. 
So the curved area In (1 + x) is close to the rectangular area x, Figure 6.14 shows 
how a small triangle is chopped off at the top. 

The difference between .0099503 (actual) and .01 (linear approximation) is 
— .0000497. That is predicted almost exactly by the second derivative: % times (Ax) 2 
times (In x)" is J(.0l) 2 {— 1) = - .00005. This is the area of the small triangle! 

ln(l + x) % rectangular area minus triangular area = x - jx 2 . 

The remaining mistake of .0000003 is close to ^x 3 (Problem 65). 

May I switch to e 1 ? Its slope starts at e° = 1. so its linear approximation is I + x. 
Then In (e*) a In (1 + x) m x. Two wrongs do make a right: In (e x ) - x exactly. 

The calculator gives e 01 as 1.0100502 (actual) instead of 1.01 (approximation). The 
second-order correction is again a small triangle: jx 2 = .00005. The complete series 
for In (I + x) and e x are in Sections 10. 1 and 6.6: 

In (1 + x) = x - x 2 /2 + x 3 /3 - ... e s = 1 + x + x 2 /2 + x 3 /6 + .... 

DERIVATIVES BASED ON LOGARITHMS 


Logarithms turn up as antiderivatives very often. To build up a collection of integrals, 
we now differentiate In w(x) by the chain rule. 



The slope of In x was hard work in Section 6.2. With its new definition (the integral 
of 1/x) the work is gone. By the Fundamental Theorem, the slope must be 1/x. 

For In u(x) the derivative comes from the chain rule. The inside function is u, the 
outside function is In. (Keep u > 0 to define In u.) The chain rule gives 

d I 1 d 3 

— lncx= — c=-(!) — In x 3 = 3x 2 /x 3 = - 

dx cx x dx x 

4- In (x 2 + 1) = 2x/(x 2 +1) ~ In cos x = — = - tan x 

dx dx cos x 

4~ In e x = e*/e* =1 ~ In (In x) = — ! — -. 

dx dx In x x 

Those are worth another look, especially the first. Any reasonable person would 
expect the slope of In 3x to be 3/x. Not so. The 3 cancels, and In 3x has the same 
slope as In x. (The real reason is that In 3x = In 3 + in x.) The antiderivative of 3/x is 
not In 3x but 3 In x, which is In x 3 . 



0 x 

Fig. 6.14 




6 Exponentiate and Logarithms 


Before moving to integrals, here is a new method for derivatives: logarithmic differen- 
tiation or LD. It applies to products and powers. The product and power rules are 
always available, but sometimes there is an easier way. 

Main idea: The logarithm of a product p(x) is a sum of logarithms * Switching to 
In p> the sum rule just adds up the derivatives. But there is a catch at the end, as you 
see in the example. 


EXAMPLE 4 Find dpjdx if p(x) — x*yfx - 1 - Here In p(x) = x In x + i ln(x - 1). 

Take the derivative of In p : — ^ = x * - + In x + 

F pdx x 2(x l) 


Now multiply by p(x): 


= p| 1 + In x + 
dx * \ 2(x - 1) 


The catch is that last step. Multiplying by p complicates the answer. This can’t be 
helped — logarithmic differentiation contains no magic. The derivative of p =fg is the 
same as from the product rule: In p = In/ + In g gives 


P -= f - + S - and 
P f R 


P=p( f + - I =/'£ +fg'- 
\J g. 


0) 


For p = xe x sin x, with three factors, the sum has three terms: 


in p — In x t x + In sin x and p — p 


1 cos x 

- + 1 + 

x sin x 


We multiply p times p f /p (the derivative of In p). Do the same for powers: 


EXAMPLE 5 p= x l; ' x => In p = - In x => -f- = p 

x dx 


1 In x 

72 ? — 


EXAMPLE 6 p= x 


_ .JllX 


In p = (In x) 2 => / = p\ 
ax 


2 In 


EXAMPLE 7 /> = a 1 lni => In p = 2- In a = 1 ^ ^ = 0 (!) 

In x dx 


INTEGRALS BASED ON LOGARITHMS 


Now comes an important step. Many integrals produce logarithms. The foremost 
example is l/w whose integral is In x. In a certain way that is the only example, but 
its range is enormously extended by the chain rule. The derivative of In u(x) is u'-u, 
so the integral goes from w7 it hack to In u: 


u(x) 


or equivalently 



u 


Try to choose u(x) so that the integral contains dujdx divided by u. 


J 


dx 

a + 7 


' dx 
c x 4- 7 


- ln|cx t- 7| 
c 


EXAMPLES 


ln|x + 7| 
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Final remark When u is negative, In u cannot be the integral of 1/u. The logarithm 
is not defined when u < 0. But the integral can go forward by switching to — u: 


dujdx 

dx 

u 


'-dujdx 
— u 


dx = ln(— a). 


( 8 ) 


Thus ln{— u) succeeds when In u fails.! The forbidden case is u — 0. The integrals In u 
and ln( — u), on the plus and minus sides of zero, can be combined as ln|u|. Every 
integral that gives a logarithm allows u < 0 by changing to the absolute value |u|: 

J ^ = [ln|x|]_* =ln 1 — In e J ^^ = [ln|x — = ln 1 — In 3, 

The areas are —1 and —In 3. The graphs of 1/x and l/(x — 5) are below the x axis. 
We do not have logarithms of negative numbers, and we will not integrate 1 /(jc — 5) 
from 2 to 6. That crosses the forbidden point x = 5, with infinite area on both sides. 


The ratio du/u leads to important integrals. When u = cos x or u = sin x, we are 
integrating the tangent and cotangent. When there is a possibility that u < 0, write 
the integral as ln|u|. 


I 

I 


tan x dx = 


cot x dx = 


sin x 
cos x 


cos X 
sin x 


dx = —In |cos 


dx — In |sin x| 


*1 


X dx 
x 2 + 7 

dx 

x In x 


= iln(* 2 + 7) 
= In |ln x| 


Now we report on the secant and cosecant. The integrals of 1/cos x and 1/sin x 
also surrender to an attack by logarithms — based on a crazy trick: 


I 

I 


sec x dx = 


esc x dx = 


sec x 


esc x 


/ sec x + tan x\ 
\sec x + tan xj 

/esc x — cot x\ 
\csc x — cot xj 


dx — In |sec x + tan x|. 
dx — In |csc x — cot x|. 


( 9 ) 

( 10 ) 


Here u — sec x + tan x is in the denominator; dujdx — sec x tan x + sec 2 x is above it. 
The integral is In ju|. Similarly (10) contains dujdx over u = csc x — cot x. 

In closing we integrate In x itself. The derivative of x In x is In x + 1. To remove 
the extra 1 , subtract x from the integral: J In x dx — x In x - x. 

In contrast, the area under l/(lnx) has no elementary formula. Nevertheless it is 
the key to the greatest approximation in mathematics— the prime number theorem . 
The area j b a dx/In x is approximately the number of primes between a and b. Near e 1000 , 
about 1/1000 of the integers are prime. 


6.4 EXERCISES 


Read-through questions 

The natural logarithm of x is Q . This definition leads 
to lnxv= and lnx*= c . Then e is the number 

whose logarithm (area under 1/x curve) is d Similarly 
e 1 is now defined as the number whose natural logarithm is 


As x — * oo. In x approaches * But the ratio 
(In x)j^fx approaches a The domain and range of In x 
are n . 

The derivative of In x is < The derivative of ln(l + x) 


+The integral of 1/x (odd function) is In |x| (even function). Stay clear of x — 0. 
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is 1 . The tangent approximation to ln(l + x) at x = 0 is 

fc . The quadratic approximation is The quadratic 

approximation to e* is m . 

The derivative of In u(x) by the chain rule is " . Thus 

(In cos x) 1 - o . An antiderivative of tan x is P . The 
product p-xe 3j has In p = q The derivative of this equ- 
ation is g . Multiplying by p gives p f - * , which is 

LD or logarithmic differentiation. 

The integral of u'(x)/u(jc) is _t_. The integral of 
2x/fx z + 4) is » . The integral of 1/cx is y . The integ- 

ral of l/(cf + s ) is w . The integral of 1/cos x, after a trick, 
is x , We should write In |x| for the antiderivative of 1/x, 
since this allows v . Similarly fdu/u should be written 
z 


31 y = i** x) 32 y = x* 

33 y = x f *" ) 34 

35 y = x _1/lDJ 36 y = e~* nj 


Evaluate 37-42 by any method. 




1 dx 

2 * 




„ [sec^x + sec x tan x 
42 dx 


sec x + tan x 


Find the derivative dyjdx in 1-10, 

1 y = ln(2x) 2 y = ln(2x + l) 


3 y^(ln xy l 
5 y = x In x — x 
7 y = ln(sin x) 

9 y = 7 In 4x 


4 y = (In x)/x 
<5 y = logic* 

0 y = ln(ln x) 
10 y = ln((4x) 7 ) 


Find the indefinite (or definite) integral in 11-24. 


“JS 

12 

f 1 dx 

13 J.rs 

14 

is r ** 

Jo X 2 + 1 

16 

17 r * 

Jj x(ln x) 

18 

13 fcosxrfx 

J Sin X 

20 

21 j* tan 3x dx 

22 

22 r (in x) 2 dx 

24 


dx 


xfln x) 2 


tan x dx 


dx 


xfln x)(ln In x) 
26 Graph y = In (sin 


25 Graph y = ln(I +x) 

Compute dyjdx by differentiating In y. Th 

27 y = y/PTl 28 y — y/x 

29 y = 30 y = x - ' 


Verify the derivatives 43-46, which give l&eful antiderivatives; 





Estimate 47-50 to linear accuracy, then quadratic accuracy, 
by e* a* 1 + x + ix 2 . Then use a calculator. 

47 ln(l.l) 48 e- 1 49 In(.99) 50 e 2 

51 Compute lim + ^ 52 Compute lim 

x-*0 X x-*0 X 

53 Compute lim +x ^ 54 Compute lim - ■ 
x-*0 X jt-+ 0 x 

55 Find the area of the “hyperbolic quarter-circle” enclosed 
by x = 2 and y = 2 above y = 1/x. 

56 Estimate the area under y = 1/x from 4 to 8 by four upper 
rectangles and four lower rectangles. Then average the 
answers (trapezoidal rule). What is the exact area? 

57 Why is ^ \ + - near In n? Is it above or below? 

2 3 tt 

58 Prove that In x ^ 2( y fx — 1) for x > L Compare the integ- 
rals of 1/r and 1 jy/t, from 1 to x, 

59 Dividing by x in Problem 58 gives (In x)/x < 2(^/x — l)/x. 
Deduce that (In x)/x -*0 as x-*> ao. Where is the maximum 
of (In x)/x? 

60 Prove that Gnx)/x 1/Jt also approaches zero. (Start with 
(In x v *)fx lltl -* 0.) Where is its maximum? 
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61 For any power n, Problem 6.2.59 proved e x > x n for large 

x. Then by logarithms, x > n In x. Since (In x)/x goes below 
1/n and stays below, it converges to . 

62 Prove that y In y approaches zero as y -+ 0, by changing 
y to 1/x. Find the limit of y* (take its logarithm as y -> 0). 
What is . I 1 on your calculator? 

63 Find the limit of In x/log l0 x as x oo. 

64 We know the integral J* t H _1 dt = [t*/^ = (x h - \)ih. 

Its limit as h -> 0 is . 

65 Find linear approximations near x = 0 for e~ x and 2 X . 

66 The x 3 correction to ln(l + x) yields x — \x l + ^x 3 . Check 
that In 1.01 ^.0099503 and find In L02. 

67 An ant crawls at 1 fool/seeond along a rubber band whose 
original length is 2 feet. The band is being stretched at 1 
foot/second by pulling the other end. At what time T, if ever, 
does the ant reach the other end? 

One approach: The band's length at time t is t + 2, Let y(r) 
be the fraction of that length which the ant has covered, and 
explain 

(a)y = l/(f + 2) (b ) y = ln(r + 2) — In 2 (c)T = 2e-2. 

68 If the rubber band is stretched at 8 feet/second, when if 
ever does the same ant reach the other end? 

69 A weaker ant slows down to 2/(f 4- 2) fcet/second, so y H = 
2/(f + 2) 2 . Show that the other end is never reached. 


70 The slope of p = x x comes two ways from In p = x In x: 

1 Logarithmic differentiation (LD): Compute (In pj and 
multiply by p. 

2 Exponential differentiation (ED): Write x x as e* lnx y 
take its derivative, and put back x J , 

71 If p = 2 X then In p = * LD gives p f = (p)(ln p)’ = 

r ED gives p = e and then p* = . 

72 Compute In 2 by the trapezoidal rule and/or Simpson's 
rule, to get five correct decimals. 

73 Compute In 10 by either rule with Ax = 1, and compare 
with the value on your calculator. 

74 Estimate 1 /In 90,000, the fraction of numbers near 90,000 
that are prime. (879 of the next 10,000 numbers arc actually 
prime.) 

75 Find a pair of positive integers for which x v = y x . Show 
how to change this equation to (In x)/x = (In y)/y. So look for 
two points at the same height in Figure 6. 1 3. Prove that you 
have discovered all the integer solutions, 

*76 Show that (In x)/x = (In y)jy is satisfied by 



with f / 0. Graph those points to show the curve x y — y 4 It 
crosses the line y = x at x = , where i -*■ oo. 



6.5 Separable Equations Including the Logistic Equation 

This section begins with the integrals that solve two basic differential equations: 

dy dy 

— = cy and — = cy + s. (1) 

dr dt 

We already know the solutions , What we don't know is how to discover those solu- 
tions, when a suggestion “ try has not been made. Many important equations, 

including these, separate into a y-integral and a Mntegral. The answer comes directly 
from the two separate integrations. When a differential equation is reduced that far — 
to integrals that we know or can look up — it is solved. 

One particular equation will be emphasized. The logistic equation describes the 
speedup and slowdown of growth. Its solution is an S-curve, which starts slowly, 
rises quickly, and levels off. (The 1990’s are near the middle of the S, if the 
prediction is correct for the world population.) S-curves are solutions to nonlinear 
equations, and we will be solving our first nonlinear model. It is highly important 
in biology and all life sciences. 
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SEPARABLE EQUATIONS 


The equations dyjdt ~ cy and dy/dt ~cy + s {with constant source s) can be solved 
by a direct method. The idea is to separate y from t: 

dy dy 

— = c dt and - — 7 - 7 - = c dt . ( 2 ) 

y y + {sic) 

All / s are on the left side. All Ts are on the right side (and c can be on either side). 
This separation would not be possible for dyjdt — y + t. 

Equation (2) contains differentials. They suggest integrals. The t-integrals give ct 
and the y-integrals give logarithms: 

In y = ct -h constant and In^y + ^ = cf + constant. (3) 

The constant is determined by the initial condition. At f = 0 we require y = and the 

In^ +^j = ct + In^o + 0 (4) 

is a formula for y itself, not its logarithm, so 
is y): 

, + £ = (5) 

It is wise to substitute y back into the differential equation, as a check. 

This is our fourth method for / = cy + s. Method 1 assumed from the start that 
y = A£ l + B. Method 2 multiplied all inputs by their growth factors e cit ~ T] and added 
up outputs. Method 3 solved for y — . Method 4 is separation of variables (and all 

methods give the same answer). This separation method is so useful that we repeat 
its main idea, and then explain it by using it. 

To solve dyjdt = ufyMf), separate dyju(y) from v(t)dt and integrate both sides : 


right constant will make that happen: 

In y = ct + In y 0 and 

Then the final step isolates y. The goal 
take the exponential of both sides (e* nv 

y = y 0 €* and 


dy/u(y) = 


v{t)dt + C. 


( 6 ) 


Then substitute the initial condition to determine C, and solve for y(t). 


EXAMPLE 1 dyjdt = y 2 separates into dy/y 2 - dt. Integrate to reach -\}y = t + C, 
Substitute t = 0 and y = y 0 to find C - - l/y 0 - Now solve for y: 


_ 1 _ J_ 

y yo 


and 


y = 


yo 




This solution blows up (Figure 6.15a) when t reaches l/y 0 + If the bank pays interest 
on your deposit squared (/ = y 2 ), you soon have all the money in the world. 


EXAMPLE 2 dyjdt = ty separates into dyjy — t dt. Then by integration In y = \t 2 + C. 
Substitute t = 0 and y=y 0 to find C=lny 0 * The exponential of £t 2 -l- In y 0 gives 
y = y 0 e‘ 2/2 . When the interest rate is c — f, the exponent is t 2 /2. 


EXAMPLE 3 dyjdt = y -\- 1 is not separable. Method 1 survives by assuming y = 
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y y 



dy dy y dy dt 

Hg. 6.15 The solutions to separable equations = y and — = n- or — = n — t 


Ae' + B + Dt — with an extra coefficient D in Problem 23. Method 2 also succeeds — 
but not the separation method, 

EXAMPLE 4 Separate dy/dt — ny/t into dyjy = n dt/L By integration In y = n In t + C. 
Substituting r = G produces InO and disaster. This equation cannot start from time 
zero (it divides by t), However y can start from at t ~ 1, which gives C = In y 1 . The 
solution is a power function y = 

This was the first differential equation in the book (Section 2.2). The ratio of dyjy 
to dtjt is the “ elasticity^ in economics. These relative changes have units like 
dollars/dollars — they are dimensionless, and y = f has constant elasticity n. 

On log-log paper the graph of In y = n In t + C is a straight line with slope n, 

THE LOGISTIC EQUATION 

The simplest model of population growth is dy/dt = cy. The growth rate c is the birth 
rate minus the death rate. If c is constant the growth goes on forever — beyond the 
point where the model is reasonable. A population can’t grow all the way to infinity! 
Eventually there is competition for food and space, and y = must slow down. 

The true rate c depends on the population size y , It is a function dy) not a constant. 
The choice of the model is at least half the problem: 

Problem in biology or ecology: Discover c(y). 

Problem in mathematics: Solve dy/dt = c(y)y. 

Every model looks linear over a small range of y* s — but not forever. When the rate 
drops off, two models are of the greatest importance. The Michaelis-Menten equation 
has c(y) = cj{y + K). The logistic equation has c(y) ~ c — by, It comes first. 

The nonlinear effect is from “interaction.” For two populations of size y and z, the 
number of interactions is proportional to y times z . The Law of Mass Action produces 
a quadratic term byz. It is the basic model for interactions and competition. Here we 
have one population competing within itself, so z is the same as y. This competition 
slows down the growth, because —by 2 goes into the equation. 

The basic model of growth versus competition is known as the logistic equation: 

dy/dt = cy— by 2 . (7) 

Normally b is very small compared to c. The growth begins as usual (close to e ct ). 
The competition term by 2 is much smaller than cy> until y itself gets targe , Then by 2 
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(with its minus sign) slows the growth down. The solution follows an S-curve that 
we can compute exactly. 

What are the numbers b and c for human population? Ecologists estimate the 
natural growth rate as c = .029/year. That is not the actual rate, because of b. About 
1930, the world population was 3 billion. The cy term predicts a yearly increase of 
(.Q29)(3 billion) = 87 million. The actual growth was more like dyjdt = 60 million/year. 
That difference of 27 million/year was by 2 \ 

27 million/year = b { 3 billion) 2 leads to b = 3 * 10‘ 12 /year. 

Certainly b is a small number (three trillionths) but its effect is not small. It reduces 
87 to 60. What is fascinating is to calculate the steady state , when the new term by 2 
equals the old term cy. When these terms cancel each other, dyjdt — cy - by 2 is zero. 
The loss from competition balances the gain from new growth: cy = by 2 and y = c/b. 
The growth stops at this equilibrium point — the top of the S-curve: 

c .029 

y lX) — - = — — IQ 12 fts 10 billion people. 
b 3 

According to Verhulst’s logistic equation, the world population is converging to 10 
billion. That is from the model. From present indications we are growing much faster. 
We will very probably go beyond 10 billion. The United Nations report in Section 3.3 
predicts 11 billion to 14 billion. 

Notice a special point halfway to y w = cjb. (In the model this point is at 5 billion.) 
It is the inflection point where the S-curve begins to bend down. The second derivative 
d 2 yjdt 2 is zero. The slope dyjdt is a maximum. It is easier to find this point from the 
differential equation (which gives dyjdt) than from y. Take one more derivative: 

y" = (cy - by 2 )' = cy' - 2byy' = (c - 2by)y'. {8) 

The factor c - 2 by is zero at the inflection point y = c/2b, halfway up the S-curve. 

THE S-CURVE 


The logistic equation is solved by separating variables y and t: 


dyjdt — cy — by 2 becomes 


dyj(cy - by 2 ) = 



( 9 ) 


The first question is whether we recognize this y-integral. No. The second question 
is whether it is listed in the cover of the book. No . The nearest is j dxj{a 2 — x 2 ), which 
can be reached with considerable manipulation (Problem 21). The third question is 
whether a general method is available. Yes . “Partial fractions” is perfectly suited to 
1 j(cy — by 3 }, and Section 7.4 gives the following integral of equation (9): 


In' — = ct + C and then In — — — = C. (10) 

c- by c — by 0 


That constant C makes the solution correct at t = 0. The logistic equation is integ- 
rated, but the solution can be improved. Take exponentials of both sides to remove 
the logarithms: 


c- by c — by 0 


( 11 ) 


This contains the same growth factor e ct as in linear equations. But the logistic 
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equation is not linear — it is not y that increases so fast- According to (11), it is 
yj{c - by) that grows to infinity. This happens when c - by approaches zero. 

The growth stops at y — c/b . That is the final population of the world (10 billion?). 
We still need a formula for y. The perfect S -curve is the graph of y = 1/(1 + e _f ). It 
equals 1 when r = oo, it equals j when r = 0, it equals 0 when t = — cc. It satisfies 
y , = y — y 2 , with c = b=l. The general formula cannot be so beautiful, because it 
allows any c, b t and y 0 . To find the S-curve, multiply equation (11) by c — by and 
solve for y: 

y=— C - ■■ ■■ or y — — . (12) 

b + e~ ci (c-by 0 )ly 0 b + de ct 

When t approaches infinity, e~ ct approaches zero. The complicated part of the for- 
mula disappears. Then y approaches its steady state c/fr, the asymptote in Figure 6.16. 
The S-shape comes from the inflection point halfway up. 




Fig. 6.16 The standard S-curve y = 1/(1 + e *). The population S-curve (with prediction). 


Surprising observation: z = 1/y satisfies a linear equation . By calculus z = — fly 2 - So 


z 


- cy + by 2 


— - 4- h = " €2+ h. 

y 


(13) 


Year 

US 

Population 

Model 

1790 

3,9 = 

3.9 

1800 

5,3 

5.3 

1810 

7.2 

7.2 

1820 

9.6 

9.8 

1830 

12,9 

13.1 

1840 

17.1 

17.5 

1850 

23,2 = 

23.2 

1 1860 

31,4 

30,4 

: 1870 

38.6 

39.4 ; 

1880 

50.2 

50.2 

1890 

62.9 

62.8 

1900 

76.0 

76,9 

1910 

92.0 = 

92,0 

1920 

105,7 

107.6 

1930 

122.8 

123.1 

1 1940 

131,7 / 

136.7 

1950 

150.7 

149.1 


This equation z = — cz + h is solved by an exponential e ct plus a constant: 


z = Ae" ct + 


b 


c 



(14) 


Turned upside down, y= 1 /z is the S-curve (12). As z approaches fr/t, the S-curve 
approaches cjh. Notice that z starts at l/>’ 0 . 


EXAMPLE 1 (United Stoles population) The table shows the actual population and 
the model. Pearl and Reed used census figures for 1790, 1850, and 1910 to compute 
c and h. In between, the fit is good but not fantastic. One reason is war — another is 
depression. Probably more important is immigration. i In fact the Pearl-Reed steady 
state c/b is below 200 million, which the US has already passed. Certainly their model 
can be and has been improved. The 1990 census predicted a stop before 300 million . 
For constant immigration s we could still solve y' = cy — by 2 + s by partial fractions — 
but in practice the computer has taken over. The tabic comes from Braun's book 
Differential Equations (Springer 1975), 


flmmigration does not enter for the world population model (at least not yet). 
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Remark For good science the y 1 term should be explained and justified. It gave a 
nonlinear model that could be completely solved, but simplicity is not necessarily 
truth. The basic justification is this: In a population of size y, the number of encounters 
is proportional to y 2 . If those encounters are fights, the term is — by 1 . If those 
encounters increase the population, as some like to think, the sign is changed. There 
is a cooperation term + by 2 , and the population increases very fast. 

EXAMPLE 5 / = cy + by 2 : y goes to infinity in a finite time ♦ 

EXAMPLE 6 y - ~ dy + by 2 : y dies to zero if y 0 < dfb. 

In Example 6 death wins, A small population dies out before the cooperation by 1 
can save it. A population below djb is an endangered species. 

The logistic equation can't predict oscillations — those go beyond dyfdt ~/(y). 

The y line Here is a way to understand every nonlinear equation y = /{>/ Draw a 
“y line” Add arrows to show the sign of/(y). When / = J\y) is positive, y is increasing 
(it follows the arrow to the right). When /is negative, y goes to the left. When /is zero, 
the equation is / = 0 and y is stationary: 



The arrows take you left or right, to the steady state or to infinity. Arrows go toward 
stable steady states. The arrows go away , when the stationary point is unstable. The 
y line shows which way y moves and where it stops. 

The terminal velocity of a falling body is v ^ = J^g in Problem 6.7,54. For f(y) = 
sin y there are several steady states: 


<■ 





fulling body: dvjdt = g ~ i 2 


dyfdt = sin y 


EXAMPLE 7 Kinetics of a chemical reaction mA + nB pC ♦ 

The reaction combines m molecules of A with n molecules of B to produce p 
molecules of C, The numbers m, n , p are 1, I, 2 for hydrogen chloride: H 2 + Cl 2 = 
2 HC1. The Law of Mass Action says that the reaction rate is proportional to the 
product of the concentrations [A] and [JJ]. Then [A] decays as [C] grows: 

d\fAf'dt= -r|Xl[fl] and d[C]/dr= + k[/4][B]. (15) 

Chemistry measures r and k . Mathematics solves for [A] and [C]. Write y for the 
concentration [C], the number of molecules in a unit volume. Forming those y 
molecules drops the concentration [A] from a 0 to a 0 - (mjp)y. Similarly [B] drops 
from b 0 to 6 0 — (n/p)y. The mass action law (15) contains y 2 : 


dy 

dt 


= k\ 


m 

P . 


fro 


( 16 ) 
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This fits our nonlinear model (Problem 33-34). We now find this same mass action 
in biology. You recognize it whenever there is a product of two concentrations, 

THE MM EQUATION dyfdt= - cyfty+ K) 


Biochemical reactions are the keys to life. They take place continually in every living 
organism. Their mathematical description is not easy! Engineering and physics go 
far with linear models, while biology is quickly nonlinear. It is true that y f — cy is 
extremely effective in first-order kinetics (Section 6,3), but nature builds in a nonlinear 
regulator. 

It is enzymes that speed up a reaction. Without them, your life would be in slow 
motion. Blood would take years to clot. Steaks would take decades to digest. Calculus 
would take centuries to learn. The whole system is awesomely beautiful — DNA tells 
amino acids how to combine into useful proteins, and we get enzymes and elephants 
and Isaac Newton. 

Briefly, the enzyme enters the reaction and comes out again. It is the catalyst. Its 
combination with the substrate is an unstable intermediate, which breaks up into a 
new product and the enzyme (which is ready to start over). 

Here are examples of catalysts, some good and some bad. 


1. The platinum in a catalytic converter reacts with pollutants from the car engine. 
(But platinum also reacts with lead — ten gallons of leaded gasoline and you 
can forget the platinum.) 

2. Spray propellants (CFCs) catalyze the change from ozone (0 3 ) into ordinary 
oxygen (0 2 ). This wipes out the ozone layer— our shield in the atmosphere. 

3. Milk becomes yoghurt and grape juice becomes wine. 

4. Blood clotting needs a whole cascade of enzymes, amplifying the reaction at 
every step. In hemophilia — the “Czar’s disease” — the enzyme called Factor VIII 
is missing. A small accident is disaster; the bleeding won’t stop. 

5. Adolph’s Meat Tenderizer is a protein from papayas. It predigests the steak. 
Tbe same enzyme (chymopapain) is injected to soften herniated disks. 

6 . Yeast makes bread rise. Enzymes put the sour in sourdough. 

Of course, it takes enzymes to make enzymes. The maternal egg contains the material 
for a cell, and also half of the DNA, The fertilized egg contains the full instructions. 


We now look at the Michaelis-Menten (MM) equation, to describe these reactions. 
It is based on the Law of Mass Action . An enzyme in concentration z converts a 
substrate in concentration y by dyjdt = — byz* The rate constant is b, and you see 
the product of “enzyme times substrate,” A similar law governs the other reactions 
(some go backwards). The equations are nonlinear, with no exact solution. It is 
typical of applied mathematics (and nature) that a pattern can still be found. 

What happens is that the enzyme concentration z(t) quickly drops to z 0 Kj(y + if). 
The Michaelis constant K depends on the rates (like b) in the mass action laws. 
Later the enzyme reappears (z^ = z 0 ). But by then the first reaction is over. Its law 
of mass action is effectively 


dy 

dt 


— byz = 


cy 

y+ K 


( 17 ) 


with c = bz 0 K. This is the Michaelis-Menten equation — basic to biochemistry. 
The rate dyjdt is all-important in biology. Look at the function cyj{y+ K ): 


when y is large, dyjdt zz — c when y is small, dyjdt cy/K. 
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The start and the finish operate at different rates, depending whether y dominates K 
or K dominates y. The fastest rate is c. 

A biochemist solves the MM equation by separating variables: 

™ v + k r 

— - — dy — - e dt gives y + K In y = - ct + C, (18) 

Set t = 0 as usual. Then C = y Q + K In y 0 . The exponentials of the two sides are 

e y = e -v^;. (19) 

We don't have a simple formula for y. We are lucky to get this close. A computer 
can quickly graph y(t) — and we see the dynamics of enzymes. 

Problems 27-32 follow up the MiehaeJis-Menten theory. In science, concentrations 
and rate constants come with units. In mathematics, variables can be made dimen- 
sionless and constants become 1. We solve dYjdT = Yj(Y + 1) and then switch back 
to y, r, t\ K. This idea applies to other equations too. 

Essential point: Most applications of calculus come through differential equations 
That is the language of mathematics— with populations and chemicals and epidemics 
obeying the same equation. Running parallel to dyidt = cy are the difference equations 
that come next. 


6.5 EXERCISES 


Read-th rough questions 

The equations dyjdt = cy and dyidt = cy + s and dyidt — 
u(y)y(r) are called o because we can separate y from r. 
Integration of \ dyiy = j c dt gives b Integration of 
{ dyi\y + sic) = \ e dt gives c The equation dyidx = 
— xiy leads to d Then y 2 + x 2 = e and the solution 
stays on a circle. 

The logistic equation is dyidt = t The new term - by 2 
represents g when cy represents growth, Separation gives 
j dyi{cy — fey 2 ) = f dt , and the y-integral is 1 ;c times In h 
Substituting y 0 at f=0 and taking exponentials produces 
yi\c — by) = e cr i i l As t cc, y approaches J That 
is the steady state where cy — by 2 = k The graph of y 
looks like an i because it has an inflection point at 

y= . m 

In biology and chemistry, concentrations y and z react at 
a rate proportional to y times n This is the Law r of 
o In a model equation dyidt — y)y the rate r depends 
on P The MM equation is dyjdt = q Separating 
variables yields J r dy - * = - ct + C. 

Separate, integrate, and solve equations 1-8. 

1 dyjdt = y + 5, v 0 = 2 

2 dyidt = 1 /y, y 0 = 1 

3 dyidx = xiy 2 , y 0 = 1 


4 dyidx = y 2 +i, V,, = 0 

5 dyidx = (y + 1 )/(.v + 1 ), >- 0 = 0 

6 dyidx = tan v cos x, y 0 = 1 

7 dyjdt = y sin q y 0 = I 

8 dyjdt = e t ~ > \ y 0 = e 

9 Suppose the rate ofjprowth is proportional to y instead 
of v. Solve dyidt = y starting from y 0 . 

10 The equation dyidx = nyjx for constant elasticity is the 
same as d(\n y)!d{\n x) = . The solution is lny = 


1 1 When c = 0 in the logistic equation, the only term is y = 
— hy 2 . What is the steady state y x ? How long until y drops 
from y 0 to yy 0 - 

12 Reversing signs in Problem 11, suppose y = -f by 2 . At 
what time does the population explode to y = oc, starting 
from y () = 2 {Adam + EveY! 

Problems 1 3-26 deal with logistic equations y 1 = cy — by 2 . 

13 Show that y = 1/(1 + c“ T ) solves the equation y f = y — y 1 . 
Draw r the graph of y from starting values j and 3 . 

14 (a) What logistic equation is solved by y = 2/(1 - e“ r )? 

(b) Find c and h in the equation solved by y = 1/(1 + 

15 Solve z* = — ~ + 1 w f ith z G = 2, Turned upside down as in 
(13), what is >■= 1 /-? 
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16 By algebra find the S-curve (12) from y — l/z in (14). 

17 How many years to grow from y 0 ~ ^ c/b to y = jc/h? Use 
equation (10) for the time t since the inflection point in 1988. 
When does y reach 9 billion = .9 c/b? 

18 Show by differentiating u = y/(c — by) that if / — cy — by 2 
then u' = cu. This explains the logistic solution (1 1) — it is 
u = 

19 Suppose Pittsburgh grows from y 0 = 1 million people in 
1900 to y = 3 million in the year 2000. If the growth rate is 
y = 12,000/year in 1900 and / = 30,000/year in 2000, substi- 
tute in the logistic equation to find c and b. What is the steady 
state? Extra credit: When does y = y*,/2 = c/2fi? 

20 Suppose c = 1 but b = — 1, giving cooperation / = y 4- y 2 . 
Solve for y(t) if y 0 — 1. When does y become infinite? 

21 Draw an S-curve through (0, 0) with horizontal asymp- 
totes y = — 1 and y = 1. Show that y - (d - + e~ { ) has 

those three properties. The graph of y 2 is shaped like 


22 To solve / = cy — by 3 change to u= 1/y 2 . Substitute for 
y in u f = — 2y7y* to find a linear equation for u. Solve it as 
in (14) but with u 0 ~ 1 jy\. Then y = l/^/u, 

23 With y—TY and t = sT> the equation dyjdt — cy — by 2 
changes to dY/dT~ Y— Y 2 . Find r and s. 

24 In a change to y = rY and t = sT, how are the initial values 
y 0 and y f 0 related to y 0 and YJ? 

25 A rumor spreads according to y — y(N — y). If y people 
know, then JV — y don’t know. The product y(N — y) measures 
the number of meetings (to pass on the rumor). 

(a) Solve dyjdt = y(N — y) starting from y 0 m 1, 

(b) At what time T have N/2 people heard the rumor? 

(c) This model is terrible because T goes to as 

N -* oo. A better model is y = by(N — y). 

26 Suppose b and c are bcth multiplied by 10. Does the 
middle of the S-curve get steeper or flatter? 

Prohlems 27-34 deal with mass action and the MM equation 

-cy/{y + K). 

27 Most drugs are eliminated acording to y=— cy but 


aspirin follows the MM equation. Withc = K = y 0 =l, does 
aspirin decay faster? 

28 If you take aspirin at a constant rate d (the maintenance 
dose), find the steady state level where d = cy/(y + K). Then 

y f — 0. 

29 Sbow that the rate I* = cy/(y + K) in the MM equation 
increases as y increases, and find the maximum as y oo. 

30 Graph the rate R as a function of y for K = 1 and K = 
10. (Take c = 1.) As the Michaelis constant increases, the rate 

. At what value of y is R — ic? 

31 With y-KY and ct = KT, find the 4 ‘non dimensional” 
MM equation for dYjdT From the solution e y Y- 
e~ T e Y °Y 0 recover the y, t solution (19). 

32 Graph y(t) in (19) for different c and K (by computer). 

33 The Law of Mass Action for A + B->C is y'~ 

Kao ~ y)(b o “ y )■ Suppose y 0 — 0, = b 0 - 3, k — 1. Solve for 

y and find the time when y = 2. 

34 In addition to the equation for d[C]jdt, the mass action 

law gives d^A^/dt = . 

35 Solve y = y + 1 from y 0 — 0 by assuming y = Ad + B + Dt. 
Find A, B y D , 

36 Rewrite cy — by 2 as a 2 — x 2 , with x = N /by — cjlyfb and 

a = . Substitute for a and x in the integral taken 

from tables, to obtain the y-integral in the text: 

dx _ 1 a + x f dy _ 1 y 

a 2 — x 2 2a U a-x jcy-by 2 c n c—by 

37 (Important) Draw the y-lines (with arrows as in the text) 
for y = y/(l - y) and y = y — y 3 . Which steady slates are 
approached from which initial values y 0 ? 

38 Explain in your own words how the y-line works. 

39 (a) Solve / = tan y starting from y 0 = n/6 to find 
sin y = 

(b) Explain why t = 1 is never reached. 

(c) Draw arrows on the y-line to show that y approaches 
nj2 — when does it get there? 

40 Write tbe logistic equation as y 1 — cy( 1 — y/K). As y' 

approaches zero, y approaches . Find y, y\ y* at the 

inflection point. 


6.6 Powers Instead of Exponentials 



You may remember our first look at e. It is the special base for which e* has slope 1 
at x = 0. That led to the great equation of exponential growth: The derivative of 
equals e x . But our look at the actual number e = 2.71828... was very short. 
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It appeared as the limit of (1 + l/n)\ This seems an unnatural way to write down 
such an important number. 

I want to show how (1 + 1 jn) R and (1 + xjrif arise naturally. They give discrete 
growth in finite steps — with applications to compound interest. Loans and life insur- 
ance and money market funds use the discrete form of / = cy + s. (We include extra 
information about bank rates, hoping this may be useful some day.) The applications 
in science and engineering are equally important. Scientific computing, like account- 
ing, has difference equations in parallel with differential equations. 

Knowing that this section will be full of formulas, I would like to jump ahead and 
tell you the best one. It is an infinite series for e x . What makes the series beautiful is 
that its derivative is itself. 

Start with y = 1 + x. This has y = 1 and / = 1 at x - 0. But /' is zero, not one. 
Such a simple function doesn’t stand a chance! No polynomial can be its own deriva- 
tive, because the highest power x n drops down to The only way is to have no 

highest power , We are forced to consider infinitely many terms — a power series — to 
achieve “derivative equals function,” 

To produce the derivative 1 + x, we need l + jc + jx 2 . Then \x 2 is the derivative 
of ^x 3 , which is the derivative of The best way is to write the whole series at 
once: 

infinite series e* = 1 + jc + ?x 2 + £x 3 + ^x 4 4- (1) 

This must be the greatest power series ever discovered. Its derivative is itself: 

de*jdx = 0+ 1 + jc 4- + £x 3 + = e x . (2) 

The derivative of each term is the term before it. The integral of each term is the one 
after it (so j" e x dx = e x + C). The approximation appears in the first two 

terms. Other properties like ( e x )(e x )= e 2x are not so obvious. (Multiplying series is 
hard but interesting.) It is not even clear why the sum is 2.718.., when x=l. 
Somehow 1 + 1 + ^ + £ + *■' equals e. That is where (1 4- 1 jnY will come in. 

Notice that x? is divided by the product 1*2*3 n. This is “n factorial .” Thus 

x 4 is divided by 1 • 2 ■ 3 * 4 = 41 = 24, and x 5 is divided by 5! = 1 20, The derivative of 
x 5 /120 is x 4 /24, because 5 from the derivative cancels 5 from the factorial. In general 
x n jn\ has derivative x ll ~ 1 }(n — 1)! Surprisingly 0! is 1, 

Chapter 10 emphasizes that x n /n\ becomes extremely small as n increases. The 
infinite series adds up to a finite number — which is e x . We turn now to discrete 
growth, which produces the same series in the limit. 

This headline was on page one of the New York Times for May 27, 1990. 

213 Years After Loan, Uncle Sam is Dunned 

San Antonio, May 26 — More than 200 years ago, a wealthy Pennsylvania 
merchant named Jacob DeHaven lent $450,000 to the Continental Congress to 
rescue the troops at Valley Forge. That loan was apparently never repaid. 

So Mr. DeHaven’s descendants are taking the United States Government to 
court to collect what they believe they are owed. The total: $141 billion if the 
interest is compounded daily at 6 percent, the going rate at the time. If com- 
pounded yearly, the bill is only $98 billion. 

The thousands of family members scattered around the country say they are 
not being greedy. “It’s not the money — it’s the principle of the thing,” said 
Carolyn Cokerham, a DeHaven on her father’s side who lives in San Antonio. 



“You have to wonder whether there would even be a United States if this man 
had not made the sacrifice that he did. He gave everything he had.” 

The descendants say that they are willing to be flexible about the amount of 
settlement. But they also note that interest is accumulating at $190 a second. 

“None of these people have any intention of bankrupting the Government,” 
said Jo Beth Kloecker, a lawyer from Stafford, Texas. Fresh out of law school, 
Ms. Kloecker accepted the case for less than the customary 30 percent 
contingency. 

It is unclear how many descendants there are. Ms. Kloecker estimates that 
based on 10 generations with four children in each generation, there could be as 
many as half a million. 

The initial suit was dismissed on the ground that the statute of limitations is 
six years for a suit against the Federal Government. The family’s appeal asserts 
that this violates Article 6 of the Constitution, which declares as valid all debts 
owed by the Government before the Constitution was adopted. 

Mr. DeHaven died penniless in 1812. He had no children. 

COMPOUND INTEREST 

The idea of compound interest can be applied right away. Suppose you invest $1000 
at a rate of 100% (hard to do). If this is the annual rate, the interest after a year is 
another $1000. You receive $2000 in all. But if the interest is compounded you receive 
more: 


after six months: Interest of $500 is reinvested to give $1500 

end of year: New interest of $750 (50% of 1500) gives $2250 total. 

The bank multiplied twice by 1.5 (1000 to 1500 to 2250). Compounding quarterly 
multiplies four times by 1.25 (1 for principal, .25 for interest): 

after one quarter the total is 1000 + (.25)(1000) = 1250 

after two quarters the total is 1250 + (.25)(1250) = 1562.50 

after nine months the total is 1562.50 + (.25)(I562.50) = 1953.12 

after a full year the total is 1953.12 + (.25)(I953.12) = 2441.41 

Each step multiplies hy 1 + (1/n), to add one nth of a year’s interest — still at 100%: 

quarterly conversion: (1 + 1/4)* x 1000 = 2441.41 

monthly conversion: (1 + 1/12) 12 x 1000= 2613.04 

daily conversion: (1 + 1/365) 365 x 1000 = 2714.57. 


Many banks use 360 days in a year, although computers have made that obsolete. 
Very few banks use minutes (525,600 per year). Nobody compounds every second 
(n = 31,536,000). But some banks offer continuous compounding. This is the limiting 
case (n -*■ oo) that produces e : 



x 1000 approaches e x 1000 = 2718.28. 


1. Quick method for (1 + 1/n)": Take its logarithm . Use ln(l + x) 


x with x = -: 

n 
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( 3 ) 


As 1/n gets smaller, this approximation gets better. The limit is 1, Conclusion: 
(1 + 1 jnf approaches the number whose logarithm is 1. Sections 6,2 and 6,4 define 
the same number {which is e). 

2 . S/ow method for (1 + 1 jrif: Multiply out all the terms . Then let n -* co. 

This is a brutal use of the binomial theorem. It involves nothing smart like logarithms, 
but the result is a fantastic new formula for e r 


Practice 




Binomial theorem for any positive integer n: 




- ? + *«-!)<« -2) 


©r— or- 


1-2-3 

Each tenn in equation (4) approaches a limit as n -*■ oo. Typical terms aie 


(4) 


"("-d /iv , i 

1-2 ""* 1-2 


and 


n(n — l)(n — 2) 


\nj 1-2-3 


1-2-3 

Next comes 1/1 • 2 * 3 • 4, The sum of all those limits in (4) is our new formula for e: 

1 


lim^t + ~J = 


1 + l + J_ + _^ 

1*2 1 * 2*3 


H — = e. 


1 *2 - 3 *4 

In summation notation this is E^_ 0 1 jk\ = e , The factorials give fast convergence: 
1 + t + ,5 + .16667 + .04167 + .00833 + ,00139 + ,00020 + ,00002= 2,71828, 


(5) 


Those nine terms give an accuracy that was not reached by n = 365 compoundings. 
A limit is still involved (to add up the whole series). You never see e without a limit] 
It can be defined by derivatives or integrals or powers (1 + 1 jn) n or by an infinite 
series. Something goes to zero or infinity, and care is required. 

All terms in equation (4) are below (or equal to) the corresponding terms in (5), 
The power (1 + 1 jrif approaches e from below , There is a steady increase with n. Faster 
compounding yields more interest. Continuous compounding at 100% yields e , as 
each term in (4) moves up to its limit in (5). 

Remark Change {1 + 1 jn) n to (1 + xjnf. Now the binomial theorem produces e x \ 

K)"- + <:) + "^© v " approaches ' +x+ £i + -- <6) 

Please recognize e x on the right side! It is the infinite power series in equation (1), 
The next term is x z j 6 (x can be positive or negative). This is a final formula for e x \ 


6L The limit of (t + xjn )* is e*. At x — 1 we find e. 


The logarithm of that power is n ln(l + xjn) ^ n(x/n) = x. The power approaches e*. 

To summarize: The quick method proves (1 + 1 jnf by logarithms. The slow 
method (multiplying out every term) led to the infinite series. Together they show the 
agreement of all our definitions of e. 
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DIFFERENCE EQUATIONS VS. DIFFERENTIAL EQUATIONS 

We have the chance to see an important part of applied mathematics. This is not a 
course on differential equations, and it cannot become a course on difference equ- 
ations. But it is a course with a purpose — we aim to use what we know. Our main 
application of e was to solve / — cy and / — cy + s. Now we solve the corresponding 
difference equations. 

Above all, the goal is to see the connections. The purpose of mathematics is to 
understand and explain patterns. The path from “discrete to continuous" is beautifully 
illustrated by these equations. Not every class will pursue them to the end, but I 
cannot fail to show the pattern in a difference equation : 

y(t + 1) = ay{t}> (7) 

Each step multiplies by the same number a . The starting value y 0 is followed by ay 0 , 
a 2 y 0 , and <j 3 y 0 . The solution at discrete times t = 0, 1, 2, . is y(0 = fl'yo- 

This formula dy 0 replaces the continuous solution of the differential equation. 



decaying 

oscillation 


growing 


, \T^ decay 

1 0 d h 


growth 


-► a 


oscillaSoft -1 ^ 0 a 1 — >0 ] 

Hg.6 .17 Growth for \a{ > 1, decay for |a| < 1. Growth factor a compares to e*. 


A source or sink (birth or death, deposit or withdrawal) is like y* — cy + s: 

& + 1) = ay{t) + s. (8) 

Each step multiplies by a and adds s. The first outputs are 

y(l) = ay a + s, y(2) = a 2 y 0 + as + s, >(3) = a 2 y 0 + a 2 s + as + s. 

We saw this pattern for differential equations — every input s becomes a new starting 
point . It is multiplied by powers of a. Since s enters later than y 0 , the powers stop at 
r- 1. Algebra turns the sum into a clean formula by adding the geometric series: 

X0 = fl'yo + s[a' _1 + a*~ 2 + "* +<i+ l] = a l y Q + ia l - \)j{a~ 1). (9) 

EXAMPLE 1 Interest at 8% from annual IRA deposits of s = $2000 (here y 0 — 0). 

The first deposit is at year t = 1. In a year it is multiplied by a = LOS, because 8% is 
added. At the same time a new s= 2000 goes in. At t= 3 the first deposit has been 
multiplied by (1.08) 2 , the second by 1.08, and there is another s= 2000. After year t, 

y(t) = 2000(1.08 f - 1)/(1.08 - 1). (10) 

With t = 1 this is 2000. With ( = 2 it is 2000 (1,08 + 1) — two deposits. Notice how 
a — 1 (the interest rate .08) appears in the denominator. 

EXAMPLE 2 Approach to steady state when \a\ < 1. Compare with c < 0. 

With a>l, everything has been increasing. That corresponds to c>0 in the 
differential equation (which is growth). But things die, and money is spent, so a can 
be smaller than one. In that case rfy 0 approaches zero — the starting balance disap- 
pears. What happens if there is also a source? Every year half of the balance y{t) is 
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spent and a new £2000 is deposited. Now a ~ 

At + 1) = + 2000 yields y(t) = (tfy 0 + 200()[((i)' - l)/(i - 1)]. 

The limit as t -*■ oo is an equilibrium point. As (£)* goes to zero, y(f) stabilizes to 

y^ = 2000(0 - l)/(i - 1) = 4000 = steady state . (1 1) 

Why is 4000 steady? Because half is lost and the new 2000 makes it up again. The 
iteration is y n + l = jy n + 2000. its fixed point is where y ^ = 7 y w + 2000. 

In general the steady equation is y^ — ay^-^- s. Solving for y ^ gives 5/(1 —a). 
Compare with the steady differential equation y* = cy + s = 0: 


y^ = — (differential equation) vs. 
c 


y*> = 


l - a 


(difference equation). (12) 


EXAMPLE 3 Demand equals supply when the price is right. 

Difference equations are basic to economics. Decisions are made every year (by a 
farmer) or every day (by a bank) or every minute (by the stock market). There are 
three assumptions: 

1. Supply next time depends on price this time: S{t + 1) = cP[t). 

2. Demand next time depends on price next time: D(t + 1) = - dP{t + 1) + b. 

3- Demand next time equals supply next time: D(t + 1) = S(f + 1), 

Comment on 3: the price sets itself to make demand = supply. The demand slope - d 
is negative. The supply slope c is positive. Those lines intersect at the competitive 
price, where supply equals demand. To find the difference equation, substitute I and 
2 into 3: 

Difference equation: - dP(t + 1) + b = cF(t) 

Steady state price : - dP w + b = cP w . Thus P ^ = bj{c + d). 

If the price starts above the difference equation brings it down. If below* the 
price goes up. When the price is it stays there. This is not news — economic 
theory depends on approach to a steady state. But convergence only occurs if c < d. 
If supply is less sensitive than demand, the economy is stable. 

Blow-up example: c~ 2,b = d = 1. The difference equation is — P(t + 1) + 1 = 2P(t). 
From P(0) = 1 the price oscillates as it grows: P = — 1, 3, — 5, 11, .... 

Stable example: c ~ 1/2, b = d ~ 1. The price moves from P( 0) = 1 to F(oo) = 2/3: 

1 13 5 2 

-P(r+ 1) + 1 = -F(r) yields P= 1,-,-, approaching 

Increasing d gives greater stability. That is the effect of price supports. For d = 0 
(fixed demand regardless of price) the economy is out of control. 

THE MATHEMATICS OF FINANCE 


It would be a pleasure to make this supply-demand model more realistic — with 
curves, not straight lines. Stability depends on the slope— calculus enters. But we 
also have to be realistic about class time. I believe the most practical application is 
to solve the fundamental problems of finance . Section 6.3 answered six questions about 
continuous interest. We now answer the same six questions when the annual rate is 
x = .05 = 5% and interest is compounded n times a year. 
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First we compute effective rates, higher than .05 because of compounding: 
compounded quarterly ^1 -I- = 1.0509 [ effective rate .0509 = 5.09%] 

compounded continuously e' 05 = 1.0513 [effective rate 5.13%] 


Now come the six questions. Next to the new answer (discrete) we write the old 
answer (continuous). One is algebra, the other is calculus. The time period is 20 years, 
so simple interest on y 0 would produce {.G5)(20)(y o ). That equals y 0 — money doubles 
in 20 years at 5% simple interest. 

Questions X and 2 ask for the future value y and present value y 0 with compound 
interest n times a year: 


1* y growing from y 0 : 

y *l * 

y = e (05 » (20 Vo 

2. deposit y 0 to reach y: 

(, . 05 Y“" 

y °T + T > 

^ = £-(.05,(20^ 


Each step multiplies by a - (1 + .05/n). There are 20n steps in 20 years. Time goes 
backward in Question 2. We divide by the growth factor instead of multiplying. The 
future value is greater than the present value (unless the interest rate is negative!). As 
n oo the discrete y on the left approaches the continuous y on the right. 

Questions 3 and 4 connect y to s (with y 0 = 0 at the start). As soon as each s is 
deposited, it starts growing. Then y = s -I- as + a 2 s H — . 


3. y growing from deposits s : 

y=s 

(1 + .05/n) 20 " - 1' 
.05/n 

4, deposits s to reach y: 

s = y 

.05/n 

_(1 + .05/n) 2OB - 1 


y=s 

s = y 


[ 

[ 


£< 051 ( 20 ) 

i05~ 

.05 

e (.0J)(20) 


t] 


Questions 5 and 6 connect y 0 to s. This time y is zero — there is nothing left at the 
end . Everything is paid. The deposit y Q is just enough to allow payments of s. This 
is an annuity, where the bank earns interest on your y 0 while it pays you s (n times 
a year for 20 years). So your deposit in Question 5 is less than 20ns. 

Question 6 is the opposite — a loan. At the start you borrow y 0 (instead of giving 
the bank y 0 ), You can earn interest on it as you pay it back. Therefore your payments 
have to total more than y 0 . This is the calculation for car loans and mortgages. 


5- Annuity: Deposit y 0 to receive 20n payments of s: 


>o = * 


1 -{1 +.05/n)- 2OB 
.05/n 


6. Loan : Repay y 0 with 20n payments of s: 

.05/n 


s =yo 


1 — (1 + .05/n)" 


>c = 5 


s = >o 


] _ e ~l05H20r 


.05 


.05 


| _ e -L05){20) 


Questions 2, 4, 6 are the inverses of 1, 3, 5. Notice the pattern: There are three num- 
bers y , y 0 , and s. One of them' is zero each time. If all three are present, go back to 
equation (9). 


The algebra for these lines is in the exercises. It is not calculus because At is not dt . 
All factors in brackets [ ] are listed in tables, and the banks keep copies. It might 
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also be helpful to know their symbols. If a bank has interest rate i per period over 
N periods, then in our notation a = l + i = 1 + .0 5/n and t = N — 20 n: 

future value of y 0 = SI (line 1): y(N) - (1 + i) N 

present value of y = SI (line 2): y 0 = (1 + i) 

future value of s = $1 (line 3): y(N) = - [(1 + if — {J/i 

present value of s= SI (line 5): y 0 = = [l - (1 + i)~*]/f 

To tell the truth, I never knew the last two formulas until writing this book. The 
mortgage on my home has N = (12)(25) monthly payments with interest rate i — 
,07/12, In 1972 tbe present value was 542,000 = amount borrowed. I am now going 
to see if the bank is honest- 1 

Remark In many loans, the bank computes interest on the amount paid back 
instead of the amount received. This is called discounting . A loan of $1000 at 5% 

for one year costs $50 interest- Normally you receive $1000 and pay back $1050. 

With discounting you receive 5950 (called the proceeds) and you pay back $1000. 
The true interest rate is higher than 5% — because the 550 interest is paid on the 
smaller amount $950. In this case the “discount rate” is 50/950 = 5. 26 Vo. 

SCIENTIFIC COMPUTING: DIFFERENTIAL EQUATIONS BY DIFFERENCE EQUATIONS 

In biology and business, most events are discrete. In engineering and physics, time 
and space are continuous. Maybe at some quantum level it's all the same, but the 
equations of physics (starting with Newton's law F = ma) are differential equations. 
The great contribution of calculus is to model the rates of change we see in nature. 
But to solve that model with a computer, it needs to be made digital and discrete. 

These paragraphs work with dy/dt = cy . It is the test equation that all analysts use, 
as soon as a new computing method is proposed. Its solution is y = e e1 7 starting from 
y Q = 1. Here we test Euler’s method (nearly ancient, and not well thought of). He 
replaced dy/dt by Ay/ At: 

Euler’s Method j ft + **) ~ >(0 = (J 3 ) 

At 

The left side is dy/dt , in the limit At -► 0, We stop earlier, when At > 0, 

The problem is to solve (13). Multiplying by At, the equation is 

y(t + At) = (1 + cAt)y(t) (with y{ 0) = 1). 

Each step multiplies by a = 1 + cAt, so n steps multiply by a n : 

y - a" = (1 + cAty at time nAt. (14) 

This is growth or decay, depending on a. The correct e* is growth or decay, depending 
on c. The question is whether a n and d* stay close . Can one of them grow while the 
other decays? We expect the difference equation to copy / = cy , but we might be 
wrong. 

A good example is y* = — y. Then c = — 1 and y = e~*~ the true solution decays. 


tit’s not. s is too big. I knew it. 
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The calculator gives the following answers a" for n — 2, 10, 20: 


At 

a = 1 + cAf 

a 2 

a 10 

a 20 

3 

-2 

4 

1024 

1048576 

1 

0 

0 

0 

0 

1/10 

.90 

.81 

.35 

.12 

1/20 

.95 

.90 

.60 

.36 


The big step At= 3 shows total instability (top row). The numbers blow up when 
they should decay. The row with At — 1 is equally useless (all zeros). In practice the 
magnitude of cAt must come down to ,10 or .05. For accurate calculations it would 
have to be even smaller, unless we change to a better difference equation. That is the 
right thing to do. 

Notice the two reasonable numbers. They are .35 and .36, approaching e~ l = .37. 
They come from n - 10 {with At = 1/10) and n “ 20 (with At = 1/20). Those have the 
same clock time nAt = 1: 



The main diagonal of the table is executing (1 + xjjif -► e* in the case x — — 1. 

Final question: How quickly are .35 and .36 converging toe ' 1 = .37? With At = .10 
the error is .02. With At - .05 the error is .01. Cutting the time step in half cuts the 
error in half. We are not keeping enough digits to be sure, but the error seems close 
to 7 At. To test that, apply the “quick method” and estimate <f = (1 - A tf from its 
logarithm: 

ln(l - At)" = n ln(l - At) as n[ - At - J<At) 2 ] - - 1 - ^Af. (15) 

The clock time is nAt - 1. Now take exponentials of the far left and right: 

a n = {l- A t) n * e~ 1 e A,/2 % e~\\ - {At). ( 16 ) 

The difference between a n and e~ l is the last term {Ate" 1 . Everything comes down 
to one question: Is that error the same as ^Af? The answer is yes , because e~ x j2 is 
1/5. If we keep only one digit, the prediction is perfect! 

That took an hour to work out, and I hope it takes longer than At to read. I wanted 
you to see in use the properties of In x and e*. The exact property In a" — n In a came 
first. In the middle of (15) was the key approximation ln(l + x) x 2 , with x = 

- At. That x 2 term uses the second derivative (Section 6.4). At the very end came 
e* « 1 + x. 

A linear approximation shows convergence: (1 + x/nT -► e*. A quadratic shows the 
error: proportional to At — 1/n. It is like using rectangles for areas, with error propor- 
tional to Ax. This minimal accuracy was enough to define the integral, and here it is 
enough to define e . It is completely unacceptable for scientific computing. 

The trapezoidal rule, for integrals or for y f — cy , has errors of order (Ax ) 2 and (At) 2 . 
All good software goes further than that. Euler’s first-order method could not predict 
the weather before it happens. 


Euler’s Method for ^ = F{y, t): — + At) ^ 

dt At 


= W), 0. 
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6.6 EXERCISES 


Read-throagh questions 

The infinite series for e 1 is o Its derivative is b The 
denominator n! is called ** c and it equals d At x = 

1 the series for e is e . 

To match the original definition of e, multiply out 
(1 H- l/n)" = t (first three terms). As n -> oo those terms 
approach g in agreement with c. The first three terms of 
(1 4 xjnf are _h_ + As n ^ go they approach I in 
agreement with Thus (1+x/n)" approaches I A 
quicker method computes Inf 1 +x/n)" & k (first term 
only) and takes the exponential. 

Compound interest (n times in one year at annual rate x) 
multiplies by (_!__)"■ As n -+ oo, continuous compounding 
multiplies by m . At x = 10% with continuous compound- 
ing, $1 grows to » in a year. 

The difference equation y(t 4 1) = ay(t) yields y(f) = o 
times y,>. The equation y(t + 1) = ay(t) 4 s is solved by y = 

fl I y 0 + s[l4’flH ha' -1 ]. The sum in brackets is p . 

When a = LOS and y 0 = 0, annual deposits of s = 1 produce 
v — g after t years. If a = { and y 0 = 0, annual deposits 
of 5 = 6 leave * after t years, approaching y^ = * . 
The steady equation y ffi = ay ^ 4 s gives y ^ = t 

When i = interest rate per period, the value of y 0 = $ 1 after 
N periods is y(N) = u . The deposit to produce y(N) = 1 
is Vo = v . The value of s = SI deposited after each period 
grows to y(N) = * . The deposit to reach y(£V) — 1 is s = 

* 

Euler’s method replaces y' = cy by A y = cyA£. Each step 
multiplies y by y . Therefore y at £ = 1 is (1 + cAf) l/ %, 
which converges to * as A£ -* 0. The error is proportional 
to A which is too B for scientific computing, 

1 Write down a power series y = 1 — x + ■ - whose derivative 
is — y. 

2 Write down a power series y = 1 + lx 4 -* whose deriva- 
tive is 2y. 

3 Find two series that are equal to their second derivatives. 

4 By comparing e=l + l+i + £ + it+— with a larger 
series (whose sum is easier) show that e < 3. 

5 At 5% interest compute the output from $1000 in a year 
with 6-month and 3-month and weekly compounding. 

6 With the quick method ln(l 4- x) ^ x, estimate Inf 1 — 1 /it)" 
and In(l + 2/n)". Then take exponentials to find the two limits. 

7 With the slow method multiply out the three terms of 
(1 — r) 2 and the five terms of (1 — i)L What are the first three 
terms of (1 — 1/n)", and what are their limits as n -*■ oo? 

8 The slow method leads to 1 — 1 4 1/2! — 1/3! + -* for the 


limit of (1 — l/n) B . What is the sum of this infinite series — 
the exact sum and the sum after live terms? 

9 Knowing that (1 4 1/n}" -> e , explain (1 4 1/n} 2 " -► e z and 
(1 4 2 jN) s ^e z . 

10 What are the limits of (1 4 1/n 2 )" and (1 4 1/n}" 3 ? 
OK to use a calculator to guess these limits. 

11 (a) The power (1 4 l/n) B (decreases) (increases) with n, as 
we compound more often, (b) The derivative of f{x) = 

x ln(l 4 1/x), which is , should be (<0)(>0). This is 

confirmed by Problem 12. 

12 Show that ln(l 4 1/x) > 1/fx 4 1) by drawing the graph of 

1/f. The area from t = 1 to 1 4 l/x is . The rectangle 

inside it has area * 

13 Take three steps of y(f 4 1) = 2y(t) from y 0 = L 

14 Take three steps of y(t 4 1) = 2 y(t) 4 1 from y 0 = 0. 

Solve the difference equations 15-22. 


15 

* + !) = 

3><t), j>o = 4 

16 

jO + 

n= 


17 


*(0 + l,>o = 0 

18 

y(i + 

n= 

= >0 = 0 

19 

J#+l) = 

3y(f) + 1, yo = 0 

20 

jO + 

i)= 

= 3y(0 + s, y„ = 1 

21 

*< + !) = 

ay(t) + s, y 0 = 0 

22 

j<* + 

i)= 

= ay(t) + s. y 0 = 5 

Id 

23-26, which initial value produces 3 

'1 = 

= y Q (steady state)? 

23 

*f + i) = 

2y{tj -e 

24 

jO + 

i)= 

so 

1 

3 

II 

25 

+ 1 ) = 

"-M + 6 

26 

y(( + 

i)= 

--W 0 + 6 

27 

In Problems 23 and 24, start 

from 

yo 

- 2 and take three 


steps to reach y 3 . Is this approaching a steady state? 

28 For which numbers a does (l — a*)/( 1 — a) approach a limit 
as f ^ oc and what is the limit? 

29 The price P is determined by supply = demand or 
— dP(£4 l)4h = cP(f), Which price P is not changed from 
one year to the next? 

30 Find P(f) from the supply -demand equation with c = 1, 
d — 2 y b = 8, P(0) = 0. What is the steady state as t -* oo? 

Assume 10% interest (so a — 1 4 i = LI) in Problems 31-38. 

31 At 10% interest compounded quarterly, what is the effec- 
tive rate? 

32 At 10% interest compounded daily, what is the effective 
rate? 

33 Find the future value in 20 years of $100 deposited now. 

34 Find the present value of $1000 promised in twenty years. 
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35 For a mortgage of $100,000 over 20 years, what is the 
monthly payment? 

36 For a car loan of $10,000 over 6 years, what is the monthly 
payment? 

37 With annual compounding of deposits s = $1000, what is 
the balance in 20 years? 

38 If you repay s = $1000 annually on a loan of 58000, when 
are you paid up? (Remember interest.) 

39 Every year two thirds of the available houses are sold, and 
1000 new houses are built. What is the steady state of the 
housing market — how many are available? 

40 If a loan shark charges 5% interest a month on the $1000 
you need for blackmail, and you pay $60 a month, how much 


do you still owe after one month (and after a year)? 

41 Euler charges c = 100% interest on his $1 fee for discover- 
ing e. What do you owe (including the $1) after a year with 
(a) no compounding; (b) compounding every week; (c) con- 
tinuous compounding? 

42 Approximate (1 -I- \/n) n as in (15) and (16) to show that 
you owe Euler about e — e/2 n. Compare Problem 6.2.5. 

43 My Visa statement says monthly rate = 1.42% and yearly 
rate= 17%. What is the true yearly rate, since Visa com- 
pounds the interest? Give a formula or a number. 

44 You borrow y 0 = $80,000 at 9% to buy a house. 

(a) What are your monthly payments s over 30 years? 

(b) How much do you pay altogether? 


6.7 Hyperbolic Functions 


This section combines e x with e~ x . Up to now those functions have gone separate 
ways — one increasing, the other decreasing. But two particular combinations have 
earned names of their own (cosh x and sinh x): 

e x + e~ x e x - e x 

hyperbolic cosine cosh x = hyperbolic sine sinh x = 


The first name rhymes with “gosh". The second is usually pronounced “cinch". 

The graphs in Figure 6.18 show that cosh x > sinh x. For large x both hyperbolic 
functions come extremely close to je x . When x is large and negative , it is e~ x that 
dominates. Cosh x still goes up to + oo while sinh x goes down to — oo (because 
sinh x has a minus sign in front of e~ x ). 



Fig. 6.18 Cosh x and sinh x. The hyperbolic Fig. 6.19 Gateway Arch courtesy of the St. 
functions combine and \e~ x . Louis Visitors Commission. 


The following facts come directly from ^e x + e~ x ) and j(e x - e ~ x ): 

cosh(— x) = cosh x and cosh 0 = 1 (cosh is even like the cosine) 
sinh(-x) = - sinh x and sinh 0 = 0 (sinh is odd like the sine) 



278 


6 Exponentials and Logarithms 


The graph of cosh x corresponds to a hanging cable (hanging under its weight). 
Turned upside down, it has the shape of the Gateway Arch in St. Louis. That must 
be the largest upside-down cosh function ever built. A cable is easier to construct 
than an arch, because gravity does the work. With the right axes in Problem 55, the 
height of the cable is a stretched-out cosh function called a catenary : 

y = a cosh ( x/a ) (cable tension/cable density = a). 

Busch Stadium in St. Louis has 96 catenary curves, to match the Arch. 

The properties of the hyperbolic functions come directly from the definitions. There 
are too many properties to memorize — and no reason to do it! One rule is the most 
important. Every fact about sines and cosines is reflected in a corresponding fact about 
sinh x and cosh x. Often the only difference is a minus sign. Here are four properties: 

1. (cosh x) 2 - (sinh x) 2 = 1 [ instead of (cos x) 2 + (sin x) 2 = l] 


Check 


dx 


jV' + e"^ 2 jV* — e ° 2x 

[“ 2 


e 2x + 2 + e 2x — e 2x + 2 — e 2x 


= 1 


2. — (cosh x) = sinh x instead of — (cos x) 


| = - sin xj 


d d 

3. — (sinh x) = cosh x like — sin x = cos x 
dx dx 


I s ' 


4. sinh x dx = cosh x + C and cosh x dx = sinh x + C 


1 ' 



Fig. 6.20 The unit circle cos 2 r + sin 2 r = 1 and the unit hyperbola cosh 2 r — sinh 2 f = 1. 


Property 1 is the connection to hyperbolas. It is responsible for the “h" in cosh and 
sinh. Remember that (cos x) 2 + (sin x) 2 = 1 puts the point (cos x, sin x) onto a unit 
circle. As x varies, the point goes around the circle. The ordinary sine and cosine are 
“circular functions." Now look at (cosh x, sinh x). Property 1 is (cosh x) 2 - (sinh x) 2 = 
1, so this point travels on the unit hyperbola in Figure 6.20. 

You will guess the definitions of the other four hyperbolic functions: 


tanh x = 


sinh x 
cosh x 


e x - e~ x 
e x + e~ x 


sech x = — : — 
cosh x 


2 

e x + e~ x 


. cosh x 

COth X = — ; 

sinh x 


csch x = -t"t — 
sinh x 


e x -e~ x 

2 

e x ~ e~ x 


I think “tanh" is pronounceable, and “sech" is easy. The others are harder. Their 
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properties come directly from cosh 2 * - sinh 2 x = 1, Divide by cosh 2 * and sinh 2 x: 
1 - tanh 2 x = sech 2 x and coth 2 x - 1 = csch 2 * 

(tanh xf — sech 2 * and (sech *)' = — sech x tanh x 


J tanh x dx - 


tanh x dx = | ^ 7 — dx — ln(cosh x ) + C. 


cosh x 

INVERSE HYPERBOLIC FUNCTIONS 

You remember the angles sm _1 x and tan _1 x and sec -1 x, In Section 4.4 we 
differentiated those inverse functions by the chain rule. The main application was to 
integrals. If we happen to meet Jd*/{1 + x 2 ), it is tan"*x + C, The situation for 
sinh -1 * and tanh -1 * and sech -1 x is the same except for sign changes — which are 
expected for hyperbolic functions. We write down the three new derivatives-. 

y " sinh" 1 * (meaning x = sinh y) has ~ = 
y = tanh" 1 * (meaning x = tanh y) has ~ = 
y — sech” l x (meaning x = sech y) has ~ = 



1 

(1) 

dx 

V* 2 + l 

dy = 

dx 

1 

l-x 2 

(2) 

dy = 

-1 

(3) 

dx 

xj\~x 2 


Problems 44-46 compute dyjdx from l/(dx/dy), The alternative is to use logarithms. 
Since In x is the inverse of e x , we can express sinh _1 x and tanh -1 * and sech" 1 * as 
logarithms. Here is y= tanh" 1 *: 


1 1 1 + *1 

> = -ln 

2 |_1 — * 


b— “"’'iHrb' 


1 1 


1 


2 1 - * 1 


a* 


(4) 


The last step is an ordinary derivative of iln(l + *)-jln(l -x). Nothing is new 
except the answer. But where did the logarithms come from? In the middle of the 
following identity, multiply above and below by cosh y: 


1 + x _ 1 + tanh y _ cosh y + sinh y _ e y 
1 — x 1 ” tanh y cosh y — sinh y e~ y 


-e 2 r 


Then 2 y is the logarithm of the left side. This is the first equation in (4), and it is the 
third formula in the following list: 


sinh 


tanh 


* = ln£* + ^/x 2 + 1 J cosh l x - ln[* + ^/* 2 - 1 J 

i+yr^i 


1 l i P 


+ X 
— X 


sech 1 x = ln 


Remark 1 Those are listed only for reference. If possible do not memorize them. The 
derivatives in equations (1), (2), (3) offer a choice of antiderivatives — either inverse 
functions or logarithms (most tables prefer logarithms). The inside cover of the book 
has 

1 -j— 

| + C (in place of tanh - ‘x + C). 


I 


dx 1 

1 2 ^ ^ 

1 — x £ 2 


1- X 


Remark 2 Logarithms were not seen for sin l x and tan l x and sec 1 *, You might 
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wonder why. How does it happen that tanh “ is expressed by logarithms, when the 
parallel formula for tan -1 * was missing? Answer: There must be a parallel formula. 
To display it I have to reveal a secret that has been hidden throughout this section. 

The secret is one of the great equations of mathematics. What formulas for cos * 
and sin* correspond to + and ^{e x — e~ x y! With so many analogies 
(circular vs. hyperbolic) you would expect to find something. The formulas do exist, 
but they involve imaginary numbers , Fortunately they arc very simple and there is 
no reason to withhold the truth any longer: 

cos x = -F e ,x ) and sin * = — (e 1 * “ e -1 *). (5) 

2 2i 

It is the imaginary exponents that kept those identities hidden. Multiplying sin x by 
i and adding to cos * gives Euler’s unbelievably beautiful equation 

cos x + i sin * = e'F (6) 

That is parallel to the non-beautiful hyperbolic equation cosh x + sinh x= e x . 

1 have to say that (6) is infinitely more important than anything hyperbolic will 
ever be. The sine and cosine are far more useful than the sinh and cosh. So we end 
our record of the main properties, with exercises to bring out their applications. 


6.7 EXERCISES 


Read-through questions 


Find the derivatives of the functions 9-18: 


Cosh x = Q and sinh x — b and cosh 2 .* — sinh 2 .* = 

c Their derivatives are d and e and f 

The point (x, y) = (cosh t, sinh 0 travels on the hyperbola 
fl A cable hangs in the shape of a catenary y = h 

The inverse functions sinh *x and tanh -1 -* are equal to 
ln[.* + v ^ + 1] and I In J Their derivatives are I 
and it So we have two ways to write the anti i The 
parallel to cosh ,* + sinh x = e x is Euler’s formula m . 
The formula cos * = 4(e ,Jf + t* -1 *) involves n exponents. 
The parallel formula for sin .* is o 

1 Find cosh x 4- sinh *\ cosh x — sinh x, and cosh r * sinh x. 

2 From the definitions of cosh x and sinh x, find their deriv- 
atives. 

3 Show that both functions satisfy y“ = v. 

4 By the quotient rule, verify (tanh x)' = sech 2 *, 

5 Derive cosh 2 x -I- sinh 2 x = cosh 2x, from the definitions, 

6 From the derivative of Problem 5 find sinh 2.x, 

7 The parallel to (cos x + i sin x) n = cos nx + i sin nx is a 

hyperbolic formula (cosh .x + sinh x)* = cosh nx + . 

8 Prove sinh(_* + y) = sinh x cosh y + cosh x sinh y by 

changing to exponentials. Then the x-derivative gives 
cosh(x -r y) = , 


9 cosh(3x + 1) 

1 1 1/cosh x 
13 cosh 2 x 4- sinh 2 x 
15 tanh v /x 2 + i 


10 sinh x 1 
12 sinh(ln x) 

14 cosh 2 x — sinh 2 ,* 


16 (1 +tanh x)/(l - tanh x) 

17 sinh 6 x 18 ln(sech x 4- tanh x) 

19 Find the minimum value of cosh(ln x) for x > 0. 

20 From tanh x = § find sech x, cosh x, sinh .*, coth x, csch x r 

21 Do the same if tanh x = — 12/13. 

22 Find the other five values if sinh x - 2, 

23 Find the other five values if cosh x = 1. 

24 Compute sinhfln 5) and tanh(2 In 4) h 


Find antiderivatives for the functions in 25-32: 

25 cosh(2.v 4 - 1 ) 26 x coshfx 2 } 

27 cosh 2 .* sinh .* 28 tanh 2 * sectrx 


29 


sinh * 


1 + cosh x 
31 sinh .* + cosh x 


30 coth x- = 


e 4 - e 


32 (sinh x + cosh x)' 
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33 The triangle in Figure 6.20 has area ^ cosh t sinh u 

(a) Integrate to find the shaded area below the hyperbola 

(b) For the area A in red verify that dA/dt = i 

(c) Conclude that A — it + C and show C = 0, 

Sketch graphs of the functions in 34-40. 

34 y = tanh x (with inflection point) 

35 y = coth x (in the limit as x oo) 

36 y = sech x 

37 y = sinh -1 x 

38 y =s cosh - *x for x > 1 

39 y = sech -1 x for 0<x ^ 1 

40 y — tanh _1 x = ^ 1n^ | for \x\ < 1 

41 (a) Multiplying x = sinh y — ife 7 — e" 7 ) by 2c 7 gives 
(e 7 ) 2 -* 2x{e*) — 1=0. Solve as a quadratic equation for e 7 . 
(b) Take logarithms to find y - sinh -1 x and compare with 
the text. 

42 (a) Multiplying x = cosh y = i(e* + e" 7 ) by 2e* gives 
(e 7 ) 2 — Irffh +1=0. Solve for e 7 . 

(b) Take logarithms to find y = cosh - *x and compare with 
the text. 

43 Turn (4) upside down to prove / = — 1/(1 — x 2 ), V y = 
coth -1 x. 

44 Compute dyjdx = l/^x 2 + 1 by differentiating x = sinh y 
and using cosb 2 y — sinh 2 y = 1. 

45 Compute dyjdx = 1/(1 — x 2 ) if y = ianh -1 x by differen- 
tiating x = tanh y and using sech 2 y + tanh 2 y - 1. 

46 Compute dyjdx = — 1/x^/l — x 2 for y = sech" 1 x J by 
differentiating x = sech y. 


jjjc/vWl 

50 Jx dx/Jx* + 1 

— x 2 

52 jixljl-x* 


[ 1!2 dx f 1 dx 

53 Compute and "j J- 

Jo 1 Jo 1 -x 

54 A falling body with friction equal to velocity squared 
obeys dvjdt = g — v 2 . 

(a) Show that u(£) = ^fg tanh % fgt satisfies the equation. 

(b) Derive this v yourself, by integrating dvj(g — v 2 ) = dL 

(c) Integrate u(r) to find the distance /(£). 

55 A cable hanging und er its ow n weight has slope S = dyjdx 
that satisfies dSjdx = c^f 1 +S 2 The constant c is the ratio of 
cable density to tension. 

(a) Show that S = sinh cx satisfies the equation. 

(b) Integrate dyjdx = sinh cx to find the cable height y(x), 
ifM0)=l/c. 

(c) Sketch the cable hanging between x = — L and x = L 
and find how far it sags down at x = 0. 

56 The simplest nonlinear wave equation (Burgers' equation) 
yields a waveform W(x) that satisfies W* = WW - W . One 
integration gives W = ifY 2 — W> 

(a) Separate variables and integrate: 

dx = dWfti W 2 —W)= — dWj{ 2 -W)- dWjW. \ 

(b) Check W f = \W 2 -W. 

57 A solitary water wave has a shape satisfying the KdV 
equation y* “ y — 6y/. 

(a) Integrate once to find /. Multiply the answer by y. 

(b) Integrate again to find y (all constants of integration 
are zero). 

(c) Show that y = isech 2 (x/2) gives the shape of the 
“soliton." 


From formulas (1), (2), (3) or otherwise, find antiderivatives in 
47-52: 

47 I" dx/(4 — x 1 ) 48 Li xl(a 2 -x 2 ) 


58 Derive cos ix ~ cosh x from equation (5). What is the 

cosine of the imaginary angle i = ? 

59 Derive sin ix = i sinh x from (5). What is sin i? 

60 The derivative of e ix = cos x + i sin x is . 
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CHAPTER 7 


Techniques of Integration 


Chapter 5 introduced the integral as a limit of sums. The calculation of areas was 
started — by hand or computer. Chapter 6 opened a different door. Its new functions 
e 1 and In x led to differential equations. You might say that all along we have been 
solving the special differential equation dfjdx = u(x). The solution is f— f i?(jc)fix. But 
the step to dy/dx — cy was a breakthrough. 

The truth is that we are able to do remarkable things. Mathematics has a language > 
and you are learning to speak it, A short time ago the symbols dy/dx and f v(x)dx 
were a mystery. {My own class was not too sure about u(x) itself — the symbol for a 
function.) It is easy to forget how far we have come, in looking ahead to what is next. 

I do want to look ahead. For integrals there are two steps to take — more functions 
and more applications. By using mathematics we make it Kve. The applications are 
most complete when we know the integral. This short chapter will widen {very much) 
the range of functions we can integrate. A computer with symbolic algebra widens it 
more. 

Up to now, integration depended on recognizing derivatives. If v(x)= sec 2 * then 
/{*) - tan *. To integrate tan * we use a substitution:, 

fsinoc f du 

dx = — — = — \nu = — In cos x, 

J cos x J u 

What we need now are techniques for other integrals , to change them around until 
we can attack them. Two examples are j x cos x dx and j ^1 — x 2 dx, which are not 
immediately recognizable. With integration by parts, and a new substitution, they 
become simple. 

Those examples indicate where this chapter starts and stops. With reasonable effort 
(and the help of tables, which is fair) you can integrate important functions. With 
intense effort you could integrate even more functions. In older books that extra 
exertion was made — it t ended to dominate the course. They had integrals like 
f(x+ \)dx/yj2x 2 - 6x + 4, which we could work on if we had to. Our time is too 
valuable for thatl Like long division, the ideas are for us and their intricate elaboration 
is for the computer. 

Integration by parts comes first. Then we do new substitutions. Partial fractions 
is a useful idea (already applied to the logistic equation / = cy — by 2 ). In the last 
section x goes to infinity or y(x) goes to infinity — but the area stays finite. These 
282 improper integrals are quite common. Chapter 8 brings the applications. 
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7.1 Integration by Parts 


There are two major ways to manipulate integrals {with the hope of making them 
easier). Substitutions are based on the chain rule, and more are ahead. Here we 
present the other method, based on t he product rule. The reverse of the product rule, 
to find integrals not derivatives, is integration by parts. 

We have mentioned J cos I 2 x dx and J In x dx. Now is the right time to compute 
them (plus more examples). You will see how j In x dx is exchanged for j 1 dx — a 
definite improvement. Also j xe x dx is exchanged for J e x dx. The difference between 
the harder integral and the easier integral is a known term — that is the point. 

One note before starting: Integration by parts is not just a trick with no meaning. 
On the contrary, it expresses basic physical laws of equilibrium and force balance. 
It is a foundation for the theory of differential equations (and even delta functions). 
The final paragraphs, which are completely optional, illustrate those points too. 

We begin with the product rule for the derivative of u(x) times v{x): 

dv du d 

u(x) — + i>(x) — = — (u(x)u(x)). ( 1 ) 


Integrate both sides . On the right, integration brings back u{x)u(x). On the left are 
two integrals, and one of them moves to the other side (with a minus sign): 


V 


dv 

u(x) — dx = u(x)v(x) — 


du 

u(x) — dx. 
ax 


( 2 ) 


That is the key to this section — not too impressive at first, but very powerful. It is 
integration by parts ( u and t> are the parts). In practice we write it without x’s: 


7A The integration by parts formula is J u dv = uu - J t> du. 


( 3 ) 


The problem of integrating u dv/dx is changed into the problem of integrating 
v du/dx. There is a minus sign to remember, and there is the “integrated term" u(x)t>(x). 
In the definite integral, that product »(x)i>(x) is evaluated at the endpoints a and b: 

dx. (4) 

The key is in choosing u and v. The goal of that choice is to make J v du easier than 
J u dv. This is best seen by examples. 


dv 

u — dx — u(h)v(b) - u(a)v(a) 


dx 


> 


du 

dx 


EXAMPLE 1 For j In x dx choose u — In a and dv = dx (so v = x): 


In x dx = uv — 


I 


v du = x In 


x — 


X - dx. 
X 


I used the basic formula (3). Instead of working with In x (searching for an antideriva- 
tive), we now work with the right hand side. There x times l/x is 1. The integral of 
1 is x. Including the minus sign and the integrated term uv~ x In x and the constant 
C, the answer is 

j In x dx = x In x - x+ C, (5) 


For safety, take the derivative. The product rule gives In x + x(l/x) 1, which is In x. 

The area under y = In x from 2 to 3 is 3 In 3 — 3 — 2 In 2 + 2, 
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To repeat: We exchanged the integral of In x for the integral of L 

EXAMPLE 2 For { x cos x dx choose u = x and dv — cos x dx (so i>(x) — sin x): 

| x cos x dx = uv — J v du = x sin x — j sin x dx . (6) 

Again the right side has a simple integral, which completes the solution: 

J x cos x dx- x sin x + cos x + C. (7) 

Note The new integral is not always simpler. We could have chosen u = cos x and 
dv — x dx. Then v — jx 2 . Integration using those parts give the true but useless result 

| x cos x dx — uv — J v du = $x 2 cos x + 1 jx 2 sin x dx. 

The last integral is harder instead of easier (x 2 is worse than x). In the forward 
direction this is no help. But in the opposite direction it simplifies \\x 2 sin x dx. The 
idea in choosing u and v is this: Try to give u a nice derivative and dv a nice integral. 

EXAMPLE 3 For J (cos x) 2 dx choose u = cos x and dv ~ cos x dx (so v = sin x): 

| (cos x) 2 dx — uv — J v du — cos x sin x + J (sin x) 2 dx. 

The integral of (sin x) 2 is no better and no worse than the integral of (cos x) 2 . But we 
never see (sin x) 2 without thinking of 1 - (cos x) 2 . So substitute for (sin x) 2 : 

| (cos x) 2 dx = cos x sin x -f j 1 dx - j (cos x) 2 dx. 

The last integral on the right joins its twin on the left, and j 1 dx = x: 

2 j (cos x) 2 dx = cos x sin x + x. 

Dividing by 2 gives the answer, which is definitely not i(cos x) 3 . Add any C: 

j (cos x) 2 dx — 7 (cos x sin x + x) + C. (8) 

Question Integrate (cos x) 2 from 0 to 2n. Why should the area be 71 ? 

Answer The definite integral is ^{cos x sin x + x)]q\ This does give n . That area can 
also be found by common sense, starting from (cos x) 2 + (sin x) 2 = I. The area under 
1 is 2n. The areas under (cos x) 2 and (sin x) 2 are the same. So each one is n. 


EXAMPLE 4 


Evaluate J tan *x dx by choosing u — tan l x and v = x: 



* 

tan _1 x dx = uv — 

v du — x tan -1 x - 

m 


dx 

1 + x 2 ’ 


The last integral has * — 1 + x 2 below and almost has dw = 2x dx above: 


(9) 


x dx 
1 +x 2 


\ f “ = ^ In w = ~ ln(l + x 2 ). 
2) w 2 2 


Substituting back into (9) gives J tan dxasx tan _1 x — j\n(\ + x 2 ). All the familiar 
inverse functions can be integrated by parts (take v = x, and add “ + C” at the end). 

Our final example shows how two integrations by parts may be needed, when the 
first one only simplifies the problem half way. 


EXAMPLE 5 For j x 2 e x dx choose u = x 2 and dv = e x dx (so v = e*); 

| x 2 e x dx — uv — jvdu = x 2 e* - J e x (2x dx). 


( 10 ) 
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The last integral involves xe x . This is better than xV, but it still needs work: 

| xe x dx = uv - J v du = xe* - J e x dx (nowi< = x). (II) 

Finally e x is alone. After two integrations by parts, we reach je x dx. In equation (11), 
the integral of xe x is xe x - e x . Substituting back into (10), 

J x 2 e x dx -■ x 1 e x — 2[xe* — e*] + C. (12) 

These five examples are in the list of prime candidates for integration by parts: 

x n e x , x"sin x, x"oosx, x"ln x, t^sin x, e*cos x, sin _1 x, tan -1 x 

This concludes the presentation of the method — brief and straightforward. 
Figure 7.1a shows how the areas judv and $vdu fill out the difference between the 
big area u(f?)r(b) and the smaller area u(n)n(a). 


u(x)8(x) 

spike 

area 

m 


o v { v 2 u 

Fig. 7 A The geometry of integration by parts. Delta function (area 1) multiplies u(x} at x = 0, 

In the movie Stand and Deliver, the Los Angeles teacher Jaime Escalante computed 
jx 2 sin x dx with two integrations by parts. His success was through exercises — plus 
insight in choosing it and v. (Notice the difference from Jx sin x 2 dx. That falls the 
other way — to a substitution.) The class did extremely well on the Advanced Place- 
ment Exam. If you saw the movie, you remember that the examiner didn’t believe 
it was possible. 1 spoke to him long after, and he confirms that practice was the key. 

THE DELTA FUNCTION 

From the most familiar functions we move to the least familiar. The delta function is 
the derivative of a step function , The step function C/(x) jumps from 0 to 1 at x — 0. 
We write <5(x) = dU/dx , recognizing as we do it that there is no genuine derivative at 
the jump. The delta function is the limit of higher and higher spikes— from the 
"burst of speed” in Section L2, They approach an infinite spike concentrated at a 
single point (where U jumps). This "non-function” may be unconventional — if is 
certainly optional — but it is important enough to come back to. 

The slope dUjdx is zero except at x = 0, where the step function jumps. Thus 
5(x) = 0 except at that one point, where the delta function has a “spike,” We cannot 
give a value for 6 at x = 0 , but we know its integral across the jump , On every interval 
from — A to A, the integral of dV /dx brings back U: 

A 

S(x] dx = 

-A 

"The area under the infinitely tall and infinitely thin spike 5(x) equals 1,” 

So far so good. The integral of <5(x) is l/(x). We now integrate by parts for a crucial 
purpose— to find the area under t?(x)<5(x), This is an ordinary function times the della 
function. In some sense i?(x) times <5(x) equals u(0) times 5(x) — because away from 
x = 0 the product is always zero. Thus e*S(x) equals S(x ), and sin x S(x) = 0. 


A fc dX=U WY-A =L < 13 > 
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The area under u(x)<5(.x) is t^O) — which integration by parts will prove: 


70 The integral of u(x) times <5(x) is \ A _ A r(x)5(x)dx = u(0). 


The area is y(0) because the spike is multiplied by u(0) — the value of the smooth 
function v(x} at the spike * But multiplying infinity is dangerous, to say the least* (Two 
times infinity is infinity). We cannot deal directly with the delta function. It is only 
known by its integrals] As long as the applications produce integrals (as they do), we 
can avoid the fact that § is not a true function* 

The integral of i>(x)<S(x) = v(x)dUjdx is computed "'by parts: 1 ’ 

J v(x)3(x) dx - u(x)t/(x)]l A — J C7(x) ^ dx. (14) 

Remember that U = 0 or U = 1* The right side of (14) is our area t?(0): 

i;(j 4) ■ 1 — f 1 ~dx = i?(A) * (l?(^ 4) — u(0)) = i?(0). (1 5) 

Jo dx 

When t^x) = 1, this answer matches j 5dx = 1. We give three examples: 

2 cos x S(x) dx =1 f_ 5 (U(x) + &(x))dx = 7 J 1 . . J (S(x)) 2 dx - oo . 

A nightmare question occurs to me. What is the derivative of the delta function ? 

INTEGRATION BY PARTS IN ENGINEERING 

Physics and engineering and economics frequently involve products. Work is force 
times distance. Power is voltage times current* Income is price times quantity* When 
there are several forces or currents or sales, we add the products. When there are 
infinitely many, we integrate (probably by parts). 

1 start with differential equations for the displacement u at point x in a bar: 

~ ^ = f{x) with v{x) = k ^ . (16) 

dx dx 

This describes a hanging bar pulled down by a force /(x). Each point x moves through 
a distance w(x). The top of the bar is fixed, so u(0) = 0. The stretching in the bar is 
dujdx. The internal force created by stretching is v = kdujdx. (This is Hooke's law*) 
Equation (16) is a balance of forces on the small piece of the bar in Figure 7.2. 


Fig. 7.2 Difference in internal Force balances external force 
—Av = /Ax or — dv/dx =J\x) 
v = W at x = 1 balances hanging weight 
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EXAMPLE 6 Suppose _/(x) - F f a constant force per unit length. We can solve (16): 

v(x)= -Fx + C and kv{x) = -%Fx 2 + Cx + D. (17) 

The constants C and D are settled at the endpoints (as usual for integrals). At x = 0 
we are given u — 0 so D — 0. At x — 1 we are given v=W so C— W ^ F. Then v(x) 
and u(x) give force and displacement in the bar. 

To see integration by parts, multiply —dvjdx — f[x ) by u(x) and integrate: 

J Vm*) dx = - £ £ u(x) dx=- v{x)v(x)]l + £ v(x) £ dx. (18) 

The left side is force times displacement, or external work . The last term is internal 
force times stretching — or internal work , The integrated term has u(0) = 0 — the fixed 
support does no work. It also has )W f the work by the hanging weight. The 
balance of forces has been replaced by a balance of work. 

This is a touch of engineering mathematics, and here is the main point. Integration 
by parts makes physical sense! When —dvjdx ~/is multiplied by other functions— 
called test functions or virtual displacements — then equation (18) becomes the 
principle of virtual work . It is absolutely basic to mechanics. 


7.1 EXERCISES 


Read-through questions 


9 | ^sin x dx 10 J ^cos x dx 


Integration by pans is the reverse of the a rule. It 
changes judu into b minus c t In case u = x and 
dv = e 2x dx y it changes J xe 2 *dx to d minus • . The 

definite integral j 2 xe 2 *dx becomes < minus g , 

In choosing u and dv f the h of u and the i of 
dvjdx should be as simple as possible. Normally In x goes into 
] and e 1 goes into h . Prime candidates are u — x or 
x 2 and v = sin x or i or m . When u = x 2 we need 
n integrations by parts. For Jsin -i x dx , the choice dv = 
dx leads to o minus p . 

If U is the unit step function, dUjdx — 6 is the unit q 
function. The integral from —A to A is U{A)—U(—A) = 
t The integral of u(x)<5{x) equals « The integral 
J l _ a cos x 6(x)dx equals t In engineering, the balance of 
forces — dvjdx =f is multiplied by a displacement u(x) and 
integrated to give a balance of u . 


Integrate 1-16, usually by parts (sometimes twice). 

1 $ x sin x dx 2 j x e* x dx 

3 fxe~ x dx 4 jx cos 3xdx 

5 | x 2 cos xdx (use Problem 1) 

6 J x In x dx 7 |ln(2x+l)dx 

8 f x 2 e 4x dx (use Problem 2) 


[9 and 10 need two integrations. I think e* can be u or u.] 


12 |*e _J£l dx 
14 J cosfln x) dx 
16 J *Mn x dx 
18 j" cos - L (2x) dx 


II J e flX sin hx dx 
13 | sin(tn x) dx 
15 | (In xfdx 
17 J sin -l x dx 

19 f x tan -1 * dx 

20 Jx 2 sin x dx (from the movie) 

21 j x 3 co$ x dx 22 J x 3 ain x dx 

23 | x 2 e x dx 24 |xsec -1 xdx 

25 f x sec 2 * dx 26 J * cosh * dx 

Compute the definite integrals 27-34. 

27 jp In x dx 28 |p dx (let u = ^/x) 

29 |p*e - 2 jf d* 30 j*ln(x 2 )dx 

31 fox cos xdx 32 |* jl xsin*dx 

33 fp ln(x 2 + l)dx 34 |p 2 x 2 sin * dx 


In 35—40 derive "reduction formulas" from higher to lower 
powers. 

35 J x*e*dx = x n e* — n J x* “ *e*dx 
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36 f x V*dx = 

37 J x* cos x dx = x n sm x — n j x*~ ^in x dx 

38 J x" sin x dx - 

39 | (In xydx = x(\n x)" — n J (In xf~ 1 dx 

40 f xflnx)*dx = 

41 How w r ould you compute f x sin x e x dx using Problem 9? 
Not necessary to do it. 

42 How r would you compute J x e* tan - *x dx? Don’t do it. 

43 (a) Integrate J x 3 sin x 2 dx by substitution and parts. 

(b) The integral J x*sin x 2 dx is possible if n is . 

44-54 are about optional topics at the end of the section. 

44 For the delta function i5(x) find these integrals: 

(a) JL , e u 6{x)dx (b) J 3 _ , v(x)6(x)dx (c) \\ cos x d(x)dx. 

45 Solve dy/dx — 3<5(x) and dy/dx = 3£(x) 4- y^x). 

46 Strange fact: £(2x) is different from <5(x). Integrate them 
both from —1 to 1 . 

47 The integral of <5(x) is the unit step C/(x), Graph the next 
integrals R(x) = j V(x)dx and Q(x) = J R{x)dx, The ramp R 
and quadratic spline Q are zero at x = 0. 

48 In S(x — i), the spike shifts to x = j. It is the derivative of 
the shifted step U(x — y). The integral of t'(x)<$(x - equals 
the value of v at x = i. Compute 

(a) f ‘ d(x - j)d.x; (b) e*d{x - \)dx' y 

(c) J‘_ , <5(x)d(x - i)dx. 

49 The derivative of £(x) is extremely singular. It is a “dipole” 
known by its integrals. Integrate by parts in(b) and (c): 

C l dd , f 1 d3 f l d8 
(a) J — dx (b) x — dx (c) I v(x) — dx = - v'{0). 

50 Why is U(x)<5(x)dx equal to }? (By parts.) 

51 Choose limits of integration in u(x) = J/(x)dx so that 
dv/dx — — /(x) and v *= 0 at x = 1 . 


52 Draw the graph of u{x ) if u(l) = 0 and — dvjdx =J\x \ : 

(a )/=-*-; (b )/= U(x-i); (c)/= d(x-& 

53 What integral u(x) solves k du/dx = p(xr) with end con* 
dition 1/(0} — 0? Find u(x) for the three v's (not /’ s) in 
Problem 52, and graph the three u’s. 

54 Draw the graph of AC/ /Ax = [U(x + Ax) — lffx)]/Ax. 
What is the area under this graph? 

Problems 55-62 need more than one integration. 

55 Two integrations by parts lead to F— integral of v : 

| uv'dx ~ uv — V\i 4- } Vu'dx, 

Test this rule on J x 2 sin x dx, 

56 After n integrations by parts, j' u(dv/dx)dx becomes 

uu - u { " v tl) + u l \ 2) 1-<— l)"f i,dx. 

u in} is the nth derivative of u, and is the nth integral of t?. 
Integrate the last term by parts to stretch this formula to 
n + 1 integrations. 

57 Use Problem 56 to find j x 3 e*dx. 

58 From /(x) — J\0) = jj/'0)dc, integrate by parts (notice di 
not dx) to reach f\x) =/l0) -h/'(0)x 4- J t */”{0(x — f)dt. Con- 
tinuing as in Problem 56 produces Taylor's formula: 

J\x)=m+f'(0)x + ^/"(0)X 2 + - + [V* 

2! Jo n! 

59 What is the difference between uw"dx and w"w dx? 

60 Compute the areas A = J* In x dx and B = e y dy. Mark 
them on the rectangle with corners (0, 0), (e, 0), ( e y 1), (0, 1). 

61 Find the mistake. I don’t believe r*cosh x — e*sinh x: 

[ e x sinh x dx — e 1 cosh x — J e x cosh x dx 

= e*cosh x — e*sinh x 4- J e x sinh x dx. 

62 Choose C and D to make the derivative of 
C e**cos bx 4- D e^sin bx equal to e ax cos bx. Is this easier 
than integrating e^cos bx twice by parts? 


7.2 Trigonometric Integrals 

The next section will put old integrals into new forms. For example Jjc 2 v / 1 - x 2 dx 
will become J sin 2 ft cos 2 ft df). That looks simpler because the square root is gone. But 
still S'n 2 ft cos 2 0 has to be integrated. This brief section integrates any product of sines 
and cosines and secants and tangents . 

There are two methods to choose from. One uses integration by parts, the other 
is based on trigonometric identities. Both methods try to make the integral easy (but 
that may take time). We follow' convention by changing the letter 6 back to x. 
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Notice that sin 4 x cos x dx is easy to integrate. It is u^du. This is the goal in 
Example 1 — to separate out cos x dx. It becomes du, and sin x is u. 

EXAMPLE 1 J sin 2 * cos 3 * dx (the exponent 3 is odd) 

Solution Keep cos x dx as du. Convert the other cos 2 * to 1 - sin 2 *: 

f . •> f . , ~ v , sin 3 * sin 5 * , _ 

snrx cos 3 * dx = surx(l — snrx)cos * dx = — — h C. 


EXAMPLE 2 | sin 5 * dx (the exponent 5 is odd) 

Solution Keep sin * dx and convert everything else to cosines. The conversion is 
always based on sin 2 * + cos 2 * “ 1: 

J(! - cos 2 *) 2 sin * dx = J (1 — 2 cos 2 * + cos 4 *) sin * dx. 

Now cos * is u and - sin * dx is du. We have [ (— 1 + 2u 2 - u 4 )du. 

General method for [ sin m x cos' 1 * dx, when m or n is odd 

If n is odd, separate out a single cosxdx. That leaves an even numher of cosines. 

Convert them to sines. Then cos * dx is du and the sines are u's. 

If m is odd, separate out a single sin * dx as du. Convert the rest to cosines. 

If m and n are both odd, use either method. 

If m and n are both even, a new method is needed. Here are two examples. 


EXAMPLE 3 J cos 2 * dx (m = 0, n = 2, both even) 

There are two good ways to integrate cos 2 *, but substitution is not one of them. If 
u equals cos x, then du is not here. The successful methods are integration by parts 
and double-angle formulas. Both answers are in equation (2) below — I don’t see 
either one as the obvious winner. 

Integrating cos 2 x by parts was Example 3 of Section 7.1. The other approach, by 
double angles, is based on these formulas from trigonometry: 

cos 2 * = j(l + cos 2*) sin 2 * = y(l - cos 2x) (1) 

The integral of cos 2x is \ sin 2*. So these formulas can be integrated directly. They 
give the only integrals you should memorize — either the integration by parts form, 
or the result from these double angles: 

| cos 2 * dx equals $(x + sin * cos x) or ?x + * sin 2* (plus C). (2) 

[ sin 2 * dx equals \{x - sin x cos *) or \x — £ sin 2* {plus C). (3) 

EXAMPLE 4 } cos 4 * dx (m — 0, n = 4, both are even) 

Changing cos 2 * to 1 - sin 2 * gets us nowhere. All exponents stay even. Substituting 
u = sin * won’t simplify sin 4 * dx, without du. Integrate by parts or switch to 2*. 

First solution Integrate by parts. Take u — cos 3 * and dv = cos * dx: 

[ (cos 3 x){cos * dx) -uv-jvdu- cos 3 * sin * - { (sin *)(- 3 cos 2 * sin * dx). 

The last integral has even powers sin 2 * and cos 2 *. This looks like no progress. 
Replacing sin 2 * by 1 — cos 2 * produces cos 4 * on the right-hand side also: 

j cos 4 * dx = cos 3 * sin * + 3 J cos 2 *(l - cos 2 x)dx. 
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Always even powers in the integrals. But now move 3 J cos 4 * dx to the left side: 

Reduction 4 J cos 4 * dx = cos 3 * sin * + 3 J cos 2 * dx. (4) 

Partial success — the problem is reduced from cos 4 x to cos 2 *. Still an even power, 
but a lower power. The integral of cos 2 x is already known. Use it in equation (4): 

| cos 4 * dx = * cos 3 * sin * + £ ■ £(x + sin * cos *) + C. (5) 

Second solution Substitute the double-angle formula cos 2 * = £ + ^ cos 2*: 

J" cos J * dx = J (£ + 3 cos 2*) 2 d* = £ J {1 + 2 cos 2* + cos 2 2 x)d*. 

Certainly J dx = *. Also 2 J cos 2* dx = sin 2*. That leaves the cosine squared: 

J cos 3 2* = f i(l + cos 4 x)dx — jx + £ sin 4* + C. 

The integral of cos 4 * using double angles is 

i [x + sin 2x + 7* + £sin 4*] + C, (6) 

That solution looks different from equation (5), but it can't be. There all angles were 
*, here we have 2x and 4*. We went from cos 4 * to cos 2 2* to cos 4x, which was 
integrated immediately. The powers were cut in half as the angle was doubled. 

Double-angle method for j sin m x cos”x dx, when m and n are even . 

Replace sin 2 * by |(1 ~cos2x) and cos 1 * by + cos2x). The exponents drop to 
mj2 and njl. If those are even, repeat the idea (2* goes to 4x). If m/2 or n/2 is odd, 
switch to the “general method” using substitution. With an odd power, we have du. 

EXAMPLE 5 (Double angle) J sin 2 x cos 2 * dx = J ^(1 - cos 2x)(l + cos 2 x)dx. 

This leaves 1 - cos 2 2* in the last integral. That is familiar but not necessarily easy. 
We can look it up (safest) or remember it (quickest) or use double angles again: 

^ (1 - cos 2 2x)dx = ^ - \ “ \ cos 4*^ dx = £ - ~y~“~ + c - 

Conclusion Every sin m x cos"x can be integrated. This includes negative m and n — 
see tangents and secants below. Symbolic codes like MACSYMA or Mathematica 
give the answer directly. Do they use double angles or integration by parts? 

You may prefer the answer from integration by parts (I usually do). It avoids 2x 
and 4*, But it makes no sense to go through every step every time. Either a computer 
does the algebra, or we use a “reduction formula” from n to n — 2: 

Reduction n J cos rt x dx = cos" - *x sin x + (n - 1) J cos B_ 2 x d*. (7) 

For n = 2 this is J cos 2 * dx — the integral to learn. For n = 4 the reduction produces 
cos 2 *. The integral of cos 6 * goes to cos 4 *. There are similar reduction formulas for 
sin ffl x and also for sm ffl x cos R *. I don’t see a good reason to memorize them. 

INTEGRALS WITH ANGLES px AND qx 

Instead of sin 8 * times cos 6 *, suppose you have sin 8* times cos 6*. How do you 
integrate? Separately a sine and cosine are easy. The new question is the integral of 
the product : 
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EXAMPLE 6 Find sin 8 x cos 6x dx. More generally find JJ* sin px cos qx dx. 

This is not for the sake of making up new problems* I believe these are the most 
important definite integrals in this chapter (p and q are 0, 1, 2, **.). They may be the 
most important in all of mathematics, especially because the question has such a 
beautiful answer* The integrals are zero * On that fact rests the success of Fourier 
series, and the whole industry of signal processing. 

One approach (the slow way) is to replace sin 8 x and cos 6x by powers of cosines. 
That involves cos u x* The integration is not fun* A better approach, which applies to 
all angles px and qx 7 is to use the identity 


sin px cos qx = £ sin(p + q)x + £ sin(p - q)x , 


( 8 ) 


Thus sin Sxcos 6 jc — j sin 14x + £ sin 2x* Separated like that, sines are easy to 
integrate: 



r 


sin 8 jc cos 6x dx = 


1 cos 14x 

2 14 


1 cos 2x 


in 


2 2 Jo 


= 0 * 


Since cos 14* is periodic, it has the same value at 0 and 2 Subtraction gives zero. 
The same is true for cos 2x, The integral of sine times cosine is always zero over a 
complete period (like 0 to 2k). 


What about sin px sin ^x and cos px cos qxl Their integrals are also zero, provided 
p is different from q * When p = q we have a perfect square. There is no negative area 
to cancel the positive area* The integral of cos 2 px or sin 2 px is n. 

EXAMPLE 7 \l* sin 8 * sin 7xdx = 0 and J*" sin 2 8 x dx = *. 

With two sines or two cosines (instead of sine times cosine), we go back to the 
addition formulas of Section 1*5* Problem 24 derives these formulas: 

sin px sin qx = - i cos(p + q)x + ^ cos(p - q)x (9) 

cos px cos qx= i cos (p + ^)x + \ cos (p — q)x. ( 10 ) 

With p — 8 and q = 7, we get cos 15x and cos x* Their definite integrals are zero. With 
p = 8 and q= 8 , we get cos 16x and cos Ox (which is 1)* Formulas (9) and (10) also 
give a factor 7 * The integral of 7 is n: 

Jo* sin 8 x sin lx dx = - cos 15x dx + i JJ* cos x dx = 0 + 0 

jl* sin 8 x sin 8 x dx — — cosT 6 x dx + cos Ox dx *= 0 + n 

The answer zero is memorable. The answer n appears constantly in Fourier series. 
No ordinary numbers are seen in these integrals* The case p — q—l brings back 
J cos 2 x dx = ^ + 3 sin 2 x* 


SECANTS AND TANGENTS 

When we allow negative powers m and n, the main fact remains true* All integrals 
| sin m x cos ff x dx can be expressed by known functions* The novelty for negative pow- 
ers is that logarithms appear . That happens right at the start, for sin x/cos x and for 
1 /cos x (tangent and secant): 

| tan x dx = - | du/u = — In [cos x| (here u — cos x) 

J sec x dx = | duju = ln|sec x + tan x| (here u = sec x + tan x). 
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For higher powers there is one key identity: 1 + tan 2 x = sec 2 x. That is the old 
identity cos 2 x + sin 2 x = 1 in disguise (just divide by cos 2 x). We switch tangents to 
secants just as we switched sines to cosines. Since (tan x)' = sec 2 x and (secx)' = 
sec x tan x , nothing else comes in. 

EXAMPLE 8 | tan 2 x dx = j(sec 2 x — l)dx ~ tan x — x + C. 

EXAMPLE 9 j tan 3 x dx — j tan x(sec 2 x - 1 )dx. 

The first integral on the right is \udu = ^u 2 , with u = tanx. The last integral is 
— J tan x dx. The complete answer is £(tan x) 2 + ln|cos x | + C. By taking absolute 
values, a negative cosine is also allowed. Avoid cos x - 0. 

(tan x) m ~ 2 dx 

Same idea — separate off (tan x) 2 as sec 2 *— 1. Then integrate (tan x) m_2 sec 2 x dx, 
which is u m ~ 2 du. This leaves the integral on the right, with the exponent lowered by 
2. Every power (tan x) m is eventually reduced to Example 8 or 9, 

EXAMPLE 11 J sec 3 x dx = uv — j v du = sec x tan x — J tan 2 x sec * dx 

This was integration by parts, with u = sec x and u = tan x. In the integral on the 
right, replace tan 2 x by sec 2 x - 1 (this identity is basic): 

j sec 3 * dx = sec * tan x — j sec 3 x dx + j sec * dx. 

Bring j" sec 3 * dx to the left side. That reduces the problem from sec 3 * to sec x. 

I believe those examples make the point— trigonometric integrals are computable . 
Every product tan™* sec 11 * can be reduced to one of these examples. If n is even we 
substitute u = tan x, If m is odd we set u = sec x, If m is even and n is odd, use a 
reduction formula (and always use tan 2 x = sec 2 x - 1). 

I mention very briefly a completely different substitution w = tan ^x. This seems to 
all students and instructors (quite correctly) to come out of the blue: 

2u 1 — m 2 2 du 

sin x = „ 2 and cos x = , and dx=— — ^ (U) 

1 + u 1 + u 1 + u 

The x-integral can involve sums as well as products — not only sin m x cos fl x but also 
1/(5 + sin x — tan x). (No square roots,) The u- integral is a ratio of ordinary polynomi- 
als. It is done by partial fractions . 

Application of j sec * dx to distance on a map (Mercator projection) 

The strange integral ln(secx + tanx) has an everyday application. It measures the 
distance from the equator to latitude x, on a Mercator map of the world. 

All mapmakers face the impossibility of putting part of a sphere onto a flat page. 
You can’t preserve distances, when an orange peel is flattened. But angles can be 
preserved, and Mercator found a way to do it. His map came before Newton and 
Leibniz. Amazingly, and accidentally, somebody matched distances on the map with 
a table of logarithms — and discovered j sec x dx before calculus . You would not be 
surprised to meet sin x, but who would recognize ln(sec x + tan x)? 

The map starts with strips at all latitudes x. The heights are dx> the lengths are 
proportional to cos x. We stretch the strips by 1/cos x — then Figure 7.3c lines up 
evenly on the page. When dx is also divided by cos x, angles are preserved — a small 


EXAMPLE 10 Reduction (tan x) m dx = 




(tan xf 
1 
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Hg. 7,3 Strips at latitude x arc scaled by sec x, making Greenland too large. 


square becomes a bigger square. The distance north adds up the strip heights 
dxj cos jc. This gives J sec x dx. 

The distance to the North Pole is infinite! Close to the Pole, maps are stretched 
totally out of shape. When sailors wanted to go from A to B at a constant angle with 
the North Star, they looked on Mercator’s map to find the angle. 


7.2 EXERCISES 


Read-through questions 

To integrate sin 4 xcos 3 x, replace cos 2 x by q . Then 
(sin 4 x — sin 6 x) cos x dx is b du* In terms of u = sin x the 
integral is c . This idea works for sin m x cos"x if either m 
or n is d . 

If both m and n are g . one method is integration by 

For J sin 4 x dx, split off dv = sin x dx. Then — j v du is 

a . Replacing cos 2 x by h creates a new sin 4 x dx that 
combines with the original one. The result is a reduction to 
j sin 2 x dx, which is known to equal * 

The second method uses the double-angle formula sin 2 x = 

1 Then sin 4 x involves cos 2 fc . Another doubling 
conies from cos 2 2x = l The integral contains the sine of 


To integrate sin6xcos4x, rewrite it as Isin lQx + n . 
The indefinite integral is o . The definite integral from 
0 to 2n is P , The product cos px cos qx is written as 
j cos ( p + q)x + « Its integral is also zero, except if 

f when the answer is « . 

With u = tan x, the integral of tan 9 x sec 2 x is t . Simi- 
larly | sec 9 x (sec x tan x dx) = u . For the combination 
tan m x sec"x we apply the identity tan 2 x = v . After reduc- 
tion we may need j tan x dx — * and j sec x dx — x 


Compute 1-8 by the “general method,” when m or n is odd. 

1 J sin 3 x dx 2 j cos 3 x dx 

3 | sin x cos x dx 4 J cos s x dx 

5 J sm 5 x cos 2 x dx 6 j sin 3 x cos 3 x dx 

7 J v ' Sin x cos x dx 8 j ^/sin x cos 3 x dx 

9 Repeat Problem 6 starting with sin x cos x = ^sin 2x. 


10 Find J sin 2 ax cos ax dx and J sin ax cos ax dx. 


Id 11-16 use the double-angle formulas (m, n even), 
11 sin 2 x dx 12 sin 4 x dx 

13 J cos 2 3x dx 14 | sin 2 x cos 2 x dx 

15 J sin 2 x dx -b J cos 2 x dx 16 J sin 2 x cos 3 2x dx 

17 Use the reduction formula (7) to integrate cos 6 x. 

18 For n > 1 use formula (7) to prove 

'*/2 n_i 

cos H x dx = cos" “ 2 x dx, 

Jo n Jo 


19 For n = 2, 4, 6, ... deduce from Problem 18 that 


w „ j ^ {l){3) ”(n — 1) 
cos x dx = - — -r— — — — — 
0 2 (2)(4)“(n) 


20 For n = 3, 5, 7, ... deduce from Problem 18 that 


**/2 

Jo 


cos n x dx = 


my -in- 1 ) 
(3 K5)-'W 


21 (a) Separate dv = sin x dx from u = sin 11 *x and integrate 
| sin B x dx by parts, 

(b) Substitute 1— stn 2 x for cos 2 x to find a reduction 
formula like equation (7). 

22 For which n does symmetry give cos rt x dx = 0? 

23 Are the integrals (a)-(f) positive, negative, or zero? 

(a) Jo cos 3x sin 3x dx (b) cos x sin 2x dx 

(c) J°. 2jt cos x sin x dx (d) (cos 2 x — sin 2 x)dx 

(e) J^ H cos px sin qx dx (f) J® cos 4 x dx 
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24 Write down equation (9) for p = q = I, and (10) for p = 2, 
q — L Derive (9) from the addition formulas for cos(s + 1) and 
cos(s — f)in Section 1.5. 


In 25-32 compute the indefinite integrals first, then the definite 


integrals. 

25 ft* cos x sin 2x dx 
27 ft cos 99x sin lOlx dx 
29 ft cos 99* cos lOlx dx 
31 ft* cos x/2 sin x/2 dx 


26 ft sin 3x sin 5x dx 
28 ft* cos 2 3x dx 
30 ft* sin x sin 2x sin 3x dx 
32 ft x cos x dx (by parts) 


33 Suppose a Fourier sine series i4sinx + Bsin2x + 
C sin 3x + adds up to x on the interval from 0 to rc. Find 
A by multiplying all those functions (including x) by sin x 
and integrating from 0 to n. (B and C will disappear.) 


34 Suppose a Fourier sine series A sin x 4- B sin 2x + 
C sin 3x+ adds up to 1 on the interval from 0 to rc. Find 
C by multiplying all functions (including 1) by sin 3x 
and integrating from 0 to it. (A and B will disappear.) 


35 In 33, the scries also equals x from -it to 0, because all 
functions are odd Sketch the “sawtooth function,” which 
equals x from —it to n and then has period 2 n. What is the 
sum of the sine series at x = rc? 


36 In 34, the series equals —1 from —n to 0, because sines 
are odd functions. Sketch the “square wave,” which is 
alternately —1 and +1, and find A and B. 


45 

47 

49 

51 


| tan x sec 3 x dx 

j tan*x dx 

J cot x dx 

" sin x 
— 3-dx 

CO$ J X 


46 J sec 4 x dx 
48 } tan 5 x dx 
50 | esc x dx 


52 


dx 


53 Choose A so that cos x — sin x = A cos(x + rc/4), Then 
integrate l/(cosx — sin x). 

54 Choose A so that cos x — ^3 sin x = A cos(x + x/3). Then 
integrate 1 /(cos x - ^3 sin x) 2 . 

55 Evaluate ft* |cos x — sin x| dx. 

56 Show that a cos x + b sin x = ^Ja 1 + b 2 cos (x - <x ) and 
find the correct phase angle at. 


57 If a square Mercator map shows 1000 miles at latitude 
30*, how many miles does it show at latitude 60°? 


58 When lengths are scaled by secx, area is sealed by 

. Why is the area from the equator to latitude x 

proportional to tanx? 

59 Use substitution (11) to find J dx/(l -hcosx). 

60 Explain from areas why ft sin 2 x dx = ft cos 2 x dx. These 

integrals add to ft 1 dx, so they both equal . 

61 What product sin px sin qx is graphed below? Check 
that (p cos px sin qx - q sin px cos qx)f{q 2 - p 2 ) has this 
derivative. 


37 The area under y = s in x from 0 to n is positive. Which 
frequencies p have ft sin px dx - 0? 

38 Which frequencies q have ft cos qx dx = 0? 

39 For which p, q is ft sin px cos qx dx — 0? 

40 Show that ft sin px sin qx dx is always zero. 

Compute the indefinite integrals 41-52. 

41 J sec x tan x dx 42 J tan 5x dx 

43 | tan 2 x sec 2 x dx 44 j tan 2 x sec x dx 


62 Finish j sec 3 x dx in Example 1 1. This is needed for the 
length of a parabola and a spiral (Problem 7.3.8 and 
Sections 8.2 and 9,3). 




7.3 Trigonometric Substitutions 



The most powerful tool we have, for integrating with pencil and paper and brain, is 
the method of substitution. To make it work, we have to think of good substitutions — 
which make the integral simpler. This section concentrates on the single most valu- 
able collection of substitutions. They are the only ones you should memorize, and 
two examples are given immediately. 
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To integrate 


v /l - x 2 , substitute x = sin 8, Do not set u — l- x 2 is missing^ 


Jl - x 2 dx 


(cos #)(cos 8 d8) 


dx 


\-x 2 


v 


fcos 0 dO 
cos 0 


The expression v /l — x 2 is awkward as a function of x. It becomes graceful as a 
function of 8. We are practically invited to use the equation 1 — (sin 8) 2 = (cos 8) 2 . 
Then the square root is simply cos 8 — provided this cosine is positive. 

Notice the change in dx , When x is sin 8 , dx is cos 8d8. Figure 7.4a shows the 
original area with new letters. Figure 7.4b shows an equal area, after rewriting 
j (cos #)(cos 6 dO) as j (cos 2 #) d8. Changing from x to 8 gives a new height and a new 
base. There is no change in area — that is the point of substitution. 

To put it bluntly: If we go from yj 1 — x 2 to cos 0, and forget the difference between 
dx and dd , and just compute j cos 8 d8 , the answer is totally wrong. 





2 2 

Fig. 7.4 Same area for yjl — x 2 dx and cos 2 0 d&. Third area is wrong: dx / dO 

We still need the integral of cos 2 #. This was Example 3 of integration by parts, and 
also equation 7,2,6. It is worth memorizing. The example shows this 8 integral, and 
returns to x: 


EXAMPLE 1 j cos 2 # d8 = \ sin 8 cos 8 + \8 is after substitution 

j yj\ - x 2 dx = jx^\ - x 2 4- £ sin - i x is the original problem. 

We changed sin 8 back to x and cos 8 to yj 1 - x 2 . Notice that 8 is sin -1 x. The answer 
is trickier than you might expect for the area under a circular arc. Figure 7.5 shows 
how the two pieces of the integral are the areas of a pie-shaped wedge and a triangle. 


EXAMPLE 2 


dx 


V 1-* 2 


* cos 8 d8 
cos 8 


= 8 4 C = sin 


1 x 4 C 


Remember: We already know sin 1 x. Its derivative l/yjl — x 2 was computed in 
Section 4.4. That solves the example. But instead of matching this special problem 





0 


Rg. 7,5 J yj 1 — x 1 dx is a sum of simpler areas. Infinite graph but finite area. 
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with a memory from Chapter 4, the substitution * = sin 9 makes the solution auto- 
matic, From J dd = 9 we go back to sin -1 *. 

The rest of this section is about other substitutions. They are more complicated 
than * = sin 0 (but closely related). A table will display the three main choices — sin 0 , 
tan 9, sec 9 — and their uses. 


TRIGONOMETRIC SUBSTITUTIONS 


After working with +J 1 x 2 , the next step is ^/A ~ x 2 . The change * = sin 0 simplified 
the first, but it does nothing for the second: 4 - sin 2 0 is not familiar. Nevertheless a 
factor of 2 makes everything work. Instead of * = sin 9 , the idea is to substitute x = 
2 sin 8: 

*Ja - x 2 = y/4 - 4 sin 2 ^ = 2 cos 9 and dx = 2 cos 9 d9. 

Notice both 2 5 s. The integral is 4 J cos 2 0 d8 = 2 sin 0 cos 9 + 29. But watch closely. 
This is not 4 times the previous J cos 2 0 d8\ Since x is 2 sin 9 , 9 is now sin^ 1 (x/2). 

EXAMPLE 3 J *Ja - x 2 dx = A\ cos 2 0 d8 - Xy/T- (x/2) 2 + 2 sin^ *(x/2). 

Based on 1 - x 1 and y /A ~ x 2 , here is the general rule for yfa 2 x 2 . Substitute 
x = a sin 0. Then the a* s separate out: 

yfa 2 - x 1 = yfa 2 - a 2 $in 2 d = a cos 8 and dx = a cos 8 d9 , 

That is the automatic substitution to try, whenever the square root appears. 


EXAMPLE 4 



'* /2 4 cos 9 dd 

e=o y/A 1 - 4 2 (sin 8) 2 



Here a 2 — 16. Then a — A and x = A sin 9. The integral has 4 cos 0 above and below, 
so it is J dd . The antiderivative is just 0, For the definite integral notice that *=4 
means sin 0=1, and this means 9— n/2, 

A table of integrals would hide that substitution. The table only gives sin _1 (x/4). 
There is no mention of [ dO = 0, Bur what if 16 - x 2 changes to x 2 — 16? 


EXAMPLE 5 , - ? 

y/x 2 - 16 

Notice the two changes — the sign in the square root and the limits on x. Example 4 
stayed inside the interval [x| ^4, where 16 - x 1 has a square root. Example 5 stays 
outside, where x 1 - 16 has a square root. The new problem cannot use * = 4 sin 0, 
because we don’t want the square root of — cos 2 0. 

The new substitution is x~ A sec 0 . This turns the square root into A tan 0: 

* = 4 sec 9 gives dx= 4 sec 0 tan 9 dd and x 2 - 16 = 16 sec 2 0 - 16 = 16 tan 2 0. 

This substitution solves the example, when the limits are changed to 0: 

f *' 3 4 sec 0 tan dO f* /3 1 „ _ -i*m r 

= sec 0 d9 = ln(sec 0 + tan 0 ) L = ln{2 + ^/3). 

Jo 4 tan u Jo 

I want to emphasize the three steps. First came the substitution x = 4sec0, An 

unrecognizable integral became J sec 0 dd , Second came the new limits (0 = 0 when 

x = 4, 0 = te/3 when * = 8). Then I integrated sec 0, 
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Example 6 has the same x 2 - 16. So the substitution is again x = 4 sec 0: 


EXAMPLE 6 


f- 16 dx 

** /2 64 sec 0 tan 0 dO 

’* /2 cos 0 dO 

Jx = 8 (* 2 - 16) 3/2 _ J 

«=,/3 (4 tan 0) 3 

,13 sin 2 0 


Step one substitutes x = 4 sec 0. Step two changes the limits to 0. The upper limit 
x = oo becomes 0 = tt/2, where the secant is infinite. The limit x = 8 again means 0 = 
te/3. To get a grip on the integral, I also changed to sines and cosines. 

The integral of cos 0/sin 2 0 needs another substitution! (Or else recognize 
cot 0 esc 0.) With u = sin 0 we have J du/u 2 = — 1/m = — 1/sin 0: 


Solution 


** /2 cos 0 dO _ - 1 * /2 _ _ j + 2 
«/3 sin 2 0 sin 0_| K/3 > /3' 


Warning With lower limit 0 = 0 (or x = 4) this integral would be a disaster. It divides 
by sin 0, which is zero. This area is infinite. 


{Warning) 2 Example 5 also blew up at x = 4, but the area was not infinite. To make 
the point directly, compare x -l/2 to x _3/2 . Both blow up at x = 0, but the first one 
has finite area: 



Section 7.5 separates finite areas (slow 
growth of x _3/2 ). 



growth of l/ v /x) from infinite areas (fast 


Last substitution Together with 16 - x 2 and x 2 — 16 comes the possibility 16 + x 2 . 
(You might ask about -16-x 2 , but for obvious reasons we don’t take its square 
root.) This third form 16 + x 2 requires a third substitution x = 4tan0. Then 
16 4- x 2 = 16 + 16 tan 2 0 = 16 sec 2 0. Here is an example: 


EXAMPLE 7 


dx _ f* /2 4 sec 2 0 d6 _ \ y * /2 _ n 
X=0 lb-fx 2 J e=0 16sec 2 0 4 _ 0 8 



Table of substitutions for a 2 - x 2 , a 2 

+ X 2 , 

X 2 

-a 2 

x = a sin 0 

replaces 

a 2 - x 2 by 

a 2 cos 2 0 and 

dx 

by 

a cos 0 dO 

x = a tan 0 

replaces 

a 2 + x 2 by 

a 2 sec 2 0 and 

dx 

by 

a sec 2 0 dO 

x = a sec 0 

replaces 

x 2 - a 2 by 

tf 2 tan 2 0 and 

dx 

by 

a sec 0 tan 0 d9 


Note There is a subtle difference between changing x to sin 0 and changing sin 0 to u: 
in Example 1, dx was replaced by cos 0 dO (new method) 


in Example 6, cos 0 dO was already there and became du (old method). 


The combination cos 0 dO was put into the first and pulled out of the second. 

My point is that Chapter 5 needed du/dx inside the integral. Then (du/dx)dx 
became du. Now it is not necessary to see so far ahead. We can try any substitution. 
If it works, we win. In this section, x = sin 0 or sec 0 or tan 0 is bound to succeed. 


NEW 


r dx 

ji +* 2 . 


dO by trying x = tan 0 


OLD 


r x d* _ r du 

J 1 + .X 2 “ J Yu 


by seeing du 
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We mention the hyperbolic substitutions tanh 0, sinh 0, and cosh 0 , The table below 
shows their use. They give new forms for the same integrals. If you are familiar with 
hyperbolic functions the new form might look simpler — as it does in Example 8. 


x = a tanh 8 

replaces a 2 - x 2 

by a 2 sech 2 0 

and 

dx 

by 

a sech 2 0 d8 

x = a sinh 8 

replaces a 2 4 x 1 

by a 2 cosh 2 0 

and 

dx 

by 

a cosh 0 dO 

x = a cosh 8 

replaces x 2 — a 2 

by a 2 sinh 2 # 

and 

dx 

by 

a sinh 8 dO 

EXAMPLE 8 

f dx 

f sinh Odd 

-C 

= cosh 1 

x 4 C. 

J Jx 2 - 1 

J sinh 8 


| dd is simple. The bad part is cosh *x at the end. Compare with x = sec 6: 


I 


dx 

7^ 


I 


sec 8 tan 8 dd 
tan 0 


- ln(sec 8 + tan 8} + C = ln(x -f yfx 1 


1)+C. 


This way looks harder, but most tables prefer that final logarithm. It is clearer than 
cosh “ L x, even if it takes more space. All answers agree if Problem 35 is correct. 


COMPLETING THE SQUARE 


We have not said what to do for x j x 2 - 2x + 2 or *J~—x 2 + 2x. Those square roots 
contain a linear term — a multiple of x. The device for removing linear terms is worth 
knowing. It is called completing the square , and two examples will begin to explain it: 

x 2 - 2x + 2 = (x~ 1) 2 + 1 = u 2 -r 1 

- x 2 + 2x~ - {x - l ) 2 + 1 = l - u 2 > 

The idea has three steps. First, get the x 2 and x terms into one square. Here that 
square was (x — l) 2 = x 2 — 2x + 1, Second, fix up the constant term. Here we recover 
the original functions by adding 1, Third, set u = x — 1 to leave no linear term. Then 
the integral goes forward based on the substitutions of this section: 

f dx _ f du f dx _ f du 

J ^Jx 2 — 2 x + 2 J yju 2 4- 1 J y/2x — x 7 J yj\ ~ u 2 

The same idea applies to any quadratic that contains a linear term 2bx : 

rewrite x 2 + 2 hx - c as (x + b) 2 + C, with C = c - b 2 
rewrite x 2 4 2 hx ■+ c as - (x - h) 2 + C with C = c + h 2 
To match the quadratic with the square, we fix up the constant: 

x 2 + 10x4 16= (x4 5) 2 4 C leads to C= 16-25= -9 

-x 2 + lOx 4 16 = - (x — 5) 2 4 C leads to C = 16 + 25 = 41. 


EXAMPLE 9 


r dx r dx _ r du 

J x 2 + 10x+ 16 J (x + 5) 2 -9 J u 2 -9' 


Here u = x + 5 and du = dx. Now comes a choice— struggle on with u = 3 sec 8 or 
look for | duj(u 2 - a 2 ) inside the front cover. Then set a = 3: 


J 


du 1 

J^9 = 6 ln 


u-3 


u+3 


= 6 ln 


x + 2 
x + 8 
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Note If the quadratic starts with Sx 2 or —5x 2 , factor out the 5 first: 

5x 2 — lOx + 25 = 5(x 2 — 2x + 5) = (complete the square) = 5[(x — l) 2 + 4]. 
Now u = x ~ 1 produces 5[u 2 + 4]. This is ready for table lookup or u — 2 tan 0: 


EXAMPLE 10 


f * = f 

J 5x 2 — 10x + 25 J 


du 


5[u 2 +4] 


2 sec 2 0 d0 
5[4 sec 3 9] 


10 


1 


de. 


This answer is 0/10 -1- C. Now go backwards: 0/10 = (tan“ 1 ju)j 10 = {tan -1 ${x - 1 ))/ 1 0. 
Nobody could see that from the start, A double substitution takes practice, from x 
to u to 0, Then go backwards from 0 to u to x. 


Final remark For u 2 + a 2 we substitute u = a tan 0* For u 2 - a 2 we substitute u = 
a sec 0, This big dividing line depends on whether the constant C (after completing 
the square) is positive or negative. We either have C= a 2 or C — —a 2 . The same 
dividing line in the original x 2 + 2bx + c is between c > b 2 and c < ft 1 . In between, 
c = b 2 yields the perfect square (x + ft) 2 — and no trigonometric substitution at all. 


7.3 EXERCISES 


Read-through questions 

The function yjl — x 1 suggests the substitution x = o . 
The square root becomes b and dx changes to c . 
The integral J(1 ^ — x 2 ) 3/2 dx becomes j d d0. The interval 
^ < x < 1 changes to o < 0 < t . 

For a 2 — x 1 the substitution is x= q with djc = 
ft . For x 2 -a 2 we use x = » with dx= \ . Then 

J dx/( 1 + x 2 ) becomes J dO , because 1 + tan 2 0 = fc . The 
answer is 9 — tan -1 x. We already knew that t is the 
derivative of tan - 

The quadratic x 2 + 2 bx + c contains a ™ term 2bx. To 
remove it we n the square. This gives (x 4- b) 2 + C with 
C = o . The example x 2 + 4x + 9 becomes p . Then 
v = x + 2. In case x 2 erners with a minus sign, — x 2 + 4x + 9 
becomes i Q ) 2 + * . When the quadratic contains 

4x 2 , start by factoring out > . 


Integrate 1-20 by substitution. Gunge 0 back to x. 


1 

3 

5 

7 

9 


1 


•jA — x 2 
Ja~x 2 dx 
x 2 dx 


f_*L_ 

J(l+X 2 ) : 


x 2 — 25 


dx 


2 

4 

6 


1 

J 

1 


dx 

Jx 2 -a 2 

dx 

1 + 9x 2 


dx 


x 2 J\^x 2 


8 f Jx 1 + a 1 dx (see 7,2.62) 


10 


x 3 dx 
y/9-x 2 


19 


I 


dx 

12 

J V** — x 8 dx 

v/* 6 -* 4 

dx 

14 

f dx 

(l+x 2 ) 3/2 

J (1 -X 2 ?> 2 

dx 


[Jl + x 2 dx 

(x 2 -9) 1 ' 2 



x 2 dx 

IS 

C X 1 dx 


)* 2 + A 

dx 

*20 

f x 2 dx 


21 (Important) This section started with x = sin 0 and 


J dxj^fl — x 2 = jd0 = S — sin^ l x. 

(a) Use x = cos 0 to get a different answer. 

(b) How can the same integral give two answers? 

22 Compute J dx/x^/x 1 — 1 with x = sec 0. Recompute with 
x = csc 0. How can both answers be correct? 


23 Integrate x/(x 2 + 1) with x = tan 0, and also directly as a 
logarithm. Show that the results agree. 

24 Show that f dxjxy/x*— 1 sec - ‘(x 2 ). 


Calculate the definite integrals 25-32. 


25 j v fa 1 — x 2 dx = area of 


X 
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r i 

32 v 1 — x z dx — area of 

J 1.2 


33 Combine the integrals to prove the reduction formula 
(« * 0): 


34 Integrate 1 /cosa and 1,(1 + cosx) and V /1 Tcos.x. 

35 {a) \ = gives f dx-^/x 2 — 1 = ln(sec + tan 0 ), 

(b) From the triangle, t his ans wer is/= ln(.v v /a 2 — 1}. 

Check that dfdx = 1 v a 2 — 1, 

(e) Verify that cosh/= \ (e f + e / ) = x. Then/= cosh 1 a, 
the answer in Example 8. 

36 (a) a = ^ gives f dx;\ x 1 + 1 — ln(sec 0 + tan 0). 

(h ) The s econd triangle converts thi s answ er to g = ln(\ + 

v a 2 + 1 ). Check that tig; tlx = 1 \/.v 2 + I , 

(c) Verify that sinh £ = \ {e* — e~ x ) = x so £ = sinh 1 a. 

(d) Substitute v = sinli£ directly into f d.v/^/x" + 1 and 
integrate. 



in 37-42 substitute \ = sinh 0, cosh 0. or tanh 0. After integ- 
ration change back to a. 


37 


r 


dx 


J v v " ” 


39 



41 


dx 

l - A 2 


38 

40 

42 


r d X 


A\ 1 - 

v 2 

V“ — 

1 



x 2 

dx 

V 1 + A 

— Jv 

..2 

u A 


Rewrite 43-48 as (a + b} 2 + C or —(a — 6) 2 + C by completing 
the square. 


43 a 2 - 4x + 8 
45 a 2 — 6a 
47 x 2 + 2 a + 1 


44 - a 2 + 2a 4- 8 

46 - x 2 + 10 
48 a 2 +4x- 12 


49 For the three functions fix) in Problems 43, 45, 47 
integrate 1 ifix). 

50 For the three functions £<a) in Problems 44, 46, 48 
integrate \!y/g(x). 

51 For j dxi(x z + 2hx + < J why does the answer have different 
forms for h 2 > c and h 1 < c? What is the answer if h 2 = c? 


52 What substitution u = x + h or u = a — h will remove the 
linear term? 


(a) I 

¥> 

dx 

,b,j 

dx 

a 2 — 4a + c 

3x 2 -r 6 a 

(c) 1 

dx 

(d) 

r dx 

-A 2 + 10a + c 

2x 2 - x 


53 Find the mistake. With a = sin 0 and V T - x 2 = cos 0, 
substituting dx = cos 0 dO changes 



cos 2 tf d0 


into 


*o 

v l - A 2 fix. 


54 (a) If a = tan 0 then J v 1 4- x 2 dx = \ ^ dO. 

(b) Convert i[sec 0 tan 0 + ln(sec 0 + tan tf)] back to x. 

(c) If v = sinh 0 then j v 1 + x 2 dx = J dO, 

(d) Convert j[sinh cosh 0 4- fi] back to x. 

These answers agree. In Section 8.2 they wall give the length 
of a parabola. Compare w ith Problem 7.2.62. 


55 Rescale x and y in Figure 7.5b to produce the equal area 
j y dx in Figure 7.5c. What happens to y and what happens 
to dx? 


56 llraw y— 1 v T — a 2 and y-1 V T6 — x 2 to the same 
scale (1" across and up; 4" across and j" up) r 


57 What is wrong, if anything, with 



= sin 


i 


7.4 Partial Fractions 


This section is about rational functions P(x) Q{x), Sometimes their integrals are also 
rational functions (ratios of polynomials). More often they are not. It is very common 
for the integral of P;Q to involve logarithms. We meet logarithms immediately in the 
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simple case l/(x — 2), whose integral is ln|x — 2| + C. We meet them again in a sum 
of simple cases: 

Jl^2 + 7T2 “ x]^ = lDl * “ 21 + 31n|jC + 21 " 4bW + C 

Our plan is to split PjQ into a sum like this — and integrate each piece . 

Which rational function produced that particular sum? It was 

1 3 4 _ (x + 2)(x) + 3(jc — 2)(jc) - 4(x - 2)(x + 2) _ -4x + 16 

x - 2 + x + 2 x (x- 2)(x + 2)(x) (x - 2)(x + 2)(x)' 

This is PjQ . It is a ratio of polynomials, degree 1 over degree 3. The pieces of P are 
collected into - Ax + 16. The common denominator (x — 2)(x + 2)(x) ~x 3 — Ax is Q. 
But I kept these factors separate, for the following reason. When we start with PjQ 7 
and break it into a sum of pieces, the first things we need are the factors of Q . 

In the standard problem PjQ is given. To integrate it, we break it up. The goal of 
partial fractions is to find the pieces — to prepare for integration . That is the technique 
to learn in this section, and we start right away with examples. 


EXAMPLE 1 Suppose PjQ has the same Q but a different numerator P: 

P 3x 2 + 8x-4 ABC 
Q (x-2)(x + 2)(x) x-2 x+2 + x' ( J 

Notice the form of those pieces! They are the “ partial fractions ” that add to PjQ . 
Each one is a constant divided by a factor of Q. We know the factors x — 2 and x + 2 
and x We don’t know the constants A, fl, C. In the previous case they were 1, 3, — A. 
In this and other examples, there are two ways to find them. 

Method 1 (slow) Put the right side of (1) over the common denominator Q : 

3x 2 + 8x — 4 _ A{x + 2)(x) + B(x - 2)(x) + C(x - 2)(x + 2) 

Q (x-2)(x + 2)(x) _ () 


Why is A multiplied by (x + 2)(x)? Because canceling those factors will leave Aj(x — 2} 
as in equation (1). Similarly we have Bj(x + 2) and C/x. Choose the numbers A, B, C 
so that the numerators match . As soon as they agree, the splitting is correct. 


Method 2 (quicker) Multiply equation (1) by x - 2, That leaves a space: 

3x 2 + 8x-4 _ B(x — 2) C(x — 2) 

(x + 2){x)“ + x + 2 + i ' 


(3) 


Sow set x = 2 and immediately you have A. The last two terms of (3) are zero, because 
x — 2 is zero when x = 2. On the left side, x ~ 2 gives 


3(2) 2 + 8(2) - 4 

(2 + 2 ) ( 2 ) 


24 

8 


3 (which is A\ 


Notice how multiplying by x — 2 produced a hole on the left side. Method 2 is the 
“cover-up method” Cover up x — 2 and then substitute x = 2, The result is 3 = 
A + 0 + 0, just what we wanted. 

In Method 1, the numerators of equation (2) must agree. The factors that multiply 
B and C are again zero at x = 2. That leads to the same A — but the cover-up method 
avoids the unnecessary step of writing down equation (2). 
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Calculation of B Multiply equation {1) by x + 2, which covers up the ( x + 2): 

3x 2 + 8x - 4 _ A(x + 2) C{x + 2) 

(x - 2 ) (x) x-2 x 

Now set x = - 2, so A and C are multiplied by zero: 

3(— 2) 2 + 8<— 2)-4 _ -8 _ 

{- 2 - 2 ) (- 2 ) 8 


( 4 ) 


This is almost full speed, but {4) was not needed. Just cover up in Q and give x the 
right value (which makes the covered factor zero). 


Calculation of C (quickest) In equation (1), cover up the factor (x) and set x = 0: 

3(0) z + 8(0) — 4 _ -4 

(0 - 2)(0 + 2) -4 


To repeat: The same result A~ — 1, C = 1 comes from Method 1. 


EXAMPLE 2 


x + 2 A B 

{x - l)(x + 3) “ x-~T + x + 3 


First cover up (x - 1) on the left and set x = 1. Next cover up (x + 3) and set x - 3: 

1 + 2 _ 3 _ . -3 + 2 _ -1 

( ){1 + 3 ) 4 (- 3 - 1 )( )“^ 4 _ 

The integral is £ln|x - 1| + *ln|x + 3| + C. 


EXAMPLES This was needed for the logistic equation in Section 6.5: 


1 

y{c - by) 


A 

y 


B 

+ 

c — by 


( 6 ) 


First multiply by y. That covers up y in the first two terms and changes B to By . 
Then set y = 0. The equation becomes 1/c = A , 

To find B f multiply by c — by . That covers up c — by in the outside terms. In the 
middle, A times c — by will be zero at y= cjb. That leaves B on the right equal to 
1 jy~bjc on the left. Then A=\jc and B = bjc give the integral announced in 
Equation 6.5.9: 

dy _ 

cy ~ by 1 


dy + 


b ay _ In y 
J c(c - by) c 


ln(c - by) 


(7) 


It is time to admit that the general method of partial fractions can be very awkward. 
First of all, it requires the factors of the denominator Q. When Q is a quadratic 
ax 2 + bx + c, we can find its roots and its factors. In theory a cubic or a quartic can 
also be factored, but in practice only a few are possible — for example x 4 - 1 is 
(x 1 - l)(x 2 + 1). Even for this good example, two of the roots are imaginary . We can 
split x 2 - 1 into (x + l)(x - 1). We cannot split x z + 1 without introducing L 
The method of partial fractions can work directly with x 1 + 1, as we now see. 


EXAMPLE 4 


1 


3x 2 + 2x + 7 
x 2 + 1 


dx 


{a quadratic over a quadratic). 


This has another difficulty. The degree of P equals the degree of Q (= 2). Partial 
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fractions cannot start until P has lower degree. Therefore I divide the leading term x 2 3 4 
into the leading term 3x 2 . That gives 3, which is separated off by itself: 


3x 2 + 2x + 7 2x + 4 
x 2 +l 3 + x 2 + 1 


( 8 ) 


Note how 3 really used 3x 2 + 3 from the original numerator. That left 2x + 4. Partial 
fractions will accept a linear factor 2x + 4 (or Ax + B, not just A) above a quadratic. 

This example contains 2xj(x 2 + 1), which integrates to ln(x 2 + 1). The final 
4 j(x 2 + 1) integrates to 4 tan _1 x. When the denominator is x 2 + x + t we complete 
the square before integrating. The point of Sections 7,2 and 7,3 was to make that 
integration possible. This section gets the fraction ready — in parts. 

The essential point is that we never have to go higher than quadratics. Every 
denominator Q can be split into linear factors and quadratic factors , There is no magic 
way to find those factors, and most examples begin by giving them. They go into 
their own fractions, and they have their own numerators — which are the A and B 
and 2x + 4 we have been computing. 

The one remaining question is what to do if a factor is repeated , This happens in 
Example 5, 


EXAMPLE 5 


2* +3 A B 

(x-i) 2 ~( X -i) + ( X ~ir 


The key is the new term Bj(x — l) 2 . That is the right form to expect. With (x — l)(x — 2) 
this term would have been Bj(x - 2), But when (x — 1) is repeated, something new is 
needed. To find B, multiply through by (x — l) 2 and set x = 1: 


2x + 3 = A(x — 1 ) + B becomes 5 = B when x = 1 , 

This cover-up method gives B. Then A = 2 is easy, and the integral is 
2 lnjx - 1| - 5/{x - 1). The fraction 5/(x - l) 2 has an integral without logarithms. 


EXAMPLE 6 


2x 2 + 9x 2 + 4 A B Cx + D E 

x\x 2 + 4)(x- \)~ x x 2 x 2 + 4 x — 1 


This final example has almost everything! It is more of a game than a calculus 
problem. In fact calculus doesn’t enter until we integrate (and nothing is new there). 
Before computing A , B, C f D f E , we write down the overall rules for partial fractions: 

1. The degree of P must be less than the degree of Q. Otherwise divide their leading 
terms as in equation (8) to lower the degree of P. Here 3 < 5. 

2. Expect the fractions illustrated by Example 6. The linear factors x and x + 1 
(and the repeated x 1 ) are underneath constants. The quadratic x 2 + 4 is under a 
linear term. A repeated (x 2 + 4) 2 would be under a new Fx + G, 

3. Find the numbers A , B, C, by any means, including cover-up. 

4. Integrate each term separately and add. 

We could prove that this method always works. It makes better sense to show that 
it works once, in Example 6, 

To find £, cover up (x - 1) on the left and substitute x=l, Then £ = 3. 
To find B , cover up x 2 on the left and set x = 0. Then B = 4/(G + 4)(G— 1)= — 1, 
The cover-up method has done its job, and there are several ways to find A , C , D. 
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Compare the numerators, after multiplying through by the common denominator Q : 
2x 3 + 9x 2 + 4= Ax(x 2 + 4)(x — 1) — (x 2 + 4){x — 1) + ( Cx + D)(x 2 )(x - 1) + 3x 2 (x 2 + 4). 
The known terms on the right, from B= — 1 and E= 3, can move to the left: 

- 3x 4 + 3x 3 — 4x 2 + 4x = Ax{x 2 + 4)(x — 1) + (Cx + D)x 2 (x - 1)* 

We can divide through by x and x - 1, which checks that B and E were correct: 

- 3x 2 — 4 = A(x 2 + 4) + (Cx + D)x. 

Finally x = 0 yields A = - 1, This leaves — 2x 2 = (Cx + D)x. Then C = — 2 and 
D = 0. 

You should never have to do such a problem] I never intend to do another one. 
It completely depends on expecting the right form and matching the numerators. 
They could also be matched by comparing coefficients of x 4 , x 3 , x 2 , x, 1 — to give five 
equations for A , B, C, D, E. That is an invitation to human error. Cover-up is the 
way to start, and usually the way to finish. With repeated factors and quadratic 
factors, match numerators at the end. 


7.4 EXERCISES 


Read-through questions 

The idea of q fractions is to express P(x)/Q(x) as a b 
of simpler terms, each one easy to integrate. To begin, the 
degree of P should be c the degree of Q . Then Q is split 
into d factors like x — 5 (possibly repeated) and quadratic 
factors like x 2 + x + 1 (possibly repeated). The quadratic 
factors have two o roots, and do not allow real linear 
factors. 

A factor like x — 5 contributes a fraction At t Its 
integral is o . To compute A , cover up ft in the 
denominator of P/Q . Then set x = 1 and the rest of 

P/Q becomes A. An equivalent method puts all fractions over 
a common denominator (which is I V Then match the 
fc At the same point x = l this matching gives A. 

A repeated linear factor (x — 5) 2 contributes not only 
A!(x — 5) but also Bt m , A quadratic factor like x 2 x + 1 
contributes a fraction n t lx 2 + x + 1) involving C and D. 
A repeated quadratic factor or a triple linear factor would 
bring in (£x + F)/(x 2 + x + l) 2 or G/(x — 5) 3 . The conclusion 
is that any PjQ can be split into partial o which can 
always be integrated. 

1 Find the numbers A and B to split l/(x 2 — x): 

] A B 

X(JC- 1)“ * + .X- f 

Cover up x and set x = 0 to find A. Cover up x — 1 and set 
x = 1 to find B. Then integrate. 

2 Find the numbers A and B to split l/(x 2 — 1): 


Multiply by x — 1 and set x = 1. Multiply by x + 1 and set 
x = — 1 . Integrate. Then find A and B again by method 1 — 
with numerator A(x + 1) + £(x — 1) equal to 1. 


Express the rational functions 3-16 as partial fractions: 


1 


(x — 3)(x — 2) 
x 2 + 1 

(x)(x + l)(x + 2) 
3x+ 1 


3x 2 

9 - 5 — r (divide first) 


11 

13 

15 


x 2 + 1 
1 


x 2 (x — 1) 


1 


x(x — l)(x — 2)(x — 3) 
1 1 


4 

6 

8 

10 

12 

14 


(x — 3)(x — 2) 
1 


3x+ 1 

(*- l ) 2 
1 

(x - 1)^+ ij 
x 


x 2 — 4 

x 2 + 1 
x+l 


(divide first) 


x 4 -l (x + l)(x — l)(x 2 + 1) 


16 — * f remember the — term | 

x*(x \) \ X } 

17 Apply Method 1 (matching numerators) to Example 3: 


■ = - + 


B A(c — by) + By 


cy — by 2 y c — by y(c — by) 


1 A B 

x l -\ ~ x- 1 + x+ 1 


Match the numerators on the far left and far right. Why does 
Ac = 1? Why does — hA + B = 0? What are A and B? 
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18 What goes wrong if we look for A and B so that 

** ^ , B , 

(x - 3)(x + 3) x — 3 x + 3 ‘ 

Over a common denominator, try to match the numerators. 
What to do first? 


19 Split 



3x 2 A Bx + C 

(x-lXx^ + x + l) mt °x-l + x 2 +x + r 


(a) Cover up x — 1 and set x = 1 to find A. 

(b) Subtract Af(x - 1) from the left side. Find Bx + C. 

(c) Integrate all terms* Why do we already know 


ln(x 3 - 1) = ln(x - I) -f ln(x 2 + x + 1)? 


20 Solve dy/dt = 1 — y 2 by separating j dyjl — y 2 = j dt. Then 


1 1 1/2 1/2 

i-j^O-jW+yr i-y + i+y 


Integration gives ^ln = t + C. With y 0 = 0 the con< 

stant is C = * Taking exponentials gives 

The solution is v = * This is the S-eurve. 


By substitution change 21-28 to integrals of rational functions* 
Problem 23 integrates 1/sin 0 with no special trick* 


21 

r e*dx 

) e lx — e* 

23 

f sin 0 dO 

J 1 — cos 2 # 

25 


27 

f * 


1 + yj X + 1 


22 

J I +^/x 

24 

r dt 

Jfc'-o 2 

26 

[& 

28 

f ^ 

J Jx + tfx 


29 Multiply this partial fraction by x — a* Then let x a: 

__1 A_ 

Q(x)~ x~a + 

Show that A — 1 IQ\q\ When x = a is a double root this fails 
because Q f (a) — 

1 A 

30 Find A in —= — - = — - + Use Problem 29* 

x B -1 x — 1 


31 {for instructors only) Which rational functions P/0 are the 
derivatives of other rational functions (no logarithms)? 



7.5 Improper Integrals 



“ Improper ” means that some part of _f* y(x)dx becomes infinite. It might be b or a or 
the function y* The region under the graph reaches infinitely far — to the right or left 
or up or down, (Those come from b ~ oo and a = — oo and y — > cc and y -> — oo*) 
Nevertheless the integral may “converge” Just because the region is infinite* it is not 
automatic that the area is infinite. That is the point of this section — to decide when 
improper integrals have proper answers* 

The first examples show finite area when b — oo, then a = - oo, then y = 1 jyfx at 
x = 0. The areas in Figure 7*6 are 1, 1, 2: 



Rg, 7,6 The shaded areas are finite but the regions go to infinity* 
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In practice we substitute the dangerous limits and watch what happens. When the 
integral is — 1/x, substituting b= oo gives “-l/oo=0.” When the integral is e x , 
substituting a = - oo gives “e“ “ = 0.” I think that is fair, and I know it is successful. 
But it is not completely precise. 

The strict rules involve a limit. Calculus sneaks up on l/oo and e~ °° just as it 
sneaks up on 0/0, Instead of swallowing an infinite region all at once, the formal 
definitions push out to the limit: 


DEFINITION 


y{x)dx = lim 

b-+ co 


y(x)dx y{x)dx = lim y(x)dx< 

J-® Ja 


The conclusion is the same. The first examples converged to 1, 1, 2. Now come two 
more examples going out to h = oo: 


The area under l/x is infinite : 


00 dx 

1 * 


= OO 

(1) 

1-pjl p-l' 

(2) 


The area under 1/x is like ! +l+i+i+ -*\ which is also infinite. In fact the sum 
approximates the integral — the curved area is dose to the rectangular area. They go 
together (slowly to infinity). 

A larger p brings the graph more quickly to zero. Figure 7.7a shows a finite area 
1 l(p— 1) = 100. The region is still infinite, but we can cover it with strips cut out of 
a square! The borderline for finite area is p— 1. ! call it the borderline, but p = I is 
strictly on the side of divergence. 

The borderline is also p = 1 when the function climbs the y axis . At x = 0, the graph 
of y = 1 jx p goes to infinity. For p = 1, the area under 1/x is again infinite. But at x = 
0 it is a small p (meaning p < 1) that produces finite area: 




a-p 


1 - p 



1 

1 


if p < L 


(3) 


Loosely speaking “-In 0 = oo.” Strictly speaking we integrate from the point x = a 
near zero, to get j* dx/x — — In a. As a approaches zero, the area shows itself as 
infinite. For y = 1/x 2 , which blows up faster, the area — l/x]p is again infinite. 

For y — 1/v^c, the area from 0 to 1 is 2. In that case p — For p = 99/100 the area 
is 1/(1 — p)— 100. Approaching p = 1 the borderline in Figure 7.7 seems clear. But 
that cutoff is not as sharp as it looks . 





Fig. 7.7 Graphs of l/x p on both sides of 1. I drew the same curves! 
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Narrower borderline Under the graph of 1 fx t the area is infinite. When we divide 
by In x or (In x) 2 , the borderline is somewhere in between. One has infinite area (going 
out to x = oo), the other area is finite: 


dx 

e x(ln x) 


ln(ln x)]" 


“ 00 


30 dx 
e x(ln x ) 2 



( 4 ) 


The first is J duju with u = In x. The logarithm of In x does eventually make it to 
infinity. At x = 10 10 , the logarithm is near 23 and ln(lnx) is near 3. That is slow! 
Even slower is ln(ln(ln x)) in Problem 11, No function is exactly on the borderline. 
The second integral in equation (4) is convergent (to 1), It is J dufu 1 with u = In x. 
At first I wrote it with x going from zero to infinity. That gave an answer I couldn't 
believe: 


dx 

0 x(ln x) 2 


1 

In x 


QO 


= 0 (??) 


There must be a mistake, because we are integrating a positive function. The area 
can't be zero. It is true that 1/ln b goes to zero as b -*■ oo. It is also true that 1/ln a 
goes to zero as a 0. But there is another infinity in this integral The trouble is at 
x = 1, where In x is zero and the area is infinite. 


EXAMPLE 1 The factor e x overrides any power x p (but only as x -► oo), 

fo x 5O e~ x dx = 50 ] but J* x~ l e~ x dx = oo. 

The first integral is (50)(49)(48)"(1), It comes from fifty integrations by parts (not 
recommended). Changing 50 to 7 , the integral defines factorial The product 
t(“ 3)(^$)”' has no way to stop, but somehow y! is See Problem 28. 

The integral x°e~ x dx = 1 is the reason behind “zero factorial” — 1, That seems the 
most surprising of all. 

The area under e~ x jx is (-1)! = 00, The factor e~ x is absolutely no help at x = 0, 
That is an example (the first of many) in which we do not know an antiderivative — 
but still we get a decision. To integrate e~ x jx we need a computer. But to decide that 
an improper integral is infinite (in this case) or finite (in other cases), we rely on the 
following comparison test: 


7C (Comparison test) Suppose that 0 ^ u(x) < v(x). Then the area under u(x) 
is smaller than the area under rix): 

J u(x)dx < oo if J <oo if J u(x)dx = 00 then J u(x)Jx = 00, 


Comparison can decide if the area is finite. We don’t get the exact area, but we learn 
about one function from the other. The trick is to construct a simple function {like 
\/x p ) which is on one side of the given function — and stays close to it: 


EXAMPLE 2 

EXAMPLE 3 


x 2 + 4x 


i; 

f" dx 

Ji y/x+ 1 


dx t , f ^ dx 

converges by comparison with — 5- = 1. 

x 


diverges by comparison with 


f; 


= 00, 
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EXAMPLE 4 


r 1 dx , . r i dx 

~ 2 — ~r~ diverges by comparison with — = oo. 
J o x + 4x J o 5x 


EXAMPLE 5 


M dx 
Jo y/x+ 1 


converges by comparison with 



in Examples 2 and 5, the integral on the right is larger than the integral on the left. 
Removing 4x and y/x increased the area. Therefore the integrals on the left are 
somewhere between 0 and h 

In Examples 3 and 4, we increased the denominators. The integrals on the right 
are smaller, but still they diverge. So the integrals on the left diverge. The idea of 
comparing functions is seen in the next examples and Figure 7.8. 


EXAMPLE 6 


r 


e * 2 dx is below 


j>*f 


e X dx- 1 + L 


EXAMPLE 7 


e dx t f dx 

v— is above — : — - oo, 

! In x x In x 


EXAMPLE 8 



is below 




= 2 + 2. 



Fig. 7.8 Comparing w(x) to ^(x); f * dx/ In x = oo and dx/yfx — x 2 < 4. But oo - oo / 0. 


There are two situations not yet mentioned, and both are quite common. The first is 
an integral all the way from a~ — qq to ft = + qd, That is split into two parts, and 
each part mast converge. By definition, the limits at — go and + oo are kept separate : 



y(x) dx = 



y(x) dx + 



y(x) dx = lim 

a -* — od 


f°y(*M* + iim T 
Jb t, -”Jo 


><x) dx. 


The bell-shaped curve y = e"* J covers a finite area {exactly y/n ). The region extends 
to infinity in both directions, and the separate areas are But notice: 

x dx is not defined even though x dx = 0 for every b. 


The area under y = x is + oo on one side of zero. The area is — oo on the other side. 
We cannot accept oo — oo = 0. The two areas must be separately finite, and in this 
case they are not. 
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EXAMPLE 9 1 jx has balancing regions left and right of x = 0 + Compute j!_ 1 dxjx. 

This integral does not exist. There is no answer, even for the region in Figure 7.8c. 
(They are mirror images because l/.v is an odd function.) You may feel that the 
combined integral from - 1 to 1 should be zero. Cauchy agreed with that— his 
“principal value integral" is zero. But the rules say no: oc - oc is not zero. 


7.5 EXERCISES 


Road" thro ugh questions 

An improper integral j* y(x) Jx has lower limit a — q or 
upper limit h= b or y becomes c in the interval 
a ^ v ^ b, The example Jx/x 3 is improper because d . 
We should study the limit of j* Jx/x 3 as e . In practice 
we work directly with — 4x = f , For p> 1 the 

improper integral g is finite. For p<l the improper 
integral h is finite. For y = £ _Jf the integral from 0 to oc 
is i . 

Suppose 0 ^ u(x) ^ r(x) for all x , The convergence of i 
implies the convergence of k . The divergence of 
J w(x) dx I the divergence of j r{x) dx. From - x to oc, 
the integral of )/(£* + e _j: ) converges by comparison with 

m . Strictly speaking we split (— x, oc) into (_n_, 0) and 
(0. o l Changing to l/(e* — e~ x ) gives divergence, because 

P . Also dxj sin x diverges by comparison with q . 
The regions left and right of zero don't cancel because co — oc 
is 


In 17-26, find a larger integral that converges or a smaller 
integral that diverges. 


17 


19 


dx 


yV dx 
X 1 + 1 


21 f 


*sin x dx 


23 


25 


f * sin 2 * 


Jx 


18 


1 Jx 
0 * 6 + 1 


20 


r ‘Lh 

Jo 1- 


d l 

X 


12 


24 


26 


n 


x~ x dx 


— In x Jx 


1 1 


1 V* 1 + *. 


Jx 


27 If p> 0, integrate by parts to show that 

Jo x l ’e ' tJx = P xP :<J X ‘ lx - 


Decide convergence or divergence in 1-16. Compute the integ- 
rals that converge. 


f ■* 


1 


Jx 


l x 

f 1 Jx 


0 - X 

^ dx 


_ X + 1 


dx 


J.T 

i: 

11 [* dx 

J i oo *( |n *)( ln lrl ■ 

13 cos 2 x Jx 

Jo 


15 


f r Jx 


dx 


i: 


dx 


1 -x 

ri Jx 


- l v ' 1 -x" 


sin x Jx 


9 In x dx (by parts) 10 


12 


xe X dx (by parts) 




X Jx 


14 


16 


-ir 

rn;2 

tan x dx 
f ' e x dx 


o 


The first integral is the definition of pi So the equation is p! = 

. In particular 0! = , Another notation for 

pf is T(p+ 1) using the gamma function emphasizes that p 
need not be an integer. 

28 Compute (— 4)1 by substituting x = u 2 : 

x" i:2 e" x dx = _____ = v /j r (known). 

Then apply Problem 27 to find (4)1 

29 Integrate x 2 e~ x ~dx by parts, 

30 The beta junction «) = .x m " 1 ( 1 — xf" ‘Jx is finite 

when m and n are greater than . 

31 A perpetual annuity pays s dollars a year forever. With 

continuous interest rate c\ its present value is y 0 = Jo se~ ft dt. 
To receive $1000; year at c = 10%, you deposit y 0 = . 

32 In a perpetual annuity that pays once a year, the present 

value is y n = si a + si a 2 + li - = . To receive 

$1000 year at 10% (now a— 1.1) you again deposit y Q — 
, Infinite sums are like improper integrals. 

33 The work to move a satellite (mass m) infinitely far from 
the Earth (radius R, mass M) is W= GMm Jx/x 2 . Evaluate 
W r What escape celocity at liftoff gives an energy that 
equals W'! 
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34 The escape velocity for a black hole exceeds the speed of 
light: v 0 > 3 * 10 8 m/sec. The Earth has GM — 4 *tO l4 m 3 /sec 2 . 

If it were compressed to radius R = , the Earth 

would be a black hole, 

35 Show how the area under y = t/2* can be covered (draw 
a graph) by rectangles of area 1 + \ + i + ■■■ =2, What is the 
exact area from x = 0 to x = oc? 


36 Explain this paradox: 


— = 0 for every b but 


xdx 

-*> I +* 


2 diverges. 


*38 Compute any of these integrals found by geniuses: 


dx = In 2 


foe f oo 

xe~ x cos x dx = 0 cos x 2 dx — ^/rr/8. 

Jo Jo 

. , t f 00 dx 
39 For which p is — = oo? 

Jo x p + *~ p 


37 Compute the area between y = secx and y = tanx for 40 Explain from Figure 7,6c why the red area is 2, when 
0 ^ x ^ rc/2. What is improper? Figure 7,6a has red area L 
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CHAPTER 8 


Applications of the Integral 


We are experts in one application of the integral — to find the area under a curve. 
The curve is the graph of y = u(x), extending from x = a at the left to x = b at the 
right. The area between the curve and the x axis is the definite integral. 

I think of that integral in the following way. The region is made up of thin strips. 
Their width is dx and their height is v(x ). The area of a strip is v(x) times dx. The 
area of all the strips is j£u(x}dx. Strictly speaking, the area of one strip is 
meaningless — genuine rectangles have width Ax. My point is that the picture of thin 
strips gives the correct approach. 

We know what function to integrate (from the picture). We also know how (from 
this eourse or a calculator). The new applications to volume and length and surface 
area cut up the region in new ways. Again the small pieces tell the story. In this 
chapter, what to integrate is more important than how. 



8.1 Areas and Volumes by Slices 



This section starts with areas between curves. Then it moves to volumes , where the 
strips become slices. We arc weighing a loaf of bread by adding the weights of the 
slices. The discussion is dominated by examples and figures— the theory is minimal. 
The real problem is to set up the right integral. At the end we look at a different way 
of cutting up volumes, into thin shells. All formulas are collected into a final table. 

Figure 8 A shows the area between two curves. The upper curve is the graph of 
y = r(x). The lower curve is the graph of y = w(x). The strip height is u(x) — w(x), from 
one curve down to the other. The width is dx (speaking informally again). The total 
area is the integral of “top minus bottom”: 


area between two curves = 



L?(X) — W(X) J dx . 


a) 


EXAMPLE 1 The upper curve is y = 6x (straight line). The lower curve is y = 3x 2 
(parabola). The area lies between the points where those curves intersect. 

To find the intersection points, solve u(.x) = vv(x) or 6x = 3x 2 . 
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5 Applications of the Integral 




Fig, 8.1 Area between curves = integral of v — w, Area in Example 2 starts with x ^ 0, 


One crossing is at x = 0, the other is at x = 2. The area is an integral from 0 to 2: 
area = - w) dx = (6x — 3.x 2 ) dx = 3x 2 - x 3 ]^ = 4. 

EXAMPLE 2 Find the area between the circle v = yj\ - x 2 and the 45° line w = x. 

First question: Which area and what limits? Start with the pie-shaped wedge in 
Figure 8Tb. The area begins at the y axis and ends where the circle meets the line. 
At the intersection point we have u(x) = w(x): 

from ^/l — x 2 = x squaring gives 1 — x 2 = .x 2 and then 2.x 2 = 1. 

Thus x 2 = The endpoint is at x = l/ v /2. Now integrate the strip height v — w: 



The area is tt/ 8 (one eighth of the circle). To integrate v 'l - x 2 dx we apply the 
techniques of Chapter 7: Set x = sin 0, convert to f cos 2 0 dO = j(0 H- sin 0 cos 0), 
convert back using 0 = sin " 1 x. It is harder than expected, for a familiar shape. 

Remark Suppose the problem is to find the whole area between the circle and the 
line. The figure shows w at two points, which arc x= l/ v /2 (already used) and 
also x = —1 lyj 2. Instead of starting at x = 0, w hich gave | of a circle, we now include 
the area to the left. 

Main point: Integrating from jc= - l/y'2 to x = l/ v /2 wi7/ give the wrong answer , 
It misses the part of the circle that bulges out over itself, at the far left. In that part, 
the strips have height 2v instead of v - w. The figure is essential, to get the correct 
area of this half-circle. 

HORIZONTAL STRIPS INSTEAD OF VERTICAL STRIPS 

There is more than one way to slice a region. Vertical slices give x integrals. Horizontal 
slices give y integrals. We have a free choice, and sometimes the y integral is better. 
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Fig. 8.2 Vertical slices [x integrals) vs. horizontal slices ( 3 ? integrals). 


Figure 8*2 shows a unit parallelogram, with base 1 and height 1. To find its area from 
vertical slices, three separate integrals are necessary* You should see why! With hori- 
zontal slices of length 1 and thickness dy, the area is just j* dy = 1, 

EXAMPLE 3 Find the area under y = In x (or beyond x = e*) out to x = e. 

The x integral from vertical slices is in Figure 8.2c. The y integral is in 8, 2d. The area 
is a choice between two equal integrals (I personally would choose y ): 

fx - 1 In x d:x = [x In x - x] j = 1 or J‘ =0 (e ~ <?) dy = [ey - e»]J = 1. 

VOLUMES BY SLICES 

For the first time in this book, we now look at volumes. The regions are three- 
dimensional solids. There are three coordinates x, y, z — and many ways to cut up a 
solid* 

Figure 8*3 shows one basic way — using slices. The slices have thickness dx, like 
strips in the plane. Instead of the height y of a strip, we now have the area A of a 
cross-section * This area is different for different slices: A depends on x , The volume 
of the slice is its area times its thickness: dV — A(x) dx. The volume of the whole solid 
is the integral : 

volume — integral of area times thickness = j A{x) dx. (2) 

Note An actual slice does not have the same area on both sides! Its thickness is Ax 
(not dx). Its volume is approximately A(x)Ax (but not exactly)* In the limit, the 
thickness approaches zero and the sum of volumes approaches the integral. 

For a cylinder all slices are the same* Figure 8*3b shows a cylinder — not circular * 
The area is a fixed number A t so integration is trivial* The volume is A times h. The 



Fig. 8,3 Cross-sections have area /l(x). Volumes are j ^4(x) dx. 
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letter h, which stands for height, reminds us that the cylinder often stands on its end. 
Then the slices are horizontal and the y integral or z integral goes from 0 to h. 
When the cross-section is a circle, the cylinder has volume nr 2 h. 


EXAMPLE 4 The triangular wedge in Figure 8.3b has constant cross-sections with 
area A = y(3)(4) = 6. The volume is 6 h. 


EXAMPLE 5 For the triangular pyramid in Figure 8.3c, the area drops from 6 
to 0. It is a general rule for pyramids or cones that their volume has an extra factor 
^ (compared to cylinders). The volume is now 2h instead of 6h. For a cone with base 
area nr the volume is ^nr 2 h. Tapering the area to zero leaves only ^ of the volume. 

Why the 3? Triangles sliced from the pyramid have shorter sides. Starting from 3 
and 4, the side lengths 3(1 - x/h) and 4(1 - x/h) drop to zero at a = h. The area is 
A — 6(1 — xih} 2 . Notice: The side lengths go down linearly, the area drops quadrati- 
cally. The factor 3 really comes from integrating x 2 to get \x*: 


A(x) dx = 


■-0 '-s 


= 2h. 


EXAMPLE 6 A half-sphere of radius R has known volume ^(IttR*). Its cross-sections 
are semicircles. The key relation is x 2 + r 1 = K 2 , for the right triangle in Figure 8,4a. 
The area of the semicircle is A = jnr 2 = jn(R 2 - x 2 ). So we integrate /4(x): 

volume = A(x) dx = jrcf J? 2 x - ^ = InR*, 

EXAMPLE 7 Find the volume of the same half-sphere using horizontal slices 
(Figure 8.4b). The sphere still has radius R. The new right triangle gives y 2 + r 1 = R 1 . 
Since we have full circles the area is nr 2 = n{R 2 - y 1 ). Notice that this is A{y) not 
,4(x). But the y integral starts at zero: 

volume = -4(y) dy — n{R 2 y - 3 y 3 )]o = (as before). 



Fig, 8.4 A half-sphere sliced vertically or horizontally. Washer area nf 2 — tt# 2 . 


SOLIDS OF REVOLUTION 

Cones and spheres and circular cylinders are “solids of revolution,' 1 Rotating a hori- 
zontal line around the x axis gives a cylinder. Rotating a sloping line gives a cone. 
Rotating a semicircle gives a sphere. If a eirclc is moved away from the axis, rotation 
produces a torus (a doughnut). The rotation of any curve y—f(x) produces a solid 
of revolution. 
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The volume of that solid is made easier because every cross-section is a circle. All 
slices are pancakes (or pizzas). Rotating the curve y = f(x) around the x axis gives 
disks of radius y , so the area is A = ny 1 = ?r [/(*)] 2 . We add the slices: 


f* 

volume of solid of revolution = 


7 zy 2 dx = 



dx. 


EXAMPLE 8 Rotating); = yfx with A - n(^fx) 2 produces a “headlight” (Figure 8.5a): 

volume of headlight = fa A dx = j 2 nx dx = - 2n. 

If the same curve is rotated around the y axis, it makes a champagne glass. The slices 
are horizontal. The area of a slice is nx 2 not ny 1 . When y — yjx this area is n y*. 
Integrating from y = 0 to gives the champagne volume jt(,/2) s /5. 


revolution around the y axis : 


volume — 


TEX 2 dy . 


EXAMPLE 9 The headlight has a hole down the center (Figure 8.5b). Volume = ? 

The hole has radius 1, All of the yfx solid is removed, up to the point where y/x 
reaches 1. After that, from x= 1 to x = 2, each cross-section is a disk with a hole. 
The disk has radius /= yfx and the hole has radius g — h The slice is a flat ring or 
a “washer ” Its area is the full disk minus the area of the hole: 

area of washer = rtf 2 — ng 2 = Tif^/x) 2 - n{t) 2 = nx - n. 

This is the area A(x) in the method of washers , Its integral is the volume: 

J 2 A dx — Jj (tex - te) dx — [^7rx 2 “ = }n. 

Please notice: The washer area is not rt(f— g) 2 . It is A = nf 2 — ng 2 . 




Fig. 8.5 


y — yfx revolved 


; y = 1 revolved inside it; circle revolved to give torus. 


EXAMPLE 10 (Doughnut sliced into washers) Rotate a circle of radius a around the 
x axis. The center of the circle stays out at a distance b > a. Show that the volume 
of the doughnut (or torus) is 2n 2 a 2 b. 
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The outside half of the circle rotates to give the outside of the doughnut. The inside 
half gives the hole. The biggest slice (through the center plane) has outer radius h + a 
and inner radius h - a. 

Shifting over b y x, the outer radius is /= b + v ; a 2 — x 2 and the inner radius is 
g = b — A Ja 1 — x 2 . Figure 8.5c shows a slice (a washer) with area n f 2 — ng 2 . 

area A = n(h T- v 'gt — x 2 ) 2 — n(b — N /a 2 — x 2 ) 2 = 4tt by' a 2 — x 2 . 

Now integrate over the washers to find the volume of the doughnut: 

A(x)dx = 4nb y - x 1 dx = (4 Kb)(jKa 2 ) - 2i i 2 a 2 b. 

That integral jna z is the area of a semicircle. When we set x= a sin 0 the area is 
| a 2 cos 2 0 d&. Not for the last time do we meet cos 2 (K 

The hardest part is visualizing the washers, because a doughnut usually breaks the 
other way. A better description is a hagel , sliced the long way to be buttered. 

VOLUMES BY CYLINDRICAL SHELLS 

Finally we look at a different way of cutting up a solid of revolution. So far it was 
cut into slices. The slices were perpendicular to the axis of revolution. Now the cuts 
arc parallel to the axis, and each piece is a thin cylindrical shell. The new formula 
gives the same volume, but the integral to be computed might be easier. 

Figure 8 + 6a shows a solid cone, A shell is inside it, The inner radius is x and the 
outer radius is x + dx . The shell is an outer cylinder minus an inner cylinder: 

shell volume tt(x 4- dx) 2 h - nx 2 h = nx 2 h 2Kx(dx)h + i i(dx) 2 h - nx 7 h. (3) 

The term that matters is 2 nx(dx)h. The shell volume is essentially 2 nx (the distance 
around) times dx (the thickness) times h (the height). The volume of the solid comes 
from putting together the thin shells: 

f* 

solid volume = integral of shell volumes = 2k xh dx. (4) 

This is the central formula of the shell method. The rest is examples. 

Remark on this volume formula It is completely typical of integration that (dx) 2 and 
(Ax) 2 disappear. The reason is this. The number of shells grows like 1/Ax. Terms of 
order (Ax) 2 add up to a volume of order Ax (approaching zero). The linear term 
involving Ax or dx is the one to get right. Its limit gives the integral j 2n xh dx . The 
key is to build the solid out of shells — and to find the area or volume of each piece, 

EXAMPLE 11 Find the volume of a cone (base area nr 7 , height b) cut into shells, 

A tall shell at the center has h near b. A short shell at the outside has h near zero. In 
between the shell height h decreases linearly, reaching zero at x = r, The height in 
Figure 8.6a is h = b — bxlr. Integrating over all shells gives the volume of the cone 
(with the expected ^}: 
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hole radius a 


sphere radius b 

-Jr (down) 


Fig. 8.6 Shells of volume 2nxh dx inside cone, sphere with hole, and paraboloid. 


EXAMPLE 12 Bore a hole of radius a through a sphere of radius b > a. 

The hole rem oves all points out to x = a, where the shells begin. The height of the 
shell is h = 2 s /fr 2 - x 2 . (The key is the right triangle in Figure 8.6b. The height upward 
is yjb 1 — x 2 — this is half the height of the shell.) Therefore the sphere-with-hole has 

volume = J* 2nxh dx = j h a 4nx % fb 2 — x 2 dx. 

With u = h 2 — x 2 we almost see du . Multiplying du = — 2x dx is an extra factor - 2n: 

volume = -2 tt | *Ju du = -2jt(|& 3/2 ), 

We can find limits on u, or we can put back u = b 2 — x 2 : 

volume = — ^ (b 2 — x 2 ) 3/2 J = ^ (b 2 - a 2 ) 3/2 . 

If a-b (the hole is as big as the sphere) this volume is zero. If a = 0 (no hole) we 
have 4 jt& 3 /3 for the complete sphere. 

Question What if the sphere-with-hole is cut into slices instead of shells? 

Answer Horizontal slices are washers (Problem 66). Vertical slices arc not good. 


EXAMPLE 13 Rotate the parabola y ~ x 1 around the y axis to form a bowk 

We go out to x-y/2 (and up to ^ = 2), The shells in Figure 8.6c have height 
/i - 2 - x 2 . The bowl (or paraboloid) is the same as the headlight in Example 8, but 
we have shells not slices: 


2nx(2 — x 2 )dx ~ 2nx 2 - 


TABLE area between curves : A = f (i7{jc) — w(x)) dx 

OF 

MU vs so ^ vo ^ ume cut * nt0 slices: V = j >4(x) dx or J ,4(j0 dy 

AND solid of revolution: cross-section A ~ ny 2 or nx 2 

DLUMES , a r2 2 

solid with hole: washer area A - nf* — ng 2 

solid of revolution cut into shells : V — ( 2nxh dx . 


AND 

VOLUMES 
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Which to use, slices or shells'* Start with a vertical line going up to y — cos x. Rotating 
the line around the x axis produces a slice (a circular disk). The radius is cos x. 
Rotating the line around the y axis produces a shell (the outside of a cylinder). The 
height is cos x. See Figure 8.7 for the slice and the shell. For volumes we just integrate 
n cos 2 x dx (the slice volume) or 2nx cos x dx (the shell volume). 

This is the normal choice — slices through the x axis and shells around the y axis , 
Then y—f(x) gives the disk radius and the shell height. The slice is a washer instead 
of a disk if there is also an inner radius #(x). No problem — just integrate small 
volumes. 

What if you use slices for rotation around the y axis? The disks are in Figure 8.7b, 
and their radius is x. This is x = cos^ 1 y in the example. It is x =/" i (y) in general. 
You have to solve y =/(x) to find x in terms of y. Similarly for shells around the x 
axis: The length of the shell is x= L f~ 1 (y). Integrating may be difficult or impossible. 

When y = cos x is rotated around the x axis, here are the choices for volume: 

(good by slices ) J n cos 2 x dx ( bad by shells) j 2ny cos " 1 y dy. 



Fig. 8.7 Slices through x axis and shells around y axis (good). The opposite way needs f~ l (y). 


8.1 EXERCISES 


Read-through questions 

The area between y = x 3 and y = x 4 equals the integral of 
q If the region ends where the curves intersect, we find 
the limits on x by solving b , Then the area equals c . 
When the area between y = and the y axis is sliced hori- 
zontally, the integral to compute is <* . 

In three dimensions the volume of a slice is its thickness dx 
times its g If the cross-sections are squares of side 1 — x, 
the volume comes from j j From x = 0 to x = 1, this 
gives the volume o of a square ft If the cross-sec- 
tions are circles of radius 1 — x, the volume comes from 
j i This gives the volume J of a circular fc 

For a solid of revolution, the cross-sections are I 
Rotating the graph of y = f(x) around the x axis gives a solid 
volume J m . Rotating around the y axis leads to J n 
Rotating the area between y—f{x) and y = g(x) around the x 
axis, the slices look like o . Their areas are p so the 
volume is J Q . 

Another method is to cut the solid into thin cylindrical 
f Revolving the area under y— f(x) around the y axis, 
a shell has height » and thickness dx and volume t 
The total volume is J u . 


Find where the curves in 1-12 intersect, draw rough graphs, 
and compute the area between them. 

1 y = x 2 — 3andy=l 2 y = x 2 — 2 and y = 0 

3 y 2 -x and x = 9 4 y 2 - x and x = y + 2 

5 y = x 4 — 2x z and y = 2x 2 6 x = y 5 and y = x 4 

7 y = x 2 and y = —x 2 4- 18x 

8 y = 1/x and y = 1 jx 2 and x = 3 

9 y = cos x and y = cos 2 * 

10 y = sin nx and y = 2x and x = 0 

11 v = c* and y = e 2jc ~ 1 and x = 0 

12 y — e and y = e* and y = e~ x 

13 Find the area inside the three lines y = 4 - *, y = 3x> and 
y = x. 

14 Find the area bounded by y = 12 — x, y — y/x, and y = 1. 

15 Does the parabola y = 1 — x 2 out to x = 1 sit inside or 
outside the unit circle x 2 + y 2 = 1? Find the area of the “skin” 
between them. 
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16 Find the area of the largest triangle with base on the x 
axis that fits (a) inside the unit circle (b) inside that parabola. 

17 Rotate the ellipse x 2 /a 2 + y 2 /b 2 = 1 around the % axis to 
find the volume of a football. What is the volume around the 
y axis? If a = 2 and 6=1, locate a point (x, y, z) that is in one 
football but not the other 


36 CavalierPs principle for volumes: If two solids have slices 
of equal area, the solids have the same volume. Find the 
volume of the tilted cylinder in the figure. 

37 Draw another region with the same slice areas as the tilted 

cylinder. When all areas A(x) are the same, the volumes 
| __ are the same. 


18 What is the volume of the loaf of bread which comes from 
rotating y = sin x (0 ^ x ^ n) around the x axis? 

19 What is the volume of the flying saucer that comes from 
rotating y = sin x (0 ^ x ^ ji) around the y axis? 

20 What is the volume of the galaxy that comes from rotating 
y — sin x (0 < x < rc) around the x axis and then rotating the 
whole thing around the y axis? 

Draw the region bounded by the curves in 21-28. Find the 
volume when the region is rotated (a) around the x axis (b) 
around the y axis. 

21 x + y = 8 f x = 0 t y = G 

22 y — £* = U x~ 1* y = 0, x = 0 

23 y-x 4 , y- U * = 0 

24 y = sin x t y = cos x, x = 0 

25 xy — l t x = 2, y = 3 

26 x 2 “ y 2 = 9, x + y = 9 (rotate the region where y > 0) 

27 x 2 ^y\x*^y 2 

28 (x — 2) 2 + (y — l) 2 = I 

In 29-34 find the volume and draw a typical slice, 

29 A cap of height h is cut off the top of a sphere of radius 
R. Slice the sphere horizontally starting at y = J? — /i, 

30 A pyramid P has height 6 and square base of side 2. Its 
volume is £(6)(2) 2 = 8. 

(a) Find the volume up to height 3 by horizontal slices. 
What is the length of a side at height y? 

(b) Recompute by removing a smaller pyramid from P, 

31 The base is a disk of radius a. Slices perpendicular to the 
base are squares. 

32 The base is the region under the parabola y — I — x 2 . 
Slices perpendicular to the x axis are squares. 

33 The base is the region under the parabola y = I — x 2 . 
Slices perpendicular to the y axis are squares. 

34 The base is the triangle with corners (0, 0), (1, 0), (0, 1). 
Slices perpendicular to the x axis are semicircles. 

35 CavalierPs principle for areas: If two regions have strips 
of equal length, then the regions have the same area. Draw a 
parallelogram and a curved region, both with the same strips 
as the unit square. Why are the areas equal? 


38 Find the volume common to two circular cylinders of 
radius a. One eighth of the region is shown (axes are perpen- 
dicular and horizontal slices are squares). 



39 A wedge is cut out of a cylindrical tree (see figure). One 
cut is along the ground to the x axis. The second cut is at 
angle 0, also stopping at the x axis. 

(a) The curve C is part of a (circle) (ellipse) (parabola). 

(b) The height of point P in terms of x is . 

(c) The area ,4(x) of the triangular slice is 

(d> The volume of the wedge is . 


C 



40 The same wedge is sliced perpendicular to the y axis. 

(a) The slices are now (triangles) (rectangles) (curved). 

(b) The slice area is (slice height y tan 0), 

(c) The volume of the wedge is the integral 

(d) Change the radius from 1 to r. The volume is 

multiplied by 

41 A cylinder of radius r and height h is half full of water. 
Tilt it so the water just covers the base. 

(a) Find the volume of water by common sense. 

(b) Slices perpendicular to the x axis are (rectangles) (trap- 
ezoids) (curved). I had to tilt an actual glass, 

*42 Find the area of a slice in Problem 41. (The tilt angle has 
tan 0 = 2h/r.) Integrate to find the volume of water. 
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The slices in 43-46 are washers. Find the slice area and volume. 

43 The rectangle with sides x = 1, x — 3, y = 2 t y = 5 is rotated 
around the x axis. 

44 The same rectangle is rotated around the y axis. 

45 The same rectangle is rotated around the line y — 1. 

46 Draw the triangle with corners (1,0), (l, 1), (0, 1). After 
rotation around the x axis, describe the solid and find its 
volume. 

47 Bore a hole of radius a down the axis of a cone and 
through the base of radius ft . If it is a 45° cone {height also 
ft), what volume is left? Check a — 0 and a = ft. 

48 Find the volume common to two spheres of radius r if 
their centers are 2 (r — ft) apart. Use Problem 29 on spherical 
caps. 

49 (Shells vs. disks) Rotate y = 3 — x around the x axis from 
x = 0 to x = 2. Write down the volume integral by disks and 
then by shells. 

50 (Shells vs. disks) Rotate y = x l around the y axis from 
y = 0 to y = 8, Write down the volume integral by shells and 
disks and compute both ways. 

51 Yogurt comes in a solid of revolution. Rotate the line 
y = mx around the y axis to find the volume between y = a 
and y — b< 

52 Suppose y =f(x) decreases from /(0) = ftto /(l) = 0,The 
curve is rotated around the y axis. Compare shells to disks : 

2nxf(x) dx = f 0 

Substitute y = /(x) in the second. Also substitute dy = f'(x)dx . 
Integrate by parts to reach the first. 

53 If a roll of paper with inner radius 2 cm and outer radius 
10 cm has about 10 thicknesses per centimeter* approximately 
how long is the paper when unrolled? 

54 Find the approximate volume of your brain. OK to 
include everything above your eyes (skull too). 

Use shells to find the volumes in 55-63. The rotated regions 
lie between the curve and x axis. 

55 y = 1 — x 2 , 0 ^ x ^ 1 (around the y axis) 


56 y = 1/x, 1 < x < 100 (around the y axis) 

57 y = ^f\ — x 2 , G < x < 1 (around either axis) 

58 y = 1/(1 + x 2 ), 0^x0 (around the y axis) 

59 y = sin(x 2 )* 0 < x < (around the y axis) 

60 y = l/^/l — x 2 t 0^x^ 1 (around the y axis) 

61 y = x 2 * 0 ^ x < 2 (around the x axis) 

62 y = e*> 0 ^ x ^ 1 (around the x axis) 

63 y ^ In x, 1 < x ^ e (around the x axis) 

64 The region between y — x 2 and y — x is revolved around 
the y axis, (a) Find the volume by cutting into shells, (b) Find 
the volume by slicing into washers. 

65 The region between y=/(x) and y = l+/(x) is rotated 

around the y axis. The shells have height . The vol- 
ume out to x = a is It equals the volume of a 

because the shells are the same. 

66 A horizontal slice of the sphere- with-hole in Figure 8.6b 
is a washer. Its area is rrx 2 - na 2 = n (ft 2 - y 2 — a 2 ), 

(a) Find the upper limit on y (the top of the hole). 

(b) Integrate the area to verify the volume in Example 12. 

67 If the hole in the sphere has length 2* show that the volume 
is 4ti/3 regardless of the radii a and ft. 

*68 An upright cylinder of radius r is sliced by two parallel 
planes at angle x One is a height ft above the other. 

(a) Draw a picture to show that the volume between the 
planes is 7ir 2 Ji. 

(b> Tilt the picture by x, so the base and top are flat. What 
is the shape of the base? What is its area 4? What is the 
height H of the tilted cylinder? 

69 True or false T with a reason. 

(a) A cube can only be sliced into squares. 

(b) A cube cannot be cut into cylindrical shells. 

(c) The washer with radii r and J? has area rt(R — r) 2 , 

(d) The plane w = % slices a 3-dimensional sphere out of 
a 4-dimensional sphere x 2 + y 2 + z 1 + w 2 = 1 . 



8.2 Length of a Plane Curve 



The graph of y — x J 2 is a curve in the x-y plane. How long is that curve ? A definite 
integral needs endpoints, and we specify x — 0 and x = 4. The first problem is to know 
what “length function” to integrate. 

The distance along a curve is the arc length. To set up an integral* we break the 


8.2 Length of a Plane Curve 


321 


problem into small pieces. Roughly speaking, small pieces of a smooth curve ore nearly 
straight. We know the exact length As of a straight piece, and Figure 8,8 shows how 
it comes close to a curved piece. 



(As) 2 = (Ax) 2 + (Ay) 2 
(4s) 2 = (dx) 2 + (dx) 2 

ds = yf 1 + (dyjdx) 2 dx 



Hg, 8,8 Length is of short straight segment. Length ds of very short curved segment. 


Here is the unofficial reasoning that gives the length of the curve, A straight piece 
has (As) 2 = (Ax) 2 + (Ay) 2 , Within that right triangle, the height Ay is the slope 
(Ay/Ax) times Ax, This secant slope is close to the slope of the curve. Thus Ay is 
approximately (dyjdx) Ax. 


As ^ >J(Ax) 2 + (dyjdx) 2 (Ax) 1 = 1 + (dyjdx) 2 Ax. (1) 

Now add these pieces and ma ke them smal ler. The infinitesimal triangle has (ds) 2 = 
(dx) 2 + (dy) 2 . Think of ds as ^/l + (dyjdx) 2 dx and integrate: 


length of curve = 


ds = 


V 1 + {dyjdx) 2 dx. 


( 2 ) 


EXAMPLE 1 Keep y = x w and dyjdx = f x 1/2 . Watch out for § and f : 

length = It Jl+U dx = (J)($)(l + I*) 3 ' 2 ]* = £(10 3 ' 2 - l 3 ' 2 ). (3) 

This answer is just above 9. A straight line from (0, 0) to (4, 8) has exact length 
^80. Note 4 2 + 8 2 = 80. Since ^/SO is just below 9, the curve is surprisingly straight. 

You may not approve of those numbers (or the reasoning behind them). We can 
fix the reasoning, but nothing can be done about the numbers. This example y = x 3/2 
had to be chosen carefully to make the integration possible at all. The length integral 
is difficult because of the square root. In most cases we integrate numerically. 

EXAMPLE 2 The straight line y = 2x from x = 0 to x = 4 has dyjdx = 2: 

length = yfl + 4 dx = 4^/5 = v /&0 as before (just checking). 

We return briefly to the reasoning. The curve is the graph of y =/(x). Each piece 
contains at least one point where secant slope equals tangent slope: Ay/Ax =/'(c). 
The Mean Value Theorem applies when the slope is co ntinuous — this is required 
for a smooth curve. The straight length As is exactly ^/{Ax) 2 + (f r (c) Ax ) 2 . Adding 
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6 Application* of the Integral 


the n pieces gives the length of the broken line (close to the curve): 





As n -» qo and Ax max -» 0 this approaches the integral that gives arc length. 


8A The length of the curve y ~ f(x) from x = a to x = b is 

* - \ ds - £ V 1 + C/'W] 2 dx = £ jT+idyfdrf dx. (4) 


EXAMPLE 3 Find the length of the first quarter of the circle y = 1 - x 2 . 

Here dy/dx “ — xj^J 1 — x 2 . From Figure 8.9a, the integral goes from x = 0 to x = 1: 

kn * ,h - r * - r f * - r yr b- 

The antiderivative is sin -1 *. It equals nj2 at x = 1. This length nj2 is a quarter of 
the full circumference 2n. 


EXAMPLE 4 Compute the distance around a quarter of the ellipse y 2 + 2x 2 = 2. 
The equation is y = yj2 - 2x 2 and the slope is dyjdx - - Ixj^fl - 2x 2 . So J ds is 


4x 2 

2 - 2x 2 


'1 + 


dx — 




( 5 ) 


That integral can't be done in closed form. The length of an ellipse can only be 
computed numerically. The denominator is zero at x — 1, so a blind application of the 
trapezoidal rule or Simpson’s rule would give length = oo. The midpoint rule gives 
length - 1.91 with thousands of intervals. 


y = V 1 ~ A ' 2 


V 1 + (/) 2 ^ 


. a = cos/, y = sin; 

dy 

sin 2 / + cos 2 / dt 

$tan at i - 0^. 

dx\ 

V 



Fig. 8.9 Circle and ellipse, directly by y = f(x) or parametrically by x(t) and y(/). 


LENGTH OF A CURVE FROM PARAMETRIC EQUATIONS: x(t) AND yit) 

We have met the unit circle in two forms. One is x 2 + y 2 = 1. The other is x = cos f, 
y = sin L Since cos 2 1 + sin 2 1 = 1, this point goes around the correct circle. One advan- 
tage of the “ parameter ” t is to give extra information — it tells where the point is and 
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also when . In Chapter 1, the parameter was the time and also the angle— because 
we moved around the circle with speed 1. 

Using f is a natural way to give the position of a particle or a spacecraft. We can 
recover the velocity if we know x and y at every time t . An equation y = f(x ) tells the 
shape of the path, not the speed along it. 

Chapter 12 deals with parametric equations for curves. Here we concentrate on 
the path length — which allows you to see the idea of a parameter t without too much 
detail. We give x as a function of t and y as a function of L The curve is still 
approximated by straight pieces, and each piece has (As) 2 = (Ax) 2 4- (Ay) 2 . But instead 
of using Ay & {dyjdx) Ax, we approximate Ax and Ay separately: 

Ax {dxjdt) At, Ay # {dyjdt) At, As % {dxjdt) 2 + {dyjdt) 1 At. 


88 The length of a parametric curve is an integral with respect to t: 

jds = $ (dsjdt) dt = J J {dxjdt) 7 - + (dy/dt) 2 dt. (6) 


EXAMPLE 5 Find the length of the quarter-circle using x = cos t and y = sin t: 

f {dxjdt ) 2 + {dyjdt ) 2 dt = f ^/sin 2 1 + cos 2 £ dt - f dt-^. 

Jo Jo Jo 2 

The integral is simpler than 1/ V /T— , x 2 , and there is one new advantage. We can 
integrate around a whole circle with no trouble. Parametric equations allow a path to 
close up or even cross itself. The time t keeps going and the point (x(t), y(r)) keeps 
moving. In contrast, curves y = /(x) are limited to one y for each x, 

EXAMPLE 6 Find the length of the quarter-ellipse: x = cosr and y= v /2sin(: 

On this path y 2 + 2x 2 is 2 $in 2 r + 2 cos 2 1 = 2 (same ellipse). The jKHi-parametric 
equation y = - 2x 2 comes from eliminating r. We keep t : 

M2 fW 2 

length = I yj {dxjdt) 2 + {dyjdt) 2 dt = I ^/sin 2 1 + 2 cos 2 1 dt. (7) 

This integral (7) must equal (5). If one cannot be done, neither can the other. They 
are related by x ~ cos £, but (7) does not blow up at the endpoints. The trapezoidal 
rule gives 1 .9101 with less than 100 intervals. Section 5.8 mentioned that calculators 
automatically do a substitution that makes (5) more like (7). 

EXAMPLE 7 The path x = t 2 , y = £ 3 goes from (0, 0) to (4, 8). Stop at t = 2. 

To find this path without the parameter f, first solve for t = x 1/2 . Then substitute 
into the equation for y: y = t 3 — x 3/2 . The non-parametric form (with t eliminated) is 
the same carve y = x 3/2 as in Example 1. 

The length from the £-integral equals the length from the x-integral. This is 
Problem 22. 

EXAMPLE 8 Special choice of parameter : t is x. The curve becomes x = r, y = £ 3y2 . 

If x = t then dxjdt — 1. The square root in (6) is the same as the square root in (4). 
Thus the non-parametric form y =/(x) is a special case of the parametric form — just 
take t = x. 

Compare x = r, y = t 3/2 with x — t 2 , y — t z . Same curve , same length , different speed. 
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8 Applications of the Integral 


EXAMPLE 9 


Define “speed” by 


short distance 
short time 


ds 
Jt ' 



When a ball is thrown straight upward, dx/dt is zero. But the speed is not dy/dt. 
It is \dyjdt\. The speed is positive downward as well as upward. 


8.2 EXERCISES 


Read-through questions 

The length of a straight segment (A* across. Ay up) is 
As — 0 . Between two points of the graph of y(x), Ay is 

approximately dyjd x times b . The length of that piece 
is approximately ^/(Ax) 2 + c An infinitesimal piece of 
the curve has length ds = d Then the arc length integral 

is f « • 

For y~ 4 — x from x = 0 to x = 3 the arc length is 
j f = 9 . For y = x 3 the arc length integral is h . 

The curve x — cos r, y = sin t is the same a s 1 The 
length of a curve given by x(f), y(t) is J v/ 1 dt. For exam- 
ple x = cos t y y — sin t from £ = t i/ 3 to t = tt/ 2 has length 
* . The speed is ds/dt = [ For the s pecial c ase 

x = £, y —f{t) the length formula goes back to { v / w dx. 

Find the lengths of the curves in Problems 1-8. 

1 y = x 3/2 from (0, 0) to (1, 1 ) 

2 y = x 2/3 from (0, 0) to (1, 1) (compare with Problem 1 or 
put u = $ + x 2/3 in the length integral) 

3 y — j(x 2 + 2) 3 ' 2 from x = 0 to x = 1 

4 y = ^(x 2 — 2) 3/2 from x = 2 to x — 4 

x 3 1 

5 y = — - + — from x = 1 to x = 3 

3 4x 

x 4, l 

6 y = 4- — r from x = 1 to x = 2 

4 8x fc 

7 y = yx 3;2 - jx 1/2 from x = 1 to x = 4 

8 y = x 2 from (0, 0) to (1, 1) 

9 The curve given by x = cos 3 r, y = sin 3 £ is an astroid (a 
hypocycloid). Its non-parametric form is x 2/3 + y 2/3 = l. 
Sketch the curve from t =0 to t = nj2 and find its length. 

10 Find the length from t = 0 to r — n of the curve given by 
x-cos t + sin t t y = cos i — sin t. Show that the curve is a 
circle (of what radius?). 

1 1 Find the length from t - 0 to t = tt/ 2 of the curve given by 
x - cos t y y = t - sin t . 

12 What integral gives the length of Archimedes' spiral 
x = r cos t y y — t sin t? 


13 Find the distance traveled in the first second (to r = 1) if 
x«i* 2 ,y = ±(2M-l) 3 ' 2 . 

14 x — (1 - i cos It) cos t and y = (I 4- i cos 2l) sin t lead to 
4(1 — x 2 — y 2 ) 3 = 27(x 2 — y 2 ) 2 . Find the arc length from t = 0 
to rc/4. 

Find the arc lengths in 15-18 by numerical integration. 

15 One arch of y = sin x, from x = 0 to x = it. 

16 y = e* from x = 0 to x = 1. 

17 y = In x from x = 1 to x = e. 

18 x = cos ty y = 3 sin r, 0 < x < 2n. 

19 Draw r a rough picture of y = x 10 . Without computing the 
length of y = x" from (0, 0) to (1, l), find the limit as n -* oc. 

20 Which is longer between (1, 1) and (2, i), the hyperbola 
y = l/x or the graph of x + 2y = 3? 

21 Find the speed dsjdt on the circle x = 2 cos 3t, y = 2 sin 3 1. 

22 Examples 1 and 7 were y *= x 3/2 and x = f 2 , y — t 3 : 

length = j£ dx y length — J 2 t 1 + 9 1* dt. 

Show by substituting x = that these integrals agree. 

23 Instead of y — f(x) a curve can be given as x = g(y)> Then 

ds = ^/{dx) 2 + (dy) z = Jidxldy) 1 + 1 dy. 

Draw x = 5y from y = 0 to y ~ 1 and find its length, 

24 The length of x-y vl from (0,0) to (1,1) is 
JVs = j vfy+1 dy . Compare with Problem l: Same length? 
Same curve? 

25 Find the length of x = {(e y + e~ y ) from y= — 1 to y = l 
and draw the curve. 

26 The length of x — g(y) is a special case of equation (6) w ith 

y = t and x — g(r). The length integral becomes . 

27 Plot the point x = 3 cos t y y = 4 sin t at the five times 
t = 0, nj 2y 7 T, 37t/2, 2k. The equation of the curve is 

(x/3) 2 + (y/4) 2 = l, not a circle but an . This curve 

cannot be written as y = f(x) because . 

28 (a) Find the length of x = cos 2 r, y = sin 2 t y 0 ^ y < n. 

(b) Why does this path stay on the line x + y= 1? 

(c) Why isn’t the path length equal to >/2? 
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29 (important) The line y = x is close to a staircase of pieces 
that go straight across or straight up. With 100 pieces of length 
Ax = 1/100 or Ay = 1/100, find the length of carpet on the 
staircase. (The length of the 45 fl line is yfl . The staircase can 
be close when its length is not close.) 

30 The area of an ellipse is nab. The area of a strip around 
it (width A) is n(a + A)(b 4- A) — nab as n(a + b)A, The distance 
around the ellipse seems to be n(a + b)> But this distance is 
impossible to find— what is wrong? 

31 The point x = cos t t y = sin t,z = t moves on a space curve , 

(a) In three-dimensional spaoe ( ds ) 2 equals (dx) 2 + 
. In equation (6), ds is now dt. 


(b) This particular curve has ds = . Find its 

length from t = 0 to t = 2n. 

(c) Describe the curve and its shadow in the xy plane. 

32 Explain in 50 words the difference between a non -para- 
metric equation y=f(x) and two parametric equations 
x = x(t\y = y(ty 

33 Write down the integral for the length L of y = x 2 from 
(0, 0) to (1, 1), Show that y = ?x 2 from (0, 0) to (2, 2) is exactly 
twice as long. If possible give a reason using the graphs. 

34 (for professors) Compare the lengths of the parabola 
y = x 2 and the line y — bx from (0,0) to (b, b 2 ), Does the 
difference approach a limit as b -*• oo? 



8.3 Area of a Surface of Revolution 



This section starts by constructing surfaces. A curve y =/{ x) is revolved around an 
axis , That produces a “ surface of revolution," which is symmetric around the axis. If 
we revolve a sloping line, the result is a cone. When the line is parallel to the axis we 
get a cylinder (a pipe). By revolving a curve we might get a lamp or a lamp shade 
(or even the light bulb). 

Section 8.1 computed the volume inside that surface. This section computes the 
surface area , Previously we cut the solid into slices or shells. Now we need a good 
way to cut up the surface. 

The key idea is to revolve short straight line segments , Their slope is Ay/ Ax. They 
can be the same pieces of length As that were used to find length — now we compute 
area. When revolved, a straight piece produces a “thin band' (Figure 8, 10). The curved 
surface, from revolving y = /(x), is close to the bands. The first step is to compute the 
surface area of a band. 

A small comment: Curved surfaces can also be cut into tiny patches. Each patch 
is nearly flat, like a little square. The sum of those patches leads to a double integral 
(with dx dy ). Here the integral stays one-dimensional (dx or dy or dt). Surfaces of 
revolution are special — we approximate them by bands that go all the way around, 
A band is just a belt with a slope, and its slope has an effect on its area. 


middle radius x 



Fig, 6,10 Revolving a straight piece and a curve around the y axis and x axis. 
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0 Applications of the Integral 


Revolve a small straight piece ( length As not Ax). The center of the piece goes 
around a circle of radius r. The band is a slice of a cone . When we flatten it out 
(Problems 11-13) we discover its area. The area is the side length As times the middle 
circumference 2 nr: 

The surface area of a band is 2nrAs = 2nryfl + (Ay/ Ax) 2 Ax. 

For revolution around the y axis, the radius is r- x. For revolution around the x 
axis, the radius is the height: r — y = /(x). Figure 8. 10 shows both bands — the problem 
tells us which to use. The sum of band areas 2nr As is close to the area S of the curved 
surface. In the limit we integrate 2 nr ds: 


8C The surface area generated by revolving the curve y ~f(x) between x = a 
and x = b is 

S — J* 2n)\/\ + (< dy/dx ) 2 dx around the x axis (r — y) (1) 

S = 2nxy/l + (dy/dx) 2 dx around the y axis (r = x). (2) 


EXAMPLE 1 Revolve a complete semicircle y = y/ R 2 — x 2 around the x axis. 

The surface of revolution is a s phere . It s area (known!) is 4 tiR 2 . T he limits on x are 
- R and R. The slope of y = y/R 2 -x 2 is dy/dx = - x/y/'R 2 ^- x 2 : 

area S = J 2uy/' R 2 ^ x 1 J 1 + & x ~ J 2ttR dx - 4nR 2 . 

EXAMPLE 2 Revolve a piece of the straight line y = 2x around the x axis. 

The surface is a cone with (dy/dx) 1 - 4. The band from x = 0 to x = 1 has area 
2Ky/5: 

S = f 2ny ds = j* 2tt(2x) yf\ + 4 dx — 2 nJ~S. 

This answer must agree with the formula 2?rr As (which it came from). The line from 
(0, 0) to (1, 2) has length As = % /I. Its midpoint is (^, 1). Around the x axis, the middle 
radius is r = 1 and the area is 27i N /5. 

EXAMPLE 3 Revolve the same straight line segment around the y axis. Now the 
radius is x instead of y = 2x. The area in Example 2 is cut in half: 

S = J 2 nx ds = InXy/T^rA dx = 7 lyfi. 

For surfaces as for arc length, only a few examples have convenient answers. 
Watermelons and basketballs and light bulbs are in the exercises. Rather than stretch- 
ing out this section; we give a final area formula and show how to use it. 

The formula applies when there is a parameter t . Instead of (x, /(x)) the points on 
the curve are (x(t), y(t)). As t varies, we move along the curve. The length formula 
(ds) 2 = (dx) 2 + ( dy ) 2 is expressed in terms of t . 

For the surface of revolution around the x axis, the area becomes a f-integral: 


0D The surface area is J 2ny ds - J 27u>’(r) v /(dx/df ) 2 + (dy/dt) 2 dt . (3) 
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EXAMPLE 4 The point x = cos f, y = 5 + sin t travels on a circle with center at (0, 5). 
Revolving that circle around the x axis produces a doughnut. Find its surface area. 

Solution (dxjdt) 1 + (dyjdt) 2 — sin 2 1 + cos 2 1 = 1. The circle is complete at t = 2n: 

{ 2ny ds = Jo* 2n{5 + sin i) dt = |^27r(5r - cos 0]o* = 20 tt 2 . 


8.3 EXERCISES 


Read-through questions 

A surface of revolution comes from revolving a q around 
b . This section computes the c . When the curve is 
a short straight piece (length As), the surface is a d Its 
area is AS — e In that formula (Problem 13) r is the 
radius of f The line from (0, 0) to (1, 1) has length g 
and revolving it produces area u 

When the curve y=f[x) revolves around the x axis, the 
surface area is the integral > For y = x 2 the integral to 
compute is J When y = x 2 is revolved around the y axis, 
the area is S = * For the curve given by x = 2 f, y = f 2 , 
change ds to l dt. 

Find the surface area when curves 1-6 revolve around the x 


axis. 




1 

y 

H 

2 < .x ^ 6 

2 

y 

=* 3 , 

0« 

l 

3 

y 

= 7.x, 

-1 

< x ^ 1 (watch sign) 

4 

y 

= yfi 1 


, 0 < x ^ 2 

5 

y 

= v^ 

- X“. 

, - 1 =£ x ^ 1 

6 

y 

= cosh 

V. 

0 ^ .v ^ 1, 


In 7-10 find the area of the surface of revolution around the y 
axis. 

7 y = x\ 0 ^ x ^ 2 8 y = hx 2 + j. 0 1 

9 y = x+ 1, 0<x ^ 3 10 y = x 1/J , 1 

II A cone with base radius R and slant height .v is laid out 
flat. Fxplain why the angle (in radians) is 0 = 2 nR;s. Then the 
surface area is a fraction of a circle: 


12 A band with slant height As = s — s' and radii R and R' is 
laid out flat. Explain in one line why its surface area is 
nRs — ttRV. 




13 By similar triangles Rjs = R';s or Rs = R's. The middle 
radius r is y(R + R'). Substitute for r and As in the proposed 
area formula 2 nr As. to show that this gives the correct area 
nRs — tiRs\ 

14 Slices of a basketball all have the same area of cover, 
if they have the same thickness. 

(a) Rotate v — v / 1 — x 2 around the x axis. Show that 
d$ = 2 n dx/ 

(b) The area between x = a and x = a + h is 

(c) i of the Earth's area is above latitude . 

15 Change the circle in Example 4 to x = a cos r and y = 

b 4- a sin r. Its radius is and its center is . 

Find the surface area of a torus by revolving this circle around 
the x axis, 

16 What part of the circle x = R cos r, y = Rsinr should 
rotate around the y axis to produce the top half of a sphere? 
Choose limits on t and verify the area, 

17 The base of a lamp is constructed by revolving the quar- 
ter-circle y = v /2.v — \ 2 {x = l to x = 2) around the y axis. 
Draw the quarter-circle, find the area integral, and compute 
the area. 


area 


— I ns 
2k . 


R 

s 


ns 1 = nRs. 




18 The light bulb is a sphere of radius 1/2 with its bottom 
sliced off to fit onto a cylinder of radius 1,4 and length 1/3. 
Draw the light bulb and find its surface area (ends of the 
cylinder not included). 

19 The lamp shade is constructed by rotating y = l x around 
the y axis, and keeping the part from y = 1 to y = 2. Set up 
the definite integral that gives its surface area. 

20 Compute the area of that lamp shade. 
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B Applications of the Integral 


21 Explain why the surface area is infinite when y = \/x is 
rotated around the x axis (I ^ x < oc) t But the volume of 

“Gabriel's horn” is . It can't hold enough paint to 

paint its surface. 

22 A disk: of radius 1" can be covered by four strips of tape 
(width j"). If the strips are not parallel, prove that they can't 


cover the disk. Hint: Change to a unit sphere sliced by planes 
apart. Problem 14 gives surface area n for each slice. 

23 A watermelon (maybe a football) is the result of rotating 
half of the ellipse x = v /2cost, y — sin / (which means 
x 2 + 2 y l = 2). Find the surface area, parametrically or not. 

24 Estimate the surface area of an egg. 



8.4 Probability and Calculus 



Discrete probability usually involves careful counting. Not many samples are taken 
and not many experiments are made. There is a list of possible outcomes, and a 
known probability for each outcome. But probabilities go far beyond red cards and 
black cards. The real questions are much more practical: 

!. How often will too many passengers arrive for a flight? 

2, How many random errors do you make on a quiz? 

3* What is the chance of exactly one winner in a big lottery? 

Those are important questions and we will set up models to answer them. 

There is another point. Discrete models do not involve calculus. The number of 
errors or bumped passengers or lottery winners is a small whole number. Calculus 
enters for continuous probability ♦ Instead of results that exactly equal 1 or 2 or 3, 
calculus deals with results that fall in a range of numbers. Continuous probability 
comes up in at least two ways: 

(A) An experiment is repeated many times and we take averages. 

(B) The outcome lies anywhere in an interval of numbers. 

In the continuous case, the probability p n of hitting a particular value x — n becomes 
zero. Instead we have a probability density p(x) — which is a key idea. The chance that 
a random X falls between a and b is found by integrating the density p{ x): 

Prob \ a ^ X ^ h\ - p(.v) dx , (l) 

Roughly speaking, p(x)dx is the chance of falling between x and * + dx , Certainly 
p(;c) ^ 0. If a and b are the extreme limits — oc and including all possible outcomes, 
the probability is necessarily one: 

Prob )-■/. < A" < -r | > p[ \) dx = 1 . (2) 

This is a case where infinite limits of integration are natural and unavoidable. In 
studying probability they create no difficulty — areas out to infinity are often easier. 
Here are typical questions involving continuous probability and calculus: 

4* How conclusive is a 53%-47% poll of 2500 voters? 

5* Are 16 random football players safe on an elevator with capacity 3600 pounds? 
6, How long before your car is in an accident? 

It is not so traditional for a calculus course to study these questions. They need extra 
thought, beyond computing integrals (so this section is harder than average). But 
probability is more important than some traditional topics, and also more interesting. 
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Drug testing and gene identification and market research are major applications. 
Comparing Questions 1-3 with 4-6 brings out the relation of discrete to continuous — 
the differences between them, and the parallels. 

It would be impossible to give here a full treatment of probability theory. I believe 
you will see the point (and the use of calculus) from our examples. Frank Morgan’s 
lectures have been a valuable guide. 

DISCRETE RANDOM VARIABLES 


A discrete random variable X has a list of possible values. For two dice the outcomes 
are X = 2, 3, ..., 12. For coin tosses (see below), the list is infinite: X = 1, 2, 3, .... 

A continuous variable lies in an interval a < X ^ b. 


EXAMPLE 1 Toss a fair coin until heads come up. The outcome X is the number of 
tosses . The value of X is 1 or 2 or 3 or ... , and the probability is \ that X = 1 (heads 
on the first toss). The probability of tails then heads is p 2 ~ i. The probability that 
X — n is p n = (£)^ — this is the chance of n — 1 tails followed by heads. The sum of all 
probabilities is necessarily 1 : 

Pi + Pz + P3+ — = £ + i + i+ — = 1* 


EXAMPLE 2 Suppose a student (not you) makes an average of 2 unforced errors per 
hour exam. The number of actual errors on the next exam is X = 0 or 1 or 2 or .. . . 
A reasonable model for the probability of n errors — when they are random and 
independent — is the Poisson model (pronounced Pwason): 

2 n 

p n = probability of n errors = — e " 2 . 

rt! 


The probabilities of no errors, one error, and two errors are p Q , p it and p 2 : 




.135 




^ .27. 


The probability of more than two errors is 1 — .135 — .27 — .27 = .325. 

This Poisson model can be derived theoretically or tested experimentally. The total 
probability is again 1, from the infinite series (Section 6.6) for e 2 : 


Po + Pi + P2 + 



( 3 ) 


EXAMPLE 3 Suppose on average 3 out of 100 passengers with reservations don’t 
show up for a flight. If the plane holds 98 passengers, what is the probability that 
someone will be bumped* 


If the passengers come independently to the airport, use the Poisson model with 2 
changed to 3. X is the number of no-shows, and X = n happens with probability p n : 




3° _3 - 3 

Po=y e 3 = e 


Fl= u e 


-3 _ 




-3 


There are 98 seats and 100 reservations. Someone is bumped if X = 0 or X = 1: 

chance of bumping = p 0 + p x =e~ 2 + 3c“ 3 ft 4/20. 

We will soon define the average or expected value or mean of X — this model has p — 3. 
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CONTINUOUS RANDOM VARIABLES 

If X is the lifetime of a VCR, all numbers X ^ 0 are possible. If X is a score on the 
SAT, then 200 ^ X ^ 800. If X is the fraction of computer owners in a poll of 600 
people, X is between 0 and L You may object that the SAT score is a whole number 
and the fraction of computer owners must be 0 or 1/600 or 2/600 or .... But it is 
completely impractical to work with 601 discrete possibilities. Instead we take X to 
be a continuous random variable , falling anywhere in the range X ^ 0 or [200, 800] or 

0 ^ X ^ 1, Of course the various values of X are not equally probable. 

EXAMPLE 4 The average lifetime of a VCR is 4 years. A reasonable model for break- 
down time is an exponential random variable. Its probability density is 

p(jc) = \e~ xl * for 0 < oo. 

The probability that the VCR will eventually break is 1: 

Jo” &-* 14 dx = [ - ' e'*' 4 ! = o - 1) = 1. (4) 

The probability of breakdown within 12 years (X from 0 to 12) is ,95: 

fa 1 ie~ xl * dx = [~e~ xl ^ 2 = -e~* + 1 » .95. (5) 

An exponential distribution has p(x) - ae~ ax . Its integral from 0 to x is F(x) = 

1 — e~ ax . Figure 8.11 is the graph for a = 1, It shows the area up to x = 1. 

To repeat: The probability that a ^X is the integral of p(x) from a to b . 



fig, 8,11 Probabilities add to £ p n = 1. Continuous density integrates to f p(x) dx = 1. 


EXAMPLE 5 We now define the most important density function. Suppose the 
average SAT score is 500, and the standard deviation {defined below — it measures the 
spread around the average) is 200. Then the normal distribution of grades has 

p(x) = 2 < 200 > 2 for - x < x < x, 

200^/2* 

This is the normal {or Gaussian) distribution with mean 500 and standard deviation 
200. The graph of p(x) is the famous bell-shaped curve in Figure 8,12, 

A new objection is possible. The actual scores are between 200 and 800, while the 
density p(jc) extends ail the way from — oo to so. I think the Educational Testing 
Service counts all scores over 800 as 800. The fraction of such scores is pretty small — 
in fact the normal distribution gives 

Prob{X^800}= f L^ e -(*-s°o)j/ 2 < 2 oo)> dx ~ .0013. (6) 

J 800 200 v / 2 tc 
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p-2ap-c p /z + a^ + 2cr p- 2tr p-a p /i + cr /* + 2u 


Fig. 8.12 The normal distribution (bell-shaped curve) and its cumulative density F(x). 

Regrettably, e~ xl has no elementary antiderivative. We need numerical integration. 
But there is nothing the matter with that! The integral is called the “error function” 
and special tables give its value to great accuracy. The integral of e"* 2/2 from — oo 
to oo is exactly Then division by ,j2n keeps J p(x) dx — 

Notice that the normal distribution involves two parameters . They are the mean 
value (in this case p = 500) and the standard deviation {in this case o = 200). Those 
numbers mu and sigma are often given the “normalized” values ft = 0 and <j = 1: 

p(x) = — — e -i x -rt 2 l2° 2 becomes p(x) = \ — 

njln V 2 n 

The bell-shaped graph of p is symmetric around the middle point x = p. The width 
of the graph is governed by the second parameter o — which stretches the x axis and 
shrinks the y axis (leaving total area equal to 1). The axes are labeled to show the 
standard case p = 0, a — 1 and also the graph for any other p and a. 

We now give a name to the integral of p(x). The limits will be - oo and x, so the 
integral F(x) measures the probability that a random sample is below x : 

Prob {A r < x} = J* ^ p(x) dx = cumulative density function F(x). (7) 

F(x) accumulates the probabilities given by p(x), so dFjdx — p(x). The total prob- 
ability is F(oo) = 1. This integral from - oo to.oo covers all outcomes. 

Figure 8. 12b shows the integral of the bell-shaped normal distribution. The middle 
point x — p has F = j. By symmetry there is a 50-50 chance of an outcome below the 
mean. The cumulative density F(x) is near .16 at p — a and near .84 at p-f cr. The 
chance of falling in between is .84— .16 = .68. Thus 68% of the outcomes are less 
than one deviation a away from the center p. 

Moving out to p-2a and p + 2cr, 95% of the area is in between. With 95% 
confidence X is less than two deviations from the mean. Only one sample in 20 is 
further out (less than one in 40 on each side). 

Note that a = 200 is not the precise value for the SAT! 

MEAN, VARIANCE, AND STANDARD DEVIATION 

In Example 1, X was the number of coin tosses until the appearance of heads. The 
probabilities were p x p 2 = s, Ps~ What is the average number of tossesl 

We now find the “mean” p of any distribution p(x) — not only the normal distribution, 
where symmetry guarantees that the built-in number p is the mean. 

To find p, multiply outcomes by probabilities and add\ 

fi - mean = «/>„= l(p.) + 2(p 2 ) + 3(p 3 ) + 


( 8 ) 
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The average number of tosses is l(^) + 2{£) + 3(^)+ This series adds up (in 
Section 10,1) to p = 2. Please do the experiment 10 times, 1 am almost certain that 
the average will be near 2. 


When the average is A = 2 quiz errors or A = 3 no-shows, the Poisson probabilities 
are p n = X n e~ k jn\ Check that the formula p — E np n does give A as the mean: 


A * A 2 A 3 

1 — + 2— + 3 — + 

1! 2! 3! 


1 , J 

! 

1 

II 

1 

1 

L 1+ ii + ^ + -J 


, - A _ 


= Xe x e l = X. 


For continuous probability, the sum /i = £np n changes to p = § xp(x)dx. We 
multiply outcome x by probability p{jc) and integrate. In the VCR model, integration 
by parts gives a mean breakdown time of p = 4 years: 

f x p(x) dx = J" x(ie~ xl *) dx = \_-xe-* 14 ~ 4e~ xl *~\^ = 4. (9) 


Together with the mean we introduce the variance , It is always written ff 2 , and in 
the normal distribution that measured the ‘"width” of the curve. When a 2 was 200 3 , 
SAT scores spread out pretty far. If the testing service changed to u 2 = l 2 , the scores 
would be a disaster, 95% of them would be within ± 2 of the mean. When a teacher 
announces an average grade of 72, the variance should also be announced — if it is 
big then those with 60 can relax. At least they have company. 


8E The mean p is the expected value of X. The variance <r 2 is the expected 
value of (X — mean) 2 — (X- p) 2 . Multiply outcome times probability and add: 

P = £ *Pn c 2 = 'L( n ~ P) 2 P» (discrete) 

P = J _ „ xp(x) dx a 2 = J® „ (x - (if p(x) dx (continuous) 


The standard deviation (written a) is the square root of rr 2 , 

EXAMPLE 6 (Yes-no poll, one person asked) The probabilities are p and 1 - p. 

A fraction p = ^ of the population thinks yes , the remaining fraction 1 - p = \ thinks 
no. Suppose we only ask one person. If X = 1 for yes and X = 0 for no, the expected 
value of X is p = p = The variance is a 2 ~ p(l — p) = 



The standard deviation is <r = ^2/9, When the fraction p is near one or near zero, 
the spread is smaller — and one person is more likely to give the right answer for 
everybody. The maximum of a 1 = p( 1 — p) is at p = \ f, where a = j. 

The table shows p and a 2 for important probability distributions. 


Model 

Mean 

Variance 

Application 

Pi = P> Po = 1 — P 

P 

p(i - p) 

yes-no 

Poisson p n = X a e~ x (nl 

A 

A 

random occurrence 

Exponential p(x) = ae~ ax 

i/o 

1 ja 2 

waiting time 

Normal p(x) = }— 

yjlna 

p 

a 2 

distribution 
around mean 
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THE LAW OF AVERAGES AND THE CENTRAL LIMIT THEOREM 

We come to the center of probability theory (without intending to give proofs)* The 
key idea is to repeat an experiment many times — poll many voters, or toss many 
dice, or play considerable poker. Each independent experiment produces an outcome 
X , and the average from N experiments is X. It is called “ X bar”: 

* Xi + Xz* - +X S 

X = — = average outcome. 

All we know about p(x) is its mean p and variance a 2 . It is amazing how much 
information that gives about the average X : 


8F Law of Averages : X is almost sure to approach p as N -* go* 

Central Limit Theorem: The probability density p*(*) for X approaches 
a normal distribution with the same mean p and with variance u 2 jN. 


No matter what the probabilities for X r the probabilities for X move toward the normal 
bell-shaped curve * The standard deviation is close to <t/^/n when the experiment is 
repeated N times. In the Law of Averages, “almost sure” means that the chance of 
X not approaching p is zero* It can happen, but it won’t. 

Remark 1 The Boston Globe doesn’t understand the Law of Averages* I quote from 
September 1988: “What would happen if a giant Red Sox slump arrived? What would 
happen if the fabled Law of Averages came into play, reversing all those can't miss 
decisions during the winning streak?” They think the Law of Averages evens every- 
thing up, favoring heads after a series of tails* See Problem 20* 

EXAMPLE 7 Yes-no poll of N — 2500 voters. Is a 53% -47% outcome conclusive*? 

The fraction p of “yes” voters in t he whole population is not known. That is the reason 
for the poll. The deviation a = *Jp{ 1 — p ) is also not known, but for one voter this is 
never more than \ (when p = ^)* Therefore trj^/N for 2500 voters is no larger than 
£/y2500, which is 1%* 

The result of the poll was X = 53%, With 95% confidence, this sample is within 
two standard deviations (here 2%) of its mean* Therefore with 95% confidence, the 
unknown mean p = p of the whole population is between 51% and 55%* This poll is 
conclusive. 

If the true mean had been p = 50%, the poll would have had only a *0013 chance 
of reaching 53%. The error margin on each side of a poll is amazingly simple; it is 
always 1/^/iV* 

Remark 2 The New York Times has better mathematicians than the Globe. Two 
days after Bush defeated Dukakis, their poll of N= 11,645 voters was printed with 
the following explanation* “In theory, in 19 cases out of 20 [there is 95 %] the results 
should differ by no more than one percentage point [there is 1 I^/n] from what 
would have been obtained by seeking out all voters in the United States*” 

EXAMPLE 8 Football players at Caltech (if any) have average weight p = 210 pounds 
and standard deviation a — 30 pounds. Are N — 16 players safe on an elevator with 
capacity 3600 pounds? 16 times 210 is 3360. 
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The average weight X is approximately a normal random variable with = 210 and 
cr= 30 (yfN = 30/4* There is only a 2% chance that X is above + 2d - 225 {see 
Figure 8.12b — weights below the mean are no problem on an elevator)* Since 16 
times 225 is 3600, a statistician would have 98% confidence that the elevator is safe* 
This is an example where 98% is not good enough — I wouldn’t get on* 

EXAMPLE? (The famous Weldon Dice) Weldon threw 12 dice 26,306 times and 
counted the 5’s and 6 ’s, They came up in 33*77% of the 315,672 separate rolls* Thus 
X = .3377 instead of the expected fraction p = ^ of 5’s and 6 ’s. Were the dice fair? 

The variance in eac h roll is a 2 = p(l — p)~ 2 / 9 . The standard deviation of X is 
6 = a/y/N = ^ 2 / 9^3 15672 ^ *00084* For fair dice, there is a 95 % chance that X 
will differ from 3 by less than 2d * (For Poisson probabilities that is false* Here X is 
normal) But .3377 differs from .3333 by more than 5d. The chance of falling 5 standard 
deviations away from the mean is only about 1 in 10 , 000 ,+ 

So the dice were unfair. The faces with 5 or 6 indentations were tighter than the 
others, and a little more likely to come up* Modern dice are made to compensate for 
that, but Weldon never tried again. 


6.4 EXERCISES 


Read-througb questions 

Discrete probability uses counting, o probability uses 
calculus* The function p{x) is the probability b . The 
chance that a random variable fails between a and b is c 
The total probability is p(x) dx = <1 . Tn the dis- 

Crete case I p*. = e * The mean (or expected value) 
is p = | f in the continuous case and p = I np n in 
the g 

The Poisson distribution with mean A has p n = h , The 
sum T p„ = 1 comes from the 1 series. The exponential 
distribution has p(x) = r*or le^ 2x or J The standard 
Gaussian (or k ) distribution has y j2Kp(x) — e~* 2/2 . Its 
graph is the well-known 1 curve. The chance that the 
variable falls below x is F(x) = rn . f is the n density 
function. The difference F(x + dx) — F(x) is about o * 
which is the chance that X is between x and x + dx. 

The variance , which measures the spread around p r is 
a 2 = j p in the continuous case and o 2 = L q in the 
discrete case. Its square root a is the 1 The normal 
distribution has p(x) = » . If X is the t of N samples 
from any population with mean p and variance a 2 , the Law 
of Averages says that X will approach u The Central 
Limit Theorem says that the distribution for X approaches 

v , Its mean is w and its variance is * 


In a yes-no poll when the voters are 50-50, the mean for 
one voter is p = Off) + !(!) = y . The variance is 
( 0 - p) 2 Po + (1 -p) z Pi = 1 * Fora poll with N = 100 , a is 

a , There is a 95% chance that X (the fraction saying yes) 
will be between b and c 

1 If Pi = 5 , p 2 — z, Pi = h what is the probability of an 
outcome X < 4? What are the probabilities of X = 4 and 
X >4? 

2 With the same p„ = (ir, what is the probability that X is 
odd? Why is p n = (i) n an impossible set of probabilities? 
What multiple is possible? 

3 Why is p(x) = e~ 2x not an acceptable probability density 
for x>0? Why is p(x) = 4e~ 2x — e~ x not acceptable? 

*4 If p n = (£)", show that the probability P that X is a prime 
number satisfies 6/ 16 =£ P ^ 7/16. 

5 If p(x) = e~* for x 5= 0, find the probability that X > 2 and 
the approximate probability that 1 < X < 1.01. 

6 If p(x) = Cjx 3 is a probability density for x ^ 1 , find the 
constant C and the probability that X ^ 2, 

7 If you choose x completely at random between 0 and it, 
what is the density p(x) and the cumulative density F(x)? 


tJoe DiMaggkfs 56-game hitting streak was much more improbable — I think it is statistically 
the most exceptional record in major sports. 
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In 8-13 find the mean mine fi — £ np„ or fi = J xp(x) dx. 

8 Po = 1/2, Pi = 1/4, pi = 1/4 

9 Pi = 1/7, pi = 1/7 p, = 1/7 

10 p,= l/n!e (p a = 1/e, p, = 1/e, p 2 = l/2e, ...) 

11 p(x) = 2ln(l +x 2 ), x>0 

12 p(x) = e x (integrate by parts) 

13 p(.x) = a£”“ (integrate by parts) 

14 Show by substitution that 

dx = s flo e-“ 2 du = JTnc. 

15 Find the cumulative probability F (the integral of p ) in 
Problems 11, 12, 13. In terms of F, what is the chance that a 
random sample lies between a and bl 

16 Can-Do Airlines books 100 passengers when their plane 
only holds 98. If the average number of no-shows is 2, what 
is the Poisson probability that someone will be bumped? 

17 The waiting time for a bus has probability density 
(l/10)e’ wl °, with p = 10 minutes. What is the probability of 
waiting longer than 10 minutes? 

18 You make a 3-minute telephone call. If the waiting time 
for the next incoming call has p(x) = e"*, what is the prob- 
ability that your phone will be busy? 

19 Supernovas are expected about every 100 years. What is 
the probability that you will be alive for the next one? Use a 
Poisson model with k — .01 and estimate your lifetime, (Super- 
novas actually occurred in 1054 (Crab Nebula), 1572, 1604, 
and 1987. But the future distribution doesn't depend on the 
date of the last one,) 

20 (a) A fair coin comes up heads 10 times in a row. Will 
heads or tails be more likely on the next toss? 

(b) The fraction of heads after N tosses is a. The expected 
fraction after 2 N tosses is . 

21 Show that the area between p and p + a under the bell- 
shaped curve is a fixed number (near 1/3), by substituting 

y= : 

*p+a 1 fl 1 

dx = ~ e -* 2fl dy , 

Jti Cyjln Jo yj2n 

What is the area between p — a and pi The area outside 
(p - a, p + ff)? 

22 For a yes-no poll of two voters, explain why 

Po = (1 - P) 2 > Pi=2p- 2 p 2 f p 2 = p\ 

Find p and c 2 * N voters give the “binomial distribution " 


23 Explain the last step in this reorganization of the formula 
for a 2 : 

a 2 = J (x - tifp{x) dx = }(* 2 - 2xfi + p 2 )p(x) dx 
= f x 2 p(x) dx-2p\ xp(x) dx + p 2 l p(jc) dx 
= J x 2 p(x) dx—fi 2 . 

24 Use | (jc — p) 2 p(x) dx and also J x 2 p(x) dx — p 2 to find a 2 
for the uniform distribution: p(x) = 1 for 0 < x < 1. 

25 Find(j 2 ifp 0 - 1/3, p x = 1/3, p 2 = 1/3. Usel(n — jd^and 
also I n 2 p n — p 2 . 

26 Use Problem 23 and integration by parts (equation 7,1.10) 
to find a 2 for the exponential distribution p(x) = 2e~ 2x for 
x^0, which has mean J, 

27 The waiting time to your next car accident has probability 
density p(x) = ie~ xf2 . What is pi What is the probability of 
no accident in the next four years? 

28 With p = i, 4, .,,, find the average number p of coin 
tosses by writing Pi + 2p 2 + 3py -\ — 35 (Pi+Pa+P3H — )+ 
(Pi + Ps + P^+ *") + (P3 + P4 + Ps + "*) + ■■■. 

29 In a poll of 900 Americans, 30 are in favor of war. What 
range can you give with 95% confidence for the percentage 
of peaceful Americans? 

30 Sketch rough graphs of p(x) for the fraction x of heads in 
4 tosses of a fair coin, and in 16 tosses. The mean value is J. 

31 A judge tosses a coin 2500 times. How many heads does 
it take to prove with 95% confidence that the coin is unfair? 

32 Long-life bulbs shine an average of 2000 hours with stan- 

dard deviation 150 hours. You can have 95% confidence that 
your bulb will fail between and hours. 

33 Grades have a normal distribution with mean 70 and stan- 
dard deviation 10. If 300 students take the test and passing is 
55, how many are expected to fail? (Estimate from 
Figure 8. 12b.) What passing grade will fail 1/10 of the, class? 

34 The average weight of luggage is p = 30 pounds with devi- 
ation ff = 8 pounds. What is the probability that the luggage 
for 64 passengers exceeds 2000 pounds? How does the answer 
change for 256 passengers and 8000 pounds? 

35 A thousand people try independently to guess a number 
between 1 and 1000. This is like a lottery, 

(a) What is the chance that the first person fails? 

(b) What is the chance P 0 that they all fail? 

(c) Explain why P 0 is approximately 1 je. 

36 (a) In Problem 35, what is the chance that the first person 
is right and all others are wrong? 

(b) Show that the probability P x of exactly one winner is 
also close to 1/e. 

(c) Guess the probability P B of n winners (fishy question). 
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8.5 Masses and Moments 



This chapter concludes with two sections related to engineering and physics. Each 
application starts with a finite number of masses or forces. Their sum is the total 
mass or total force. Then comes the “continuous case," in which the mass is spread 
out instead of lumped. Its distribution is given by a density function p (Greek rho}, 
and the sum changes to an integral. 

The first step (hardest step?) is to get the physical quantities straight. The second 
step is to move from sums to integrals (discrete to continuous, lumped to distributed). 
By now we hardly stop to think about it — although this is the key idea of integral 
calculus. The third step is to evaluate the integrals. For that we can use substitution 
or integration by parts or tables or a computer. 

Figure 8.13 shows the one-dimensional case: masses along the x axis . The total 
mass is the sum of the masses. The new idea is that of moments — when the mass or 
force is multiplied by a distance: 

moment of mass around the y axis = mx = (mass) times (distance to axis). 



M 

It i 

f i 

F 


) — * • — 

X 


X 



moment I 


in , x | + m t x + nt ; .v , = AY.v 


torque 


i 1 1 


Fig. 8,13 The center of mass is at x = (total Tnoment).(total mass) = average distance. 


The figure has masses E 3, 2, The total mass is 6. The “lever arms" or “moment 
arms" are the distances jc = 1, 3, 7. The masses have moments 1 and 9 and 14 (since 
mx is 2 times 7), The total moment is I + 9 + 14 = 24, Then the balance point is at 
x - MJM = 24/6 - 4, 

The total mass is the sum of the m’s. The total moment is the sum of m n times x n 
(negative on the other side of x = 0). If the masses are children on a seesaw, the 
balance point is the center of gravity x — also called the center of mass: 


DEFINITION 


^ m n x n _ total moment 
^ m„ total mass 


If all masses are moved to x, the total moment (6 times 4) is still 24. The moment 
equals the mass X m n times x* The masses act like a single mass at x. 

Also: If we move the axis to x, and leave the children where they are, the seesaw 
balances. The masses on the left of x = 4 will offset the mass on the right. Reason: 
The distances to the new axis are x„ - x. The moments add to zero by equation (1): 


moment around new axis = ^ m n (x„ - x) = ^ m„x a - ^ m n x = 0. 


Turn now r to the continuous case , when mass is spread out along the line. Each 
piece of length Ax has an average density p n - (mass of piece)/(length of piece) = 
Aw; Ax. As the pieces get shorter, this approaches dmidx — the density at the point. 
The limit of (small mass)-(small length) is the density p(x). 

Integrating that derivative p = dmdx, wc recover the total mass: X p n Ax becomes 


total mass M = ) p(xj dx. (2) 

When the mass is spread evenly, p is constant. Then M = pL = density times length. 
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The moment formula is similar. For each piece, the moment is mass p n &x 
multiplied by distance x — and we add* In the continuous limit, p(x) dx is multiplied 
by x and we integrate: 

total moment around y axis ~ M y = j xp(x) dx. (3) 

Moment is mass times distance. Dividing by the total mass M gives “average 
distance’*: 


center of mass x = 


moment _ M y _ J xp(x) dx 
mass M f p(x) dx 


(4) 


Remark If you studied Section 8*4 on probability, you will notice how the formulas 
match up. The mass j p(x)dx is like the total probability j p(x)dx. The moment 
j xp(x ) dx is like the mean j xp(x) dx. The moment of inertia j (x - x) 2 p(x) dx is the 
variance. Mathematics keeps hammering away at the same basic ideas! The only 
difference is that the total probability is always 1. The mean really corresponds to 
the center of mass x, but in probability we didn’t notice the division by J p(x)dx— 1 . 


EXAMPLE 1 With constant density p from 0 to L, the mass is M - pL. The moment 
is 

M * = So X P dx = ?P x2 ~§) = : iP LZ - 

The center of mass is x— M y /M = Lj2, It is halfway along* 

EXAMPLE 2 With density e~ x the mass is 1, the moment is 1, and x is 1: 

Jo e~ x dx = 1 and xe~ x dx = [ — xe~ x — e~ x = 1* 


MAS5E5 AND MOMENTS IN TWO DIMENSIONS 


Instead of placing masses along the x axis, suppose m 1 is at the point (x 1 , yj in the 
plane. Similarly m n is at (x fl , y„ )* Now there are two moments to consider. Around the 
y axis M y = E m B x n and around the x axis M x — £ Please notice that the x’s go 
into the moment M y — because the x coordinate gives the distance from the y axis! 

Around the x axis, the distance is y and the moment is The center of mass is 
the point ( x , y) at which everything balances: 


- _ M y _ £ m n x n 
M £ m » 


and 


- _ _ Z m »yn 


y M 


m„ 


(5) 


In the continuous case these sums become two-dimensional integrals* The total 
mass is JJ p(x t y) dx dy , when the density is p = mass per unit area. These “double 
integrals” are for the future (Section 14. 1), Here we consider the most important case: 
p = constant * Think of a thin plate, made of material with constant density (say 
p = 1). To compute its mass and moments, the plate is cut into strips (Figure 8,14): 


mass M = area of plate 

moment M y = j (distance x) (length of vertical strip) dx 
moment M x = j (height y) (length of horizontal strip) dy * 


( 6 ) 

(7) 

( 8 ) 
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The mass equals the area because p = 1. For moments, all points in a vertical strip 
are the same distance from the y axis. That distance is x . The moment is x times area, 
or x times length times dx — and the integral accounts for all strips. 

Similarly the x - moment of a horizontal strip is y times strip length times dy. 


EXAMPLE 3 A plate has sides x = 0 and y — 0 and y = 4 - 2x. Find Af, Af p , Af x . 

mass Af = area = y dx = (4 — 2x) dx = [ Ax — x 2 ]q = 4. 

The vertical strips go up to y = 4 — 2x, and the horizontal strips go out to x = y(4 — y): 


moment AT, = 


moment Af* = 


r 2 
0 


x(4 - 2x) dx = 


lx 2 — - JC 3 

3 


y-(4- y ) dy = 


8 

' 3 

16 

T‘ 


The “center of mass” has x = M y /M = 2/3 and y — Af x /Af = 4/3. This is the centroid 
of the triangle (and also the “center of gravity”). With p = 1 these terms all refer to 
the same balance point ( x f y ). The plate will not tip over, if it rests on that point. 


EXAMPLE 4 Find Af^ and Af x for the half-circle below x 2 + y 2 = r 2 . 

M y = 0 because the region is symmetric — Figure 8.14 balances on the y axis. In the 
x-moment we integrate y times the length of a horizontal strip (notice the factor 2): 

M, = To y ■ 2 dy = - !(r 2 - y 2 ) w i = |r 3 . 

Divide by the mass (the area jitr 2 ) to find the height of the centroid: y — Af x /Af = 
4rf3n. This is less than \r because the bottom of the semicircle is wider than the top. 


MOMENT OF INERTIA 


The moment of inertia comes from multiplying each mass by the square of its 
distance from the axis. Around the y axis, the distance is x . Around the origin, it is r: 

ly = £ and /, = I y*m„ and / 0 = I r„ 2 m„. 

Notice that J x + I y = I 0 because x 2 + y 2 = r 2 . In the continuous case we integrate. 

The moment of inertia around the y axis is I y — JJ x 2 p(x, y) dx dy . With a constant 
density p- 1, we again keep together the points on a strip. On a vertical strip they 
share the same x. On a horizontal strip they share y: 

I y = J (x 2 ) (vertical strip length) dx and / x = j (y 1 ) (horizontal strip length) dy. 
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In engineering and physics, it is rotation that leads to the moment of inertia. Look 
at the energy of a mass m going around a circle of radius r. It has 1 Q = mr 2 . 

kinetic energy = ?mv 2 = \m(r<o} 7 = ^/ 0 oF. (9) 

The angular velocity is w (radians per second). The speed is v = rco (meters per second). 

An ice skater reduces / 0 by putting her arms up instead of out. She stays dose to 
the axis of rotation { r is small). Since her rotational energy 7 / 0 w 2 does not change, 
ay increases as / 0 decreases. Then she spins faster. 

Another example: It takes force to turn a revolving door. More correctly, it takes 
torque. The force is multiplied by distance from the turning axis: T = Fx, so a push 
further out is more effective. 

To see the physics, replace Newton’s law F = ma = m dvjdt by its rotational form: 
T = I dayjdt. Where F makes the mass move, the torque T makes it turn. Where m 
measures unwillingness to change speed, l measures unwillingness to change rotation. 

EXAMPLE 5 Find the moment of inertia of a rod about (a) its end and (b) its center. 

The distance x from the end of the rod goes from 0 to L. The distance from the center 
goes from — Ljl to L/2. Around the center, turning is easier because / is smaller: 

'end = jo X 2 dx = \L? / cen(er = \ L ! 2 Lj2 X 2 dx = i^L 3 . (10) 


(J) ill - .v -dm 


CD' =p-' : ^ 




Fig. 8,15 Moment of inertia for rod and propeller. Rolling balls beat cylinders. 


MOMENT OF INERTIA EXPERIMENT 


Experiment : Roll a solid cylinder {a coin), a hollow cylinder (a ring), a solid ball (a 
marble), and a hollow ball (not a pingpong ball) down a slope. Galileo dropped things 
from the Leaning Tower— this experiment requires a Leaning Table. Objects that fall 
together from the tower don’t roll together down the table. 


Question 1 
Question 2 
Question 3 
Question 4 


What is the order of finish? Record your prediction Jirst\ 

Does size make a difference if shape and density are the same? 
Does density make a difference if size and shape are the same? 
Find formulas for the velocity v and the finish time T. 


To compute r, the key is that potential energy plus kinetic energy is practically 
constant. Energy loss from rolling friction is very small. If the mass is m and the 
vertical drop is / 1 , the energy at the top (all potential) is mgh. The energy at the bottom 
{all kinetic) has two parts: \mv 2 from movement along the plane plus jlto 2 from 
turning. Important fact: 1 )=-a>r for a rolling cylinder or ball of radius r. 
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Equate energies and set a> = vlr: 


r 1 ? 1 r 7 1 J ^ I \ 

mgh = _ mr + - /or = - mv z \ 1 ■+■ — 7 L 
2 2 2 \ mr ) 

The ratio limr 1 is critical. Call it J and solve (11) for v 2 : 

2gh 

v 2 = -j (smaller J means larger velocity). 


01 ) 


( 12 ) 


The order of J' s, for different shapes and sizes, should decide the race. Apparently 
the density doesn't matter, because it is a factor in both / and m — so it cancels in 
J = ljmr 2 < A hollow cylinder has J = 1 , which is the largest possible all its mass is 
at the full distance r from the axis. So the hollow cylinder should theoretically come 
in last. This experiment was developed by Daniel Drucker, 

Problems 35 37 find the other three ^'s. Problem 40 finds the time Tby integration. 
Your experiment will show how close this comes to the measured time. 


8.5 EXERCISES 


Read-through questions 

If masses m H arc at distances x fl , the total mass is Af — o , 
The total moment around x = 0 is Af y = b . The center of 
mass is at x = e . | n the continuous case, the mass distri- 
bution is given by the d p(x). The total mass is Af = 
e and the center or mass is at x = * With p = x\ 

the integrals from 0 to L give M = 9 and J xp{x) dx = 

h and x = I The total moment is the same if the 
whole mass Af is placed at I 

In a plane, with masses at the points y fl ), the moment 
around the y axis is k The center of mass has x = 1 

and f = m , For a plate with density p — l, the mass Af 
equals the n [f the plate is divided into vertical strips of 
height y(x), then Af = j y(x) dx and M y = \ o dx. For a 
square plate 0<x, y ^ L. the mass is Af = P and the 
moment around the y axis is M y = q The center of mass 
is at (x, y) = r This point is the * where the plate 
balances. 

A mass m at a distance x from the axis has moment of 
inertia I = t A rod with p = t from x = a to x = h has 
S y = u For a plate with p = 1 and strips of height y(x), 
this becomes /,. — ( v The torque T is w times 

x 


Compute the mass Af along the x axis, the moment M y around 
x = 0, and the center of mass x = Af y / Af . 

1 = 2 at x, = 1, m 2 = 4 at x 2 = 2 

2 m = 3 at x = 0, 1, 2, 6 

3 p = I for — 1 < a ^ 3 

4 p = x 2 for 0 ^ x 


5 p — 1 for 0 ^ x < 1 , p = 2 for l ^ \ ^ 2 

6 p = sin x for 0 ^ x ^ 7T 

Find the mass Af, the moments Af y and M A1 and the center of 
mass (x, y). 

7 Unit masses at (x, y) = (1,0), (0, 1), and (FI) 

8 m, - I at ( 1, 0), m 2 = 4 at (0, 1) 

9 p = 7 in the square 0 ^ x ^ l, 0 ^ y ^ L 

10 p = 3 in the triangle with vertices (0, 0), (a, 0), and (0, b). 

Find the area Af and the centroid (x, y) inside curves 11-16, 

U y — y/ 1 — x 2 , y — 0, x — 0 (quarter-circle) 

12 y = x. v = 2 ■ x, y = 0 (triangle) 

13 y = t'“ 2 -\ y — 0, x — 0 (infinite dagger) 

14 y = x 2 h y = x (lens) 

15 x 2 + y 2 = 1, x 2 + y 2 = 4 (ring) 

16 x 2 + y 2 = 1, x 2 - y 2 = 4, y = 0 (half-ring). 

Verify these engineering formulas for l y with p — l: 

17 Rectangle bounded by x = 0, x = a. y = 0, y = b: 
l y = a*h;X 

18 Square bounded by x = — jti, x = 4a, y — - la, y = 4u: 

/, = fl 4 /12. 

19 Triangle bounded by x = 0, y = 0, x + y = a: I y = a*j 1 2. 

20 Disk of radius a centered at x = y = 0: J v = tujT;4. 
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21 The moment of inertia around the point x = t of a rod 
with density p(x) is / = j (x — r) 2 p(x) dx. Expand (x - f) 2 and 
/ into three terms* Show that dljdt — 0 when f = x. The 
moment of inertia is smallest around the center of mass* 

22 A region has x = 0 if Af v = J x(height of strip) </x = 0. 
The moment of inertia about any other axis x = c is 
/ = J (x — c) 2 (height of strip) dx. Show that / = / y -h 
(area)(c 2 ). This is the parallel axis theorem: 1 is smallest 
around the balancing axis c - 0. 

23 (With thanks to Trivial Pursuit) In what state is the center 
of gravity of the United States — the "geographical center 11 or 
centroid? 

24 Pappus (an ancient Greek) noticed that the volume is 

V = j 27ry(strip width) dy = 2nM x = 2 nyM 

when a region of area M is revolved around the x axis. In the 
first step the solid was cut into * 




25 Use this theorem of Pappus to find the volume of a torus. 
Revolve a disk of radius a whose center is at height y = h > a. 

16 Rotate the triangle of Example 3 around the x axis and 
find the volume of the resulting cone — first from V = 2nyM, 
second from $nr 2 h. 

27 Fi nd M x and M y for a thin wire along the semicircle 

y = — x 2 . Take p = 1 so M = length = tl 

28 A second theorem of Pappus gives A = 2nyL as the surface 
area when a wire of length L is rotated around the x axis. 
Verify his formula for a horizontal wire along y = 3 (x = 0 
to x = L) and a vertical wire (y = 1 to y = L 4- 1)* 


length L 



polar - 


>/n 



(r 3 


29 The surface area of a sphere is A = 4 n when r = 1, So A — 

In yL leads to y = for the semicircular wire in 

Problem 27. 

30 Rotating y = mx around the x axis between x=^0 and 

x = 1 produces the surface area A = 

31 Put a mass m at the point (x, 0). Around the origin the 

torque from gravity is the force mg times the distance x. This 
equals g times the mx. 

32 If ten equal forces F are alternately down and up at 

x=l,2 10, what is their torque? 

33 The solar system has nine masses m n at distances r„ with 
angular velocities (o n . What is the moment of inertia around 
the sun? What is the rotational energy? What is the torque 
provided by the sun? 

34 The disk x 2 + y 2 < a 2 has / 0 = J* r 2 lnr dr = Why is 

this different f rom l Y in Problem 20? Find the radius of 
gyration r = (The rotational energy \t Q to z equals 

\Mr 2 (n 2 — when the whole mass is turning at radius r.) 

Questions 35-42 come from the moment of inertia experiment* 

35 A solid cylinder of radius r is assembled from hollow cylin- 
ders of length /, radius x t and volume (27rx)(/)(dx). The solid 
cylinder has 

mass M =f 0 Inxlp dx and / = x 2 2?ix/p dx. 

With p — 1 find M and / and J = //Mr 2 , 

36 Problem 14.4.40 finds J = 2/5 for a solid ball. It is less 

than J for a solid cylinder because the mass of the ball is 
more concentrated near 

37 Problem 14.4.39 finds J = * J* sin 3 #=. for a 

hollow ball. The four rolling objects finish in the order 


38 By varying the density of the ball how could you make it 
roll faster than any of these shapes? 

39 Answer Question 2 about the experiment. 

40 For a vertical drop of y, equation (12) gives the velocity 
along the plane: v 2 = 2gyj(l + J). Thus v — cy V2 for c = 
The vertical velocity is dyjdt = v sin x: 

dyjdt = cy 1/2 sin a and [ y~ 1/2 dy = f e sin a dt * 

Integrate to find y(f). Show that the bottom is reached 
(y = h) at time T = 2^/h/c sin oc. 

41 What is the theoretical ratio of the four finishing times? 

42 True or false: 

(a) Basketballs roll downhill faster than baseballs* 

(b) The center of mass is always at the centroid* 

(c) By putting your arms up you reduce l x and I y . 

(d) The center of mass of a high jumper goes over the bar 
{on successful jumps). 
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3 Applications of the integral 


8.6 Force, Work, and Energy 

Chapter 1 introduced derivatives dfjdt and dfjdx , The independent variable could be 
t or x . For velocity it was natural to use the letter r. This section is about two 
important physical quantities — force and work — for which x is the right choice. 

The basic formula is W= Fx. Work equals force times distance moved (distance in 
the direction of F). With a force of 100 pounds on a car that moves 20 feet, the work 
is 2000 foot-pounds. If the car is rolling forward and you are pushing backward, the 
work is —2000 foot-pounds. If your force is only 80 pounds and the car doesn’t 
move, the work is zero. In these examples the force is constant. 

W^Fx is completely parallel to /= vt. When v is constant, we only need multi- 
plication. It is a changing velocity that requires calculus. The integral J u(r) dt adds 
up small multiplications over short times. For a changing force, we add up small 
pieces of work F dx over short distances: 

W = Fx (constant force) W = J F(x) dx (changing force). 

In the first case we lift a suitcase weighing F = 30 pounds up x = 20 feet of stairs. 
The work is W— 600 foot-pounds. The suitcase doesn’t get heavier as we go up — it 
only seems that way. Actually it gets lighter (we study gravity below). 

In the second case we stretch a spring, which needs more force as x increases. 
Hooke's law says that F(x) = kx. The force is proportional to the stretching distance x. 
Starting from x = 0, the work increases with the square of x: 

F= kx and W= J* kx dx = \kx 2 . (1) 

In metric units the force is measured in Newtons and the distance in meters. The unit 
of work is a Newton-meter (a joule). The 600 foot-pounds for an American suitcase 
would have been about 800 joules in France, 

EXAMPLE 1 Suppose a force of F = 20 pounds stretches a spring 1 foot. 

(a) Find k. The elastic constant is k = Fjx — 20 pounds per foot, 

(b) Find W. The work is ^kx 1 = 5 *20* l 2 = 10 foot-pounds, 

(c) Find x when F = — 10 pounds. This is compression not stretching: x = - \ foot. 

Compressing the same spring through the same distance requires the same work. For 
compression x and F are negative. But the work W=jkx 2 is still positive. Please 
note that W does not equal kx times x! That is the whole point of variable force 
(change Fx to J F(x) dx). 

May I add another important quantity from physics? It comes from looking at the 
situation from the viewpoint of the spring. In its natural position, the spring rests 
comfortably. It feels no strain and has no energy. Tension or compression gives it 
potential energy . More stretching or more compression means more energy. The 
change in energy equals the work. The potential energy of the suitcase increases by 
600 foot-pounds, when it is lifted 20 feet. 

Write F(x) for the potential energy. Here x is the height of the suitcase or the 
extension of the spring. In moving from x = a to x = 6, work = increase in potential : 

W-fcF(x)dx=V(b)~V(a). (2) 

This is absolutely beautiful. The work W is the definite integral. The potential V is 
the indefinite integral. If we carry the suitcase up the stairs and back down, our total 
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work is zero . Wc may fee] tired, but the trip down should have given back our energy. 
(It was in the suitcase.) Starting with a spring that is compressed one foot, and ending 
with the spring extended one foot, again we have done no work, V= jkx 2 is the same 
for x = — 1 and x = 1 . But an extension from x = 1 to x — 3 requires work: 

W= change in V = ^k(3) 2 - jk(\) 2 . 

Indefinite integrals like V come with a property that we know well. They include 
an arbitrary constant C. The correct potential is not simply \kx 2 , it is jkx 2 + C. To 
compute a change in potential, we don't need C. The constant cancels. But to deter- 
mine V itself, we have to choose C. By fixing V = 0 at one point, the potential 
is determined at all other points. A common choice is V=0 at x = 0. Sometimes 
K = 0 at x = x (for gravity). Electric fields can be “grounded” at any point. 

There is another connection between the potential V and the force F, According 
to (2), V is the indefinite integral of F. Therefore F(x) is the derivative of E(x). The 
fundamental theorem of calculus is also fundamental to physics: 

force exerted on spring: F= dVjdx (3a) 

force exerted by spring: F = — dVjdx (3b) 

Those lines say the same thing. One is our force pulling on the spring, the other is 
the “restoring force” pulling back. (3a) and (3b) are a warning that the sign of F 
depends on the point of view. Electrical engineers and physicists use the minus sign. 
In mechanics the plus sign is more common. It is one of the ironies of fate that 
F = V\ while distance and velocity have those letters reversed: v =/'. Note the change 
to capital letters and the change to x. 



fig. 8.16 Stretched spring; suitcase 20 feet up; moon of mass m; oscillating spring. 


EXAMPLE 2 Newton's law of gravitation {inverse square law): 

force to overcome gravity “ GMmjx 2 force exerted by gravity = — GMmjx 1 

An engine pushes a rocket forward. Gravity pulls it back. The gravitational constant 
is G and the Earth's mass is Ai. The mass of the rocket or satellite or suitcase is m, 
and the potential is the indefinite integral: 

V{x) = j F(x) dx= - GMmjx + C. (4) 

Usually C = 0, which makes the potential zero at x = oc. 

Remark When carrying the suitcase upstairs, x changed by 20 feel. The weight was 
regarded as constant — which it nearly is. But an exact calculation of work uses the 
integral of F(x), not just the multiplication 30 times 20. The serious difference comes 
when the suitcase is carried to x = oc. With constant force that requires infinite work. 
With the correct (decreasing) force, the work equals V at infinity (which is zero) minus 
V at the pickup point x 0 . The change in K is W = GMmjx 0 . 
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8 Applications of the Integral 


KINETIC ENERGY 

This optional paragraph carries the physics one step further. Suppose you release the 
spring or drop the suitcase. The external force changes to F = 0. But the internal 
force still acts on the spring, and gravity still acts on the suitcase. They both start 
moving. The potential energy of the suitcase is converted to kinetic energy , until it 
hits the bottom of the stairs. 

Time enters the problem, either through Newton’s law or Einstein's: 

(A ,r ek>f0fl) F = mu = m ^ (Einstein) F = - y (mvf (5) 

dt (it 

Here we stay with Newton, and pretend the mass is cons tant. Exe rcise 21 follows 
Einstein; the mass increases with velocity. There m = m 0 /yjl v 2 /c 2 goes to infinity 
as v approaches c t the speed of light. That correction comes from the theory of 
relativity, and is not needed for suitcases. 

What happens as the suitcase falls? From x = a at the top of the stairs to x = b at 
the bottom, potential energy is lost. But kinetic energy jtnv 2 is gained, as we see from 
integrating Newton’s law: 

dv dv dx dv 

force t -m — = m- — — = mv-— 
dt dx dx dx 


work 


F dx = 


dv 1 , 1 . 

mv — dx= - mv (b) — - mv (a), 
dx 2 2 


This same force F is given by — dVjdx* So the work is also the change in V: 


work 



dx = - V(h) + V(a). 


Since (6) = (7), the total energy \mv 2 + V (kinetic plus potential) is constant: 

K(fr) = -imr 2 (a)+ V{q). 

This is the law of conservation of energy . The total energy is conserved. 


( 6 ) 

V) 

( 8 ) 


EXAMPLE 3 Attach a mass m to the end of a stretched spring and let go. The spring s 

energy V = jkx 2 is gradually converted to kinetic energy of the mass. At x = 0 the 

change to kinetic energy is complete: the original jkx 2 has become jmv 2 . Beyond 
x = 0 the potential energy increases, the force reverses sign and pulls back, and kinetic 
energy is lost. Eventually all energy is potential — when the mass reaches the other 
extreme. It is simple harmonic motion, exactly as in Chapter 1 (where the mass was 
the shadow of a circling ball). The equation of motion is the statement that the rate 
of change of energy is zero (and we cancel i; = dx/dt): 

mv 1 -f - kx 2 ) = mr — kx — - = 0 or m - + kx — 0, (9) 

2 J dt dt dr 

That is F = tna in disguise. For a spring, the solution x = cos yf 'k/m t will be found 
in this book. For more complicated structures, engineers spend a billion dollars a 
year computing the solution. 


-i 

dt \2 
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PRESSURE AND HYDROSTATIC FORCE 

Our forces have been concentrated at a single points. That is not the case for pressure. 
A fluid exerts a force all over the base and sides of its container. Suppose a water 
tank or swimming pool has constant depth h (in meters or feet). The water has weight- 
density w fs 9800 N/m 3 ^ 62 lb/ft 3 . On the base, the pressure is w times h. The force 
is wh times the base area A: 

F = (pounds or Newtons) p = FI A = wh (lb/ft 2 or N/m 2 ), (10) 

Thus pressure is force per unit area . Here p and F are computed by multiplication, 
because the depth h is constant. Pressure is proportional to depth (as divers know). 
Down the side wall, h varies and we need calculus. 

The pressure on the side is still wh — the same in all directions . We divide the side 
into horizontal strips of thickness Ah. Geometry gives the length 1(h) at depth h 
(Figure 8.17). The area of a strip is 1(h) Ah. The pressure wh is nearly constant on the 
strip — the depth only changes by A/i, The force on the strip is A F — whlAh, Adding 
those forces, and narrowing the strips so that Ah -* 0, the total force approaches an 
integral: 

total force F = j whl(h) dh (It) 



Fig. 0.17 Water tank and dam: length of side strip = l t area of layer = A. 


EXAMPLE 4 Find the total force on the trapezoidal dam in Figure 8.17. 

The side length is i = 60 when h = 0, The depth h increases from 0 to 20. The main 
problem is to find / at an in-between depth h. With straight sides the relation is 
linear: / = 60 + ch. We choose c to give / = 50 when h — 20. Then 50 — 60 4- c(20) 
yields c — — 7, 

The total force is the integral of whl. So substitute /= 60 - jh: 

F = ff vWi(60 - H) dh = [30w/i 2 - £wA 3 ]f = 12000* - £(8000*). 

With distance in feet and w — 62 lb/ft 3 , F is in pounds. With distance in meters and 
w = 9800 N/m 3 , the force is in Newtons. 

Note that (weight-density w) = (mass-density p) times (g) - (1000) (9,8). These SI 
units were chosen to make the density of water at 0°C exactly p= 1000 kg/m 3 . 

EXAMPLE 5 Find the work to pump water out of a tank . The area at depth h is A\h). 

Imagine lifting out one layer of water at a time . The layer weighs wA(h)Ah. The 
work to lift it to the top is its weight times the distance h . or whA(h)Ah . The work 
to empty the whole tank is the integral: 

W= J wM(fe) dh. 


( 12 ) 
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Suppose the tank is the bottom half of a sphere of radius R . The cross-sectional area 
at depth h is A = n{R 2 - h 2 ). Then the work is the integral (12) from 0 to R. It equals 
W— ttwR 4 /4, 

Units: w = for ce/(di stance) 3 times R 4 = (distance) 4 gives work W = (force)(distance). 


6.6 EXERCISES 


Read-through questions 

Work equals o times b , For a spring the force is 
F = c , proportional to the extension x (this is d law). 
With this variable force, the work in stretching from 0 to x is 
W = j 9 = f > This equals the increase in the g 

energy V. Thus IF is a h integral and V is the corre- 
sponding l integral, which includes an arbitrary i . 
The derivative dVjdx equals k . The force of gravity is 
F = i and the potential is V= m . 

In falling, V is converted to n energy K = o . The 

total energy K -f V is p (this is the law of q when 

there is no external force). 

Pressure is force per unit r Water of density w in 
a pool of depth h and area A exerts a downward force 
F = t on the base. The pressure is p = t On the 

sides the g is still vvh at depth h , so the total force is 
j whl dh, where l is v . In a cubic pool of side s, the force 
on the base is F = w , the length around the sides is 
/ = x , and the total force on the four sides is F = v 

The work to pump the water out of the pool is 

W — J whA dh = i < 

1 (a) Find the work W when a constant force F = 12 pounds 
moves an object from x = .9 feet to x = 1,1 feet. 

(b) Compute W by integration when the force F = 12/x 1 
varies with x. 

2 A 12-inch spring is stretched to 15 inches by a force of 75 
pounds. 

(a) What is the spring constant k in pounds per foot ? 

(b) Find the work done in stretching the spring. 

(c) Find the work to stretch it 3 more inches. 

3 A shock-absorber is compressed 1 inch by a weight of l 
ton. Find its spring constant k in pounds per foot. What 
potential energy is stored in the shock-absorber? 

4 A force F = 20x — x 3 stretches a nonlinear spring by x. 

(a) What work is required to stretch it from x — 0 to 
x = 2? 

(b) What is its potential energy V at x = 2, if K(0) = 5? 

(c) What is k = dF/dx for a small additional stretch at 
x = 2? 


5 (a) A 120-lb person makes a scale go down x inches. How 
much work is done? 

(b) If the same person goes x inches down the stairs, how 
much potential energy is lost? 

6 A rocket burns its 100 kg of fuel at a steady rate to reach 
a height of 25 km, 

(a) Find the weight of fuel left at height h. 

(b) How much work is done lifting fuel? 

7 Integrate to find the work in winding up a hanging cable 
of length 100 feet and weight density 5 Ib/ft. How much addi- 
tional work is caused by a 200-pound weight hanging at the 
end of the cable? 

8 The great pyramid (height 500' — you can see it from 
Cairo) has a square base 800' by 800 r . Find the area A at 
height h . If the rock weighs w = 100 1b/ft 3 , approximately how 
much work did it take to lift all the rock? 

9 The force of gravity on a mass m is F = -GMm/x 2 . With 
G — 6* 10 17 and Earth mass M = 6 • 10 24 and rocket mass 
m = 1000, compute the work to lift the rocket from x = 6400 
to x = 6500. (The units are kgs and kms and Newtons, giving 
work in Newton-kms,) 

10 The approximate work to lift a 30-pound suitcase 20 feet 
is 600 foot-pounds. The exact work is the change in the poten- 
tial K=— GmM/x. Show r that AK is 600 times a correction 
factor R Z I(R 2 - 10 2 ), when x changes from R — 10 to R + 10. 
(This factor is practically 1* when R = radius of the Earth,) 

11 Find the work to lift the rocket in Problem 9 from 
x = 6400 out to x = oo. If this work equals the original 
kinetic energy Jmu 2 , what was the original v ( the escape 
velocity }? 

12 The kinetic energy ?mv 2 of a rocket is converted into 
potential energy —GMm/x. Starting from the Earth's radius 
x = R y wha t x docs the rocket reach? If it reaches x = oo show 
that v = ^j2GMjR. This escape velocity is 25,000 miles per 
hour, 

13 It takes 20 foot-pounds of work to stretch a spring 2 feet. 
How much work to stretch it one more foot? 

14 A barrel full of beer is 4 feet high with a 1 foot radius and 
an opening at the bottom. How much potential energy is lost 
by the beer as it comes out of the barrel? 
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15 A rectangular dam is 40 feet high and 60 feet wide. Com- 
pute the total side force F on the dam when (a) the water is 
at the top (b) the water level is halfway up, 

16 A triangular dam has an 80-meter base at a depth of 30 
meters. If water covers the triangle, find 

(a) the pressure at depth h 

(b) the length l of the dam at depth h 

(c) the total force on the dam, 

17 A cylinder of depth H and cross-sectional area A stands 
full of water (density w), (a) Compute the work W = j wAh dh 
to lift all the water to the top. (b) Check the units of W. 
(c) What is the work W if the cylinder is only half full? 

18 In Problem 17, compute W in both cases if H = 20 feet, 
w = 62 Jb/fl 3 , and the base is a circle of radius r = 5 feet, 

19 How much work is required to pump out a swimming 
pool, if the area of the base is 800 square feet, the water is 4 
feet deep, and the top is one foot above the water level? 


20 For a cone-shaped tank the cross-sectional area increases 
with depth: A = nr 2 h z IH 2 . Show that the work to empty it is 
half the work for a cylinder with the same height and base. 
What is the ratio of volumes of water? 

21 In relativity the mass is m = — v 2 jc 2 . Find the cor- 

rection factor in Newton's equation F = m 0 a to give Einstein’s 
equation F = d(mv)jdt = (d{mv)jdv){dvjdt) = m 0 n, 

22 Estimate the depth of the Titanic, the pressure at that 
depth, and the force on a cabin door. Why doesn't every door 
collapse at the bottom of the Atlantic Ocean? 

23 A swimming pool is 4 meters wide, 10 meters long, and 2 
meters deep. Find the weight of the water and the total force 
on the bottom. 

24 If the pool in Problem 23 has a shallow end only one 
meter deep, what fraction of the water is saved? Draw a cross- 
section (a trapezoid) and show the direction of force on the 
sides and the sloping bottom. 

25 In what ways is work like a definite integral and energy 

like an indefinite integral? Their derivative is the - 
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CHAPTER 9 


Polar Coordinates and Complex 

Numbers 



9.1 Polar Coordinates 


Up to now, points have been located by their x and y coordinates. But if you were 
a flight controller, and a plane appeared on the screen, you would not give its 
position that way. Instead of x and y, you would read off the direction of the plane 
and its distance. The direction is given by an angle 6 . The distance is given by a 
positive number r. Those are the polar coordinates of the point, where x and y are 
the rectangular coordinates. 

The angle 0 is measured from the horizontal. Suppose the distance is 2 and the 
direction is 30° or n/6 (degrees preferred by flight controllers, radians by mathemati- 
cians). A pilot looking along the x axis would give the plane’s direction as “11 
o’clock.” This totally destroys our system of units, by measuring direction in hours. 
But the angle and the distance locate the plane. 

How far to a landing strip at r = 1 and 9 = — n/2? For that question polar coordi- 
nates are not good. They are perfect for distance from the origin (which equals r), 
but for most other distances I would switch to x and y. It is extremely simple to 
determine x and y from r and 0, and we will do it constantly. The most used formulas 
in this chapter come from Figure 9.1 — where the right triangle has angle 0 and 
hypotenuse r. The sides of that triangle are x and y: 

x — r cos 9 and y — r sin 9. (1) 

The point at r = 2, 9 = n/6 has x = 2 cos(n/6) and y = 2 sin(n/6). The cosine of 
it/ 6 is yi/2 and the sine is So x = and y= 1. Polar coordinates convert 
easily to xy coordinates — now we go the other way. 

Always x 2 + y 2 = r 2 . In this example (y/l) 2 + (l) 2 = (2) 2 . Pythagoras produces 
r from x and y. The direction 0 is also available, but the formula is not so beautiful: 

r - Jx 1 + y 2 and tan 9=- and (almost) 6 = tan" 1 -. (2) 

v x x 

Our point has y/x = 1/^/3. One angle with this tangent is 6 = tan -1 (l/^) — n/6. 
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Rg. 9.1 Polar coordinates r, 0 and rectangular coordinates x = r cos 0, y = r sin 0, 

EXAMPLE 1 Point B in Figure 9.1c is at a negative angle 8 = — rc/4. The x coordinate 
rcos(-7t/4) is the same as rcos7t/4 (the cosine is even). But the y coordinate 
r sin( — jt/ 4) is negative. Computing r and 0 from x = 1 and y = t, the distance is 
r - yr+1 and tan 0 is - 1/1. 

Warning To any angle 0 we can add or subtract 2 n — which goes a full 360° circle 
and keeps the same direction. Thus — rc/4 or - 45° is the same angle as 7rc/4 or 315°. 
So is 15rc/4 or 675°. 

If we add or subtract 180°, the tangent doesn’t change. The point (1, - 1) is on the 
-45° line at r= yfl. The point (-1, 1) is on the 135° line also with r= Jl, Both 
have tan 0 = — 1. We had to write "almost” in equation (2), because a point has many 
0’s and two points have the same r and tan 0. 

Even worse, we could say that — —1) is on the 135° line but at a negative 

distance r—— v /2. A negative r carries the point backward along the 135° line, which 
is forward to B . In giving the position of B , I would always keep r > 0. But in drawing 
the graph of a polar equation, r < 0 is allowed. We move now to those graphs. 

THE CIRCLE r=cos0 

The basis for Chapters 1-8 was y =/(x). The key to this chapter is r = F{0). That is 
a relation between the polar coordinates, and the points satisfying an equation like 
r — cos 8 produce a polar graph . 

It is not obvious why r = cos 8 gives a circle. The equations r = cos 28 and 
r = cos 2 0 and r = 1 4- cos 0 produce entirely different graphs— not circles. The direct 
approach is to take 0=0°, 30°, 60°, ... and go out the distance r = cos 8 on each ray. 
The points are marked in Figure 9.2a, and connected into a curve. It seems to be a 
circle of radius j, with its center at the point (j, 0). We have to be able to show 
mathematically that r = cos 6 represents a shifted circle . 

One point must be mentioned. The angles from 0 to n give the whole circle. The 
number r = cos 8 becomes negative after n/2, and we go backwards along each ray. 



Rg.9.2 The circle r = cos 6 and the switch to x and y. The circle r — sin 8. 
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At 6 = n (to the left of the origin) the cosine is — 1. Going backwards brings us to 
the same point as 8 = 0 and r = + 1 — which completes the circle. 

When 9 continues from n to 2n we go around again. The polar equation gives the 
circle twice. {Or more times, when 0 continues past 2 ti.) If you don’t like negative r’s 
and multiple circles, restrict 9 to the range from -njl to njl. We still have to see 
why the graph of r = cos 9 is a circle. 

Method 1 Multiply by r and convert to rectangular coordinates x and y : 

r = cos 9 => r 2 = r cos 9 => x 2 + y 1 = x. (3) 

This is a circle because of x 2 + y 1 . From rewriting as (jc - \) 2 + y 2 = we recognize 
its center and radius. Center at x — % and y - 0; radius Done. 

Method 2 Write x and y separately as functions of 0. Then 0 is a “parameter”: 

x = r cos 9 = cos 2 9 and y = r sin 9 = sin 0 cos 9 . (4) 

These are not polar equations but parametric equations. The parameter 0 is the angle, 
but it could be the time — the curve would be the same. Chapter 12 studies parametric 
equations in detail — here we stay with the circle. 

To find the circle, square x and y and add. This produces x 2 + y 1 — x in Problem 26. 
But here we do something new: Start with the circle and find equation (4). In case you 
don’t reach Chapter 12, the idea is this. Add the vectors OC to the center and CP 
out the radius: 

The point P in Figure 9.2 has (jc, y) — OC + CP — 0) + (j cos t, j sin t). 

The parameter f is the angle at the center of the circle. The equations are 
jc = £ + £ cos t and y = \ sin t. A trigonometric person sees a double angle and sets 
t — 20. The result is equation (4) for the circle: 

jc = £ + j cos 20 = cos 2 0 and y = y sin 20 = sin 0 cos 0. (5) 

This step rediscovers a basic theorem of geometry: The angle t at the center is twice 
the angle 9 at the circumference . End of quick introduction to parameters. 

A second circle is r = sin 0, drawn in Figure 9.2c. A third circle is r = cos 0 + sin 0, 
not drawn. Problem 27 asks you to find its xy equation and its radius. All calculations 
go back to jc = r cos 9 and y = r sin 0 — the basic facts of polar coordinates! The last 
exercise shows a parametric equation with beautiful graphs, because it may be pos- 
sible to draw them now. Then the next section concentrates on r — F(9 ) — and goes 
far beyond circles. 


9.1 EXERCISES 


Read-through questions 

Polar coordinates r and 6 correspond to x = a and y = 
b The points with r > 0 and 0 = ti are located c 

The points with r = 1 and 0 ^ 0 < rc are located d , Re- 
versing the sign of 0 moves the point (*, y) to e . 

Given x and y, the polar distance is r = i The tangent 

of 0 is q The point (6, 8) has r = h and 0 = i 
Another point with the same 0 is i Another point with 
the same r is k . Another point with the same r and tan 0 

is 1 


The polar equation r — cos 0 produces a shifted m . The 
top point is at 0 = n . which gives r = o . When 0 
goes from 0 to 2rc, we go p times around the graph. 
Rewriting as r 2 = r cos 0 leads to the xy equation q Sub- 
stituting r = cos 0 into x = r cos 0 yields x — r and simi- 
larly v = a In this form x and y are functions of the 
t 0. 

Find the polar coordinates r > 0 and 0 ^ 0 < 2ti of these points. 

1 <x,y) = (0,l) 2 (*, y) = (—4, 0) 
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3 (x, y) = {Jl, 72) 4 (x, y) = (- 1, Jl) 

5 (x,y) = (-l, -1) <S (*,y) = (3,4) 

Find rectangular coordinates (x. y) from polar coordinates. 

7 (r, A) = (2, ir/2) 8 (r, 9} = (1, 3*/2) 

9 (r, 0) = ( 720 , n/4) 10 (r, 0) = (3*, 3*) 

11 (?\ 0H(2, -n/fi) 12 (r, 5) = (2, 

13 What is the distance from (x, y) = (y/3 t 1) to (1, — ^/5)? 

14 How far is the point r = 3, d = izj2 from r = 4, 9 = x? 

15 How far is (x, y ) = (r cos 0, r sin 0} from (X, Y) = (R cos 0, 
R sin 0)? Simplify (x — X ) 2 + (y — Y) 2 by using cos{0 — 0) — 
cos 0 cos 0 + sin 8 sin 0. 

16 Find a second set of polar coordinates (a different r or 0) 
for the points 

(r,0) = (- 1,^/2), (-i t 3*/4), (1, -n/2), (0,0), 

17 Using polar coordinates describe (a) the half-plane x > 0; 
(b) the half-plane y^;0; (c) the ring with x 2 + y 2 between 4 
and 5; (d) the wedge x ^ |y|. 

18 True or false, with a reason or an example: 

(a) Changing to — r and —0 produces the same point. 

(b) Each point has only one r and 0, when r < 0 is not 
allowed. 

(c) The graph of r = 1/sin 0 is a straight line, 

19 From x and 0 find y and r, 

20 Which other point has the same r and tan 0 as x = yfi, 
y= 1 in Figure 9,1b? 

21 Convert from rectangular to polar equations: 

(a)j> = x (b)x + y = l (c) x* + = x + y 

22 Show that the triangle with vertices at (0, 0), (r j , 0!), and 

{r z ,02) has area A = ir x r 7 sin(0 3 — 0!). Find the base and 
height assuming 0 ^ < 0 2 < n. 

Problems 23-28 are about polar equations that give circles. 

23 Convert r = sin 0 into an xy equation. Multiply first by r. 


24 Graph r = sin 0 at 0 = 0 P , 30°, 60*, 360°, These thirteen 

values of 0 give different points on the graph. What 

range of & s goes once around the circle? 

25 Substitute r = sin 0 into x = r cos 0 and y = r sin 9 to find 
x and y in terms of the parameter 0, Then compute x 2 + y 2 
to reach the xy equation. 

26 From the parametric equations x = cos 2 0 and y = 
sin 0 cos 0 in (4), recover the xy equation. Square, add, 
eliminate 0. 

27 (a) Multiply r — cos 0 + sin 0 by r to convert into an xy 
equation, (b) Rewrite the equation as (x — £) 2 -h {y — i) 2 = R 2 
to find the radius R. (c) Draw the graph. 

28 Find the radius of r ~ a cos 0 + b sin 0, (Multiply by r,) 

29 Convert x + y = 1 into an r6 equation and solve for r. 
Then substitute this r into x = r cos 0 and y = r sin 0 to find 
parametric equations for the line, 

36 The equations x = cos 2 0 and y = sin 2 0 also lead to 
x + y = l — but they are different from the answer to 
Problem 29, Explanation: 0 is no longer the polar angle and 
we should have written L Find a point x = cos 2 0, y — sin 2 0 
that is not at the angle 0. 

31 Convert r = cos 2 0 into an xy equation (of sixth degree!) 

32 If you have a graphics package for parametric curves, 
graph some hypocycloids . The equations are x = 
(1 - b) cos t + b cos(l - b)tjb,y — (1 - b)sin t — b sin(l - b)tfb. 
The figure shows b =-fo and part of 6 = .31831. 




9.2 Polar Equations and Graphs 



The most important equation in polar coordinates, by far, is r = 1, The angle 0 does 
not even appear. The equation looks too easy, but that is the point! The graph is a 
circle around the origin (the unit circle). Compare with the line x = 1. M ore important, 
compare the simplicity of r — 1 with the complexity of y = ± 1 — x 1 . Circles are so 

common in applications that they created the need for polar coordinates. 
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This section studies polar curves r = F(6). The cardioid is a sentimental favorite — 
maybe parabolas are more practical. The cardioid is r= 1 + cos0 ? the parabola is 
r- 1/(1 + cos 8). Section 12.2 adds cycloids and astroids. A graphics package can 
draw them and so can we. 

Together with the circles r = constant go the straight lines 0 = constant. The equa- 
tion 8 — tt/ 4 is a ray out from the origin, at that fixed angle. If we allow r < 0, as we 
do in drawing graphs, the one-directional ray changes to a full line. Important: The 
circles are perpendicular to the rays . We have “orthogonal coordinates” — more inter- 
esting than the x — y grid of perpendicular lines. In principle x could be mixed with 
0 (non-orthogonal), but in practice that never happens. 

Other curves are attractive in polar coordinates — we look first at five examples. 
Sometimes we switch back to x — r cos 8 and y - r sin 8 , to recognize the graph. 

EXAMPLE \ The graph of r = 1/cos 0 is the straight line x — 1 (because r cos 8 - 1). 

EXAMPLE 2 The graph of r - cos 28 is the four-petal flower in Figure 9.3. 

The points at 8 — 30° and — 30 Q and 150° and — 150° are marked on the flower. They 
all have r- cos 28 = £. There are three important symmetries — across the x axis , 
across the y axis , and through the origin. This four-petal curve has them all. So does 
the vertical flower r “ sin 28 — but surprisingly, the tests it passes are different. 

(Across the x axis: y to -y) There are two ways to cross. First, change 8 to - 8 . 
The equation r = cos 28 stays the same. Second, change 8 to n - 8 and also r to — r. 
The equation r = sin 28 stays the same. Both flowers have x axis symmetry. 

(Across the y axis : x to -x) There are two ways to cross. First, change 8 to n - 8. 
The equation r = cos 28 stays the same. Second, change 8 to — 8 and r to — r. Now 
r = sin 28 stays the same (the sine is odd). Both curves have y axis symmetry. 

(Through the origin) Now we change r to — r or 0 to 8 + n. The flower equations 
pass the second test only: cos 2(8 + tt) = cos 28 and sin 2(9 + n) ~ sin 28. Every 
equation r 2 = F(G) passes the first test, since (-r) 2 = r 2 . 

The circle r= cos 8 has x axis symmetry, but not y or r. The spiral r = 8 } has 
y axis symmetry, because — r = (-0) 3 is the same equation. 

Question What happens if you change r to — r and also change 8 to 8 + 7t! 
Answer Nothing — because (r, 0) and (- r, 8 + n) are always the same point. 




Hg. 9.3 The four-petal flower r = cos 26 and the spiral r = 9 (r > 0 in red). 
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EXAMPLE 3 The graph of r = 8 is a spiral of Archimedes — or maybe two spirals. 

The spiral adds new points as 9 increases past In. Our other examples are 
“periodic” — 0 = 2n gives the same point as 6 “ 0. A periodic curve repeats itself. The 
spiral moves out by 2n each time it comes around. If we allow negative angles and 
negative r = 0, a second spiral appears, 

EXAMPLE 4 The graph of r — 1 + cos 8 is a cardioid. It is drawn in Figure 9.4c. 

The cardioid has no simple xy equation. Still the curve is very attractive. It has a 
cusp at the origin and it is heart-shaped (hence its name). To draw it, plot r = 1 + cos 0 
at 30° intervals and connect the points. For this curve r is never negative, since cos 0 
never goes below —1. 

It is a curious fact that the electrical vector in your heart almost traces out a 
cardioid. See Section 11,1 about electrocardiograms. If it is a perfect cardioid you are 
in a little trouble. 



r = I + -2- cos 0 

no dimple 


Fig, 9,4 




r = 1 +-^~ cos 0 

dimple 

Limagons r = 1 4- h cos 0 , including a cardioid and Mars seen from Earth. 



EXAMPLE 5 The graph of r = 1 + b cos 0 is a limagon (a cardioid when b = 1). 

Limagon (soft c) is a French word for snail — not so well known as escargot but just 
as inedible, (/ am only referring to the shell. Excusez-moil) Figure 9,4 shows how a 
dimple appears as b increases. Then an inner loop appears beyond b = 1 (the cardioid 
at b = 1 is giving birth to a loop). For large b the curve looks more like two circles. 
The limiting case is a double circle, when the inner loop is the same as the outer 
loop. Remember that r = cos 0 goes around the circle twice. 

We could magnify the lima^on by a factor c, changing to r — c(l + b cos 0). We 
could rotate 180° to r= 1 — h cos 8. But the real interest is whether these figures arise 
in applications, and Donald Saari showed me a nice example. 

Mars seen from Earth The Earth goes around the Sun and so docs Mars. Roughly 
speaking Mars is times as far out, and completes its orbit in two Earth years. 

We take the orbits as circles: r = 2 for Earth and r = 3 for Mars. Those equations 
tell where but not when. With time as a parameter, the coordinates of Earth and Mars 
are given at every instant f: 

x t = 2 cos y E = 2 sin 2 nt and 


= 3 cos 7tf, v M = 3 sin nt> 
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At t = 1 year, the Earth completes a circle (angle = 2n) and Mars is halfway. 

Now the key step. Subtract to find the position of Mars relative to Earth: 

jc m _ e = 3 cos nt — 2 cos 2 nt and y M _ E = 3 sin nt “ 2 sin 2nt. 

Replacing cos 2nt by 2 cos 2 7ir - 1 and sin 2nt by 2 sin nt cos nt , this is 

x M _ E = (3 - 4 cos nt)oos nt + 2 and y M _ E = (3 - 4 cos 7rf)sin nt . 

Seen from the Earth, Mars does a loop in the sky! There are two fs for which 
3 - 4 cos nt — 0 (or cos nt = *). At both times, Mars is two units from Earth (x M _ E = 
2 and y M _ E = 0). When we move the origin to that point, the 2 is subtracted away — 
the M — E coordinates become x — r cos nt and y — r sin nt with r = 3-4 cos nt. That 
is a lima^on with a loop, like Figure 9.4d. 

Note added in proof I didn't realize that a 3-to-2 ratio is also responsible for heating 
up two spots on opposite sides of Mercury. From the newspaper of June 13, 1990: 

“Astronomers today reported the first observations showing that Mercury 
has two extremely hot spots. That is because Mercury, the planet closest to the 
Sun, turns on its axis once every 59.6 days, which is a day on Mercury. It goes 
around the sun every 88 days, a Mercurian year. With this 3-to-2 ratio between 
spin and revolution, the Sun appears to stop in the sky and move backward , 
describing a loop over each of the hot spots.” 

CONIC SECTIONS IN POLAR COORDINATES 

The exercises include other polar curves, like lemniscates and 200-petal flowers. But 
get serious. The most important curves are the ellipse and parabola and hyperbola . 
In Section 3.5 their equations involved 1, x, y 7 x 2 , xy, y 1 . With one focus at the origin, 
their polar equations are even better. 


9A The graph of r — 1 + e cos 9) is a conic section with “eccentricity” e: 

circle if e — 0 ellipse if 0 < e < 1 parabola if e — 1 hyperbola if e > 1. 


EXAMPLE 6 (e = 1) The graph of r= 1/(1 + cos 9) is a parabola. This equation is 
r + r cos 9 = 1 or r = 1 - x. Squaring both sides gives x 2 + y 2 = 1 - 2x + x 1 . Cancel- 
ing x 2 leaves y 2 = 1 - 2x, the parabola in Figure 9.5b, 

The amplifying factor A blows up all curves, with no change in shape. 

EXAMPLE 7 (e - 2) The same steps lead from r( t + 2 cos 9) - 1 to r — 1 - 2x. Squar- 
ing gives x 2 + y 2 = 1 — Ax + 4x 2 and the x 2 terms do not cancel. Instead we have 
y 2 — 3x 2 = 1 — Ax. This is the hyperbola in Figure 9.5c, with a focus at (0, 0). 

The hyperbola y 2 - 3x 2 - 1 (without the —4x) has its center at (0, 0). 

EXAMPLE 8 (e = J) The same steps lead from r(l + i cos 9) = 1 to r = 1 - %x. Squar- 
ing gives the ellipse x 2 + y 2 — \ — x + *x 2 . Polar equations look at conics in a new 
way, which happens to match the sun and planets perfectly. The sun at (0, 0) is not 
the center of the system , but a focus. 

Finally e = 0 gives the circle r = 1. Center of circle = both foci = (0* 0). 
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fig. 9.5 r — 1/(1 + e cos 0) is an ellipse for e = a parabola for e = 1, a hyperbola for e = 2. 


The directrix The figure shows the line d (the “directrix*’) for each curve. All points 
P on the curve satisfy r = j/ , i r | — e\Pd\. The distance to the focus is e times the distance 
to the directrix, {e is still the eccentricity, nothing to do with exponentials.) A geometer 
would start from this property r = e\Pd\ and construct the curve. We derive the 
property from the equation: 


A 

1 + e cos 6 


r + ex = A 



a) 


The directrix is the line at x = Aje> That last equation is exactly IP/] = e\Pd\. 


Notice how two numbers determine these curves. Here the numbers are A and e . 
In Section 3.5 they were a and b. (The ellipse was x 2 ja 2 + y 2 jb 2 = 1.) Using A and e 
we go smoothly from ellipses through parabolas (at e= 1) and on to hyperbolas. 
With three more numbers we can move the focus to any point and rotate the curve 
through any angle. Conics are determined by five numbers . 


9.2 EXERCISES 


Read-through questions 

The circle of radius 3 around the origin has polar equation 
q . The 45 fl line has polar equation b . Those graphs 
meet at an angle of c . Multiplying r — 4 cos 0 by r yields 
the xy equation d . Its graph is a e with center at 
f The graph of r = 4/cos 0 is the line x = q The 
equation r 2 — cos 20 is not changed when 8 —6 (symmetric 

across h ) and when 0 -* n + 6 (or r i ). The graph 
of r = 1 + cos 6 is a i 

The graph of r = A H k ) is a conic section with one 
focus at i It is an ellipse if m and a hyperbola if 
n . The equation r = 1/(1 +cos 0) leads tor + x = 1 which 
gives a o . Then r = distance from origin equals 1 — x = 
distance from p . The equations r = 3(l — x) and r = 
i(l — x) represents a q and an r « Including a shift 
and rotation, conics are determined by t numbers. 


Convert to xy coordinates to 

1 r sin 8 = 1 
3 r = 2 cos 0 
5 r = 1/(2 + cos 0) 


draw and identify these curves. 

2 Hcos 0 — sin 0) = 2 
4 r = - 2 sin 6 
6 r = l/(l +2 cos 0) 


In 7-14 sketch the curve and check for x, )\ and r symmetry. 

7 r 2 = 4 cos 20 (lemniscate) 

8 r 2 = 4 sin 20 (lemniscate) 

9 r = cos 30 (three petals) 

10 r 2 = 10 + 6 cos 40 


11 r = e e (logarithmic spiral) 

12 r = 1/0 (hyperbolic spiral) 

13 r = tan 0 

14 r = 1 — 2 sin 30 (rose inside rose) 

15 Convert r = 6 sin 6 + 8 cos 0 to the xy equation of a circle 
(what radius, what center?}. 

*16 Squaring and adding in the Mars-Earth equation gives 
*M-E + yM-E= 13— 12 cos m. The graph of r 2 = 13 - 
12 cos 0 is not at all like Figure 9.4d. What went wrong? 


In 17-23 find the points where the two curves meet 

17 r = 2 cos 0 and r = 1 + cos 0 

Warning: You might set 2 cos 0 = 1+ cos 0 to find cos 0 = 1. 
But the graphs have another meeting point — they reach it at 
different 0’s. Draw graphs to find all meeting points. 
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18 r 2 = sin 20 and r 2 = cos 20 

19 r — 1 + cos 9 and r = 1 — sin $ 

20 r = 1 4- cos 9 and r = 1 — cos 6 

21 r — 2 and r = 4 sin 20 

22 r 2 — 4 cos 0 and r = 1 — cos $ 

23 r sin 0 = 1 and r cos (9 — tt/4) = ^J2 (straight lines) 

24 When is there a dimple in r=l + 6cos0? From x = 
(1 4- b cos 0 ) cos 9 find dx/d9 and d 2 x/d6 2 at 9 = n. When that 
second derivative is negative the limagon has a dimple. 

25 How many petals for r = cos 50? For r = cos 0 there was 
one, for r = cos 20 there were four. 

26 Explain why r = cos 100 0 has 200 petals but r = cos 101 0 

only has 101, The other 101 petals are . What about 

r = cosi0? 

27 Find an xy equation for the cardioid r — 1 + cos 0. 

28 (a) The flower r = cos 26 is symmetric across the x and y 
axes. Does that make it symmetric about the origin? (Do 
two symmetries imply the third, so — r = cos 20 produces 
the same curve?) 

(b) How can r = l, 0 = rc/2 lie on the curve but fail to 
satisfy the equation? 

29 Find an xy equation for the flower r — cos 29. 

30 Find equations for curves with these properties: 

(a) Symmetric about the origin but not the x axis 

(b) Symmetric across the 45° line but not symmetric in x 
or y or r 

(c) Symmetric in x and y and r (like the flower) but 
changed when x*-+y (not symmetric across the 45° line). 

Problems 31-37 are about conic sections — especially ellipses. 

31 Find the top point of the ellipse in Figure 9,5a, by maxi- 
mizing y — r sin 0 = sin 0/(1 + i cos 0), 


32 (a) Show that all conics r = 1/(1 + e cos 0) go through 
x = 0, y = L 

(b) Find the second focus of the ellipse and hyperbola. For 
the parabola (e = 1) where is the second focus? 

33 The point Q in Figure 9.5c has y = 1. By symmetry find x 
and then r (negative!). Check that x 2 + y 2 = r 2 and |QF| = 

2\QA- 

34 The equations r = -4/(1 + e cos 0) and r = 1/(C + D cos 0) 

are the same if C = and D = For the 

mirror image across the y axis replace 0 by , This 

gives r = 1/(C — D cost?) as in Figure 12.10 for a planet 
around the sun. 

35 The ellipse r = >1/(1 ■+■ e cos 0) has length 2a on the x axis. 
Add r at 8 — 0 to r at 0 = n to prove that a = 4/(1 — e 2 ). The 
Earth’s orbit has a = 92,600,000 miles = one astronomical 
unit (AU). 

36 The maximum height b occurs when y = r sin 0 = A sin 0 / 
(1 + e cos 0) has dyjdB = 0. Show that b = y mH = Ajy/l —e 2 . 

37 Combine a and b from Problems 35-36 to find c = 
y/a 2 — b 2 = Aej{ 1 — e 2 ). Then the eccentricity e is c/a, Halley’s 
comet is an ellipse with a =18. 1 AU and b = 4.6 AU so 

e = t 


Comets have large eccentricity, planets have much smaller e: 
Mercury *21, Venus ,01, Earth .02, Mam *09, Jupiter *05, Sat- 
urn ,05, Uranus .05, Neptune ,01, Pluto *25, Kohoutek .9999. 

38 If you have a computer with software to do polar graphs, 
start with these: 

1. Flowers r= A + cos n9 for n = j, 3, 7, 8; A =0, 1, 2 

2. Petals r = (cos m0 + 4 cos n0)/cos 0, (m, a) = (5, 3), (3,5), 
(9, t)> (2* 3) 

3. Logarithmic spiral r = e 9/z * 

4. Nephroid r = 1 + 2 sin j0 from the bottom of a teacup 
5* Dr. Fay’s butterfly r = e c * i9 ~ 2 cos 49 + sin 5 {0./12) 

Then create and name your own curve. 



9.3 Slope, Length, and Area for Polar Curves 



The previous sections introduced polar coordinates and polar equations and polar 
graphs. There was no calculus! We now tackle the problems of area (integral calculus) 
and slope (differential calculus), when the equation is r = F(0). The use of F instead 
of/is a reminder that the slope is not dF/d$ and the area is not j F(9)d9. 

Start with area. The region is always divided into small pieces — what is their 
shape? Look between the angles 9 and 6 + A£? in Figure 9.6a* Inside the curve is a 
narrow wedge — almost a triangle, with A0 as its small angle. If the radius is constant 
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area 



Fig, 9.4 Area of a wedge and a circle and an intersection of circles. 



r = cos 0 


area 
(cos 2 0-(^ 


the wedge is a sector of a circle. It is a piece of pie cut at the extremely narrow angle 
A0. The area of that piece is a fraction (the angle A0 divided by the whole angle 2n) 
of the whole area nr 2 of the circle: 


area of Hedge = nr 1 = ^ r 2 A0 - - [F(tf)] a Atf* ( 1 ) 

2n 2 2 

We admit that the exact shape is not circular. The true radius F(tf) varies with 9 — 
but in a narrow angle that variation is small. When we add up the wedges and let 
A 0 approach zero, the area becomes an integral. 


9B The area inside the polar curve r- F(0) is the limit of J^r 2 A0 = £|F 2 A0: 


area = 


I 



\ [fW] 2 dd. 


( 2 ) 


EXAMPLE 1 Find the area inside the circle r = cos 9 of radius \ {Figure 9.6). 


area = 


f 1 m cos 9 sin 9 + 9 I 2 * 

- cos 2 0 dO = — - — 1 

o 2 4 J 0 


2n 

T' 


That is wrong! The correct area of a circle of radius ^ is tt/ 4. The mistake is that we 
went twice around the circle as 9 increased to 2n. Integrating from 0 to n gives nj 4. 


EXAMPLE 2 Find the area between the circles r = cos 9 and r = 

The circles cross at the points where r = cos 0 agrees with r = \. Figure 9.6 shows these 
points at ±60°, or 0— ±n/3. Those are the limits of integration, where cos 9 = ^ The 
integral adds up the difference between two wedges, one out to r — cos 9 and a smaller 
one with r~\: 

[* fJ 1 
area= - 
J -n,3 2 

Note that chopped wedges have area j(F] - F\)A8 and not jiF^ - F 2 ) 2 A8, 


(cos B) 2 




dd. 


(3) 


EXAMPLE 3 Find the area between the cardioid r = 3 + cos 9 and the circle r = 1, 


r«i2 i 

area = — [(1 + cos 0 ) 2 — 1 2 }d8 

J -nfi 2 


f limits 9 - 4- ^ where 1 + cos 9 

\ “2 


■0 


2 

) >48 
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SLOPE OF A POLAR CURVE 


Where is the highest point on the cardioid r - 1 + cos 0? What is the slope at 0 — 
tt/4? Those are not the most important questions in calculus, but still we should 
know how to answer them. I will describe the method quickly, hy switching to 
rectangular coordinates: 

x = t cos 0 — (1 + cos 0) cos 0 and y = r sin 0 = {1 + cos 0)sin 0. 

For the highest point, maximize y by setting its derivative to zero: 

dyjdO = (1 + cos 0)(cos 0) + (-sin 0)(sin 0) * 0. (3) 

Thus cos 0 -I- cos 20 = 0, which happens at 60°. The height is y = (1 + 

For the slope, use the chain rule dy/dO = (dyidx)(dxjdS): 

dy dyjdO (1 + cos 0)(cos 0) + (— sin 0)(sin 0) 

— * " , ( 4 ) 

dx dxjdd (1 + cos 0)(- sin 0) + (—sin 0) cos 0 

Equations (3) and (4) avoid the awkward (or impossible) step of eliminating 0. Instead 
of trying to find y as a function of jc, we keep x and y as functions of 0. At 0 = tt/4, 
the ratio in equation (4) yields dyfdx = - 1/(1 + yfl). 

Problem 18 finds a general formula for the slope, using dyjdx = (dy/d0)/(dx/d0). 
Problem 20 finds a more elegant formula, by looking at the question differently. 


LENGTH OF A POLAR CURVE 


The length integral always starts with ds — yf(dx) 2 +(dy) 2 . A polar curve has x = 
r cos 0 = F(0) cos 0 and y = F(0) sin 0, Now take derivatives by the product rule: 


dx = (F'(0)cos 0 — F(0)sin 0)d0 and dy - (F'(0) sin 0 + F(0)cos 0)d0. 
Squaring and adding (note cos 2 0 + sin 2 0) gives the element of length ds: 

ds = JlF'mi 2 + [F(fl)] 2 de. (5) 

The figure shows (ds) 2 = (dr) 2 + ( rd6 ) 2 , the same formula with different letters. The 
total arc length is J ds. 

The area of a surface of revolution is j 2ny ds (around the x axis) or j 2 nx ds 
(around the y axis). Write x, y , and ds in terms of 6 and dO . Then integrate. 



EXAMPLE 4 The circle r = cos 0 has ds = yfl d0. So its length is n (not 2*!! — don't 
go around twice). Revolved around the y axis the circle yields a doughnut with no 
hole. Since x = r cos 0 = cos 3 0, the surface area of the doughnut is 


2 nx ds = 


K 

2n cos J 0 d8 = n 1 . 
o 


EXAMPLE 5 The 

Oto k: 

length 


length of r = 1 + cos 9 is, by symmetry, double the integral from 
of cardioid = 2 


,/(-sin 9) 2 + (1 + cos 8) 2 d9 


= 2 


y/2 + 2 cos 9 d9= 4 


■* 9 

cos - d9 = 8. 
o 2 
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We substituted 4 cos 2 jO for 2 + 2 cos 6 in the square root. It is possible to skip 
symmetry and integrate from 0 to 2tt — but that needs the absolute value |cos £0| to 
maintain a positive square root. 


EXAMPLE 6 The logarithmic spiral r = e e has ds = >Je^ 19 + e~ ld dQ> It spirals to 
zero as 0 goes to infinity, and the total length is finite; 

\ds = % Jle- d9= -Jie-X 

Revolve this spiral for a mathematical seashell with area (2ne ~ e cos 0)^/le~ & dB. 


9.3 EXERCISES 


Read-through exercises 

A circular wedge with angle A0 is a fraction q of a whole 
circle. If the radius is r, the wedge area is b . Then the 
area inside r = F(9) is j e The area inside r = 0 2 from 0 
to n is d That spiral meets the circle r = 1 at 0 = e 
The area inside the circle and outside the spiral is f A 
chopped wedge of angle A 0 between r v and r 2 has area g 

The curve r — F(0) has x = r cos 0 = h and y = i 

The slope dyjdx is dyjdd divided by I For length (ds) 2 = 
(dx) 2 + (i dy ) 2 = k . The length of the spiral r = 9 to 8 = n 
is f i (not to compute integrals). The surface area when 
r = 9 is revolved around the x axis is j 2*_y ds = J m . The 
volume of that solid is Jjtv 2 Jx = J n 

In 1-6 draw the curve and find the area inside. 


15 Compute the area between the cardioid and circle in 
Example 3. 

16 Find the complete area (carefully) between the spiral r = 
e~ 0 ( 0 > 0) and the origin. 

17 At what 9' s does the cardioid r = 1 + cos 6 have infinite 
slope? Which points are furthest to the left (minimum jc)? 

18 Apply the chain rule dyjdx = (dyjd9)j(dxjd9) to x = 
F(9) cos 0, y = F(0) sin 0. Simplify to reach 

dy _ F + tan 0 dF/dO 
dx — F tan 9 + dFjdd‘ 

19 The groove in a record is nearly a spiral r = c0: 

length = | y/r 2 + (drjdO) 1 dO = J* 4 yjr 1 + c 2 drjc . 


1 r = 1 + cos 0 

2 r = sin 0 + cos 9 from 0 to n 

3 r = 2 + cos 0 

4 r = 1+2 cos 0 (inner loop only) 

5 r = cos 20 (one petal only) 

6 r = cos 30 (one petal only) 

Find the area between the curves in 7-12 after locating their 
intersections (draw them first), 

7 circle r = cos 9 and circle r = sin 9 
g spiral r = 0 and y axis (first arch) 

9 outside cardioid t = 1 + cos 0 inside circle r = 3 cos 9 

10 lemniscate r 2 = 4 cos 20 outside r = y/2 

1 1 circle r = 8 cos 0 beyond line t cos 0 = 4 

1 2 circle r = 1 0 beyond line r cos 0 = 6 

13 Locate the mistake and find the correct area of the lemnis- 
cate r 1 = cos 20: area = \r 2 dO = f* jeos 20 d6 = 0. 

14 Find the area between the two circles in Example 2. 


Take c = ,002 to give 636 turns between the outer radius 
14 cm and the inner radius 6 cm (14 — 6 equals >002(636)2*). 

(a) Omit c 2 and just integrate r drjc , 

(b) Compute the length integral Tables and calculators 
allowed. You will never trust integrals again. 

20 Show that the angle \jt between the ray from the origin 
and the tangent line has tan \jt = FI(dF/d0). 

Hint: If the tangent line is at an angle with the horizontal, 
then tan is the slope dyjdx in Problem 18> Therefore 


tan \j/ = tan(0 — 0) = 


tan 0 — tan 0 
1 + tan tfi tan 0 


Substitute for tan tfi and simplify like mad. 

21 The circle r = F(0) - 4 sin 0 has rjf = 0. Draw a figure 
including 0, <f> , ij/ and check tan 

22 Draw the cardioid r = 1 — cos 0, noticing the minus sign. 
Include the angles 9, tp t \jt and show that ij/ = 0/2. 

23 The first limagon in Figure 9.4 looks like a circle centered 
at (^, 0). Prove that it isn’t. 

24 Find the equation of the tangent line to the circle r = cos 0 
at 9 — n j6. 
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In 25-28 compute the length of the curve. 

25 r = 0 {() from 0 to 2 tt) 

26 r = sec 0 ( 0 from 0 to njA) 

27 r = sin J ( 0/3) (0 from 0 to 3 tt) 

28 r = 0 1 (0 from 0 to tt) 

29 The narrow wedge in Figure 9,6 is almost a triangle. It 
was treated as a circular sector but triangles are more familiar. 
Why is the area approximately jr 2 A 0? 

30 In Example 4 revolve the circle around the x axis and find 
the surface area. We really only revolve a semicircle. 

31 Compute the seashell area 2n J2 JJJ e~ 26 cos 0 dO using 
two integrations by parts. 

32 Find the surface area when the cardioid r = 1 + cos 0 
is revolved around the x axis, 

33 Find the surface area when the lemniscate r 2 =cos2 0 is 
revolved around the x axis. What is 0 after one petal? 


34 When y =f(x) is revolved around the x axis, the volume 
is J 7 iy z dx. When the circle r = cos 0 is revolved, switch to a 
^-integral from 0 to n/2 and check the volume of a sphere. 

35 Find the volume when the cardioid r = I + cos 0 is rotated 
around the x axis, 

36 Find the snrface area and volume when the graph of r = 
1/cos 0 is rotated around the y axis (0 ^ 0 ^ n/4). 

37 Show that the spirals r = 0 and r= 1 /0 are perpendicular 
when they meet at 0= 1. 

38 Draw three circles or radius 1 that touch each other and 
find the area of the curved triangle between them. 

39 Draw the unit square 0 ^ x ^ l, 0 ^ y ^ 1, In polar coordi- 
nates its right side is r = , Find the area from j ?r 2 d9. 

40 (Unravel the paradox) The area of the ellipse x = 4 cos 0, 
y— 3 sin 0 is k* 4*3 = 12ti. But the integral of ?r z dO is 

i i 

-{16 cos 2 0- 9 sin 2 0)(I9 = 12 -jr, 

Jo 2 2 


9.4 Complex Numbers 


Real numbers are sufficient for most of calculus. Starting from x 1 + 4, its integral 
3 .x 3 + 4x + C is also real. If we are given x 3 — 1, its derivative 3.x 2 is real. But the 
roots (or zeros) of those polynomials are complex numbers: 

.v 2 -^4 = 0 and x 3 -1=0 have complex solutions. 

We expect two square roots of — 4. There are three cube roots of 1 . Complex numbers 
are unavoidable, in order to find n roots for each polynomial of degree n. 

This section explains how to work with complex numbers. You will see their 
relation to polar coordinates. At the end, we use them to solve differential equations. 

Start with the imaginary number /. F very body knows that x 1 — - 1 has no real 
solution. When you square a real number, the result is never negative. So the world 
has agreed on a solution called /. (Except that electrical engineers call it Imaginary 
numbers follow' the normal rules of addition, subtraction, multiplication, and division, 
with one difference: Whenever i z appears it is replaced by — l . In particular — i times 
- j gives + i 1 = — T In other words, - r is also a square root of — F There are two 
solutions (i and - /) to the equation _x 2 4- 1 = 0. 

Finding cube roots of 1 will stretch us further. We need complex numbers — real 
plus imaginary. 


9B A complex number (say 1 + 3i) is the sum of a real number (1) and a pure 
imaginary number (30. Addition keeps those parts separate; multiplication uses 
i 2 = - 1: 

Addition: (1 + 3i) + (l + 30 = 1 + 1 + i(3 + 3) = 2 + 6/ 
Multiplication: (1 + 3 0(1 + 3 i] = 1 + 3r + 3f + 9i 2 = - 8^ 6i. 
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Adding 1 + 3i to 5 - i is easy (6 + 20- Multiplying is longer, but you see the rules: 

(l + 3i)(5-i)=5 + 15i-i-3i 2 = 8+ 14i. 

The point is this: We don’t have to imagine any more new numbers. After accepting 
i, the rest is straightforward. A real number is just a complex number with no 
imaginary part! When 1 + 3i combines with its “ complex conjugate” 1 — 3i — adding 
or multiplying — the answer is real: 

(1 +30 + (1-30 = 2 (real] 

(1 + 3i)(l - 30 = 1 “ 3i + 3i - 9i 2 = 10. (real) 

The complex conjugate offers a way to do division, by making the denominator real: 

1 1 1 - 3( 1 - 3( , 1 1 x - iy x-iy 

1 + 3i 1 + 3il - 3i 10 x + iy x + iyx- iy x 2 + y 2 


9C The complex number x + iy has real part x and imaginary part y, Its 
complex conjugate is x - iy . The product (x + iy ){x — iy) equals, x 2 + y 2 — r 1 . 
The absolute value {or modulus) is r = \x + iyi = ^/x 3 + y 2 . 


THE COMPLEX PLANE 

Complex numbers correspond to points in a plane . The number 1 + 3i corresponds to 
the point (1,3). Similarly x + iy is paired with (x, y ) — which is x units along the “real 
axis' 1 and y units up the “imaginary axis.” The ordinary plane turns into the complex 
plane . The absolute value r is the same as the polar coordinate r (Figure 9.8a). 

The figure shows two more copies of the complex plane. The one in the middle is 
for addition and subtraction. It uses rectangular coordinates. The one on the right is 
for multiplication and division and squaring. It uses polar coordinates. In squaring 
a complex number, t is squared and 6 is doubled — as the right figure and equation 
(3) both show. 


o l h 



a + ty 


1 real 
axis 


imaginary axis 


sum ^ 2 + 2/ 



* ‘ * 3 - 1 
conjugate 


r 2 = l 


A ] + o 2 


r, 0 -> rl 20 


Fig. 9.8 The complex plane shows x, y, r , 0. Add with x and y, multiply with r and S. 


Adding complex numbers is like adding vectors (Chapter 11). The real parts give 
3-1 and the imaginary parts give 1 + 1, The vector sum (2, 2) corresponds to the 
complex sum 2 + 2i. The complex conjugate 3 — t is the mirror image across the real 
axis (i reversed to — 0* The connection to r and 9 is the same as before (you see it in 
the triangle): 

x = r cos 6 and y = r sin 8 so that x + iy = r(cos 9 + i sin 6 ). (2) 

In the third figure, 1 + i has t = ^/2 and 9 = a/4. The polar form is cos rc/4 + 
^Jli sin rc/4. When this number is squared, its 45° angle becomes 90°. The square is 
(1 + i) z = 1 + 2i — 1 = 2i\ Its polar form is 2 cos n/2 + 2i sin tc/2. 
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9D Multiplication adds angles, division subtracts angles, and squaring doubles 
angles. The absolute values are multiplied, divided, and squared: 

(r cos 8 + ir sin B) 2 = r 2 cos 28 + i r 2 sin 26, (3) 

For nth powers we reach i* and n8 . For square roots, r goes to ^/r and 9 goes to 
%9. The number -1 is at 180°, so its square root i is at 90°, 

To see why 8 is doubled in equation (3), factor out r 2 and multiply as usual: 

(cos 8 + i sin 0)(cos 8 + i sin 0) = cos 2 0 — sin 2 0 + 2 i sin 8 cos 8. 

The right side is cos 28 + i sin 28. The double-angle formulas from trigonometry 
match the squaring of complex numbers. The cube would be cos 38 + i sin 36 (because 
26 and 8 add to 36, and r is still 1), The nth power is in de Afoivre's formula : 

{cos 6 + i sin 9}" = cos n8 + i sin n8. (4) 

With n — — 1 we get cos(“ 6) + i sin(— 6 ) — which is cos 8 — i sin 8 , the complex 
conjugate; 

1 _ 1 cos 6 i sin 8 cos 8 - i sin 8 

cos 8 4- i sin 8 cos 6 + i sin 8 cos 8 — i sin 8 1 

We are almost touching Euler's formula , the key to all numbers on the unit circle: 


Euler's formula : cos 8 + i sin 8 = e lB . 


( 6 ) 


Squaring both sides gives (^(e 1 *) = e 2lB . That is equation (3). The — 1 power is 1 je lB = 
e~ ie . That is equation (5). Multiplying any by produces The special case 
0 = 8 gives the square, and the special case 0 = - 8 gives = 1. 

Euler’s formula appeared in Section 6,7, by changing x to 18 in the series for e x : 


x 2 

e* = l + x + — + — + 
2 6 


becomes e l ° = 1 + i6 — — — i— + 


A highlight of Chapter 10 is to recognize two new series on the right. The real terms 
\ — \8 1 + — add up to cos 0. The imaginary part — adds up to sin 8. 

Therefore e iB equals cos 8 + i sin 8. It is fantastic that the most important periodic 
functions in all of mathematics come together in this graceful way. 

We learn from Euler (pronounced oiler) that e 1 * 1 ~ 1. The cosine of 2n is 1, the sine 
is zero. If you substitute x — 2ni into the infinite series, somehow everything cancels 
except the 1 — this is almost a miracle. From the viewpoint of angles, 9 = 2n carries 
us around a full circle and back to e 2 ™ = 1. 

Multiplying Euler’s formula by r, we have a third way to write a complex number: 

Every complex number is x~Hy = r cos 8 + ir sin 8 = re lS , (7) 


EXAMPLE 1 2e m times 3e ie equals 6e 2iS + For 8 — tt/ 2, 2i times 3i is —6, 

EXAMPLE 2 Find w 2 and w 4 and w 8 and w 2S when w = e”^ 4 . 

Solution e"* /4 is 1/^/2 + i/^/2* Note that r 2 = \ \ — L Now watch angles: 

w 2 — e* nl2 — i w 4 = e™ = - 1 w 8 = 1 w 2S — w 8 w 8 w 8 w — w. 

Figure 9.9 shows the eight powers of w. They are the eighth roots of\. 
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Fig. 9.9 The eight powers of w and the cube roots of 1. 


EXAMPLE 3 (x 2 + 4 = 0) The square roots of -4 are 2 i and - 21 Instead of (0(0 = 
— I we have = — 4* If Euler insists, we write 2i and — 2f as 2e tKil and 2e l3H/2 . 

EXAMPLE 4 (The cube roots of 1) In rectangular coordinates we have to solve 
{x + iyf = 1, which is not easy. In polar coordinates this same equation is r 3 e 3j# = 1. 
Immediately r = L The angle 8 can be 2nj3 or 4rc/3 or 6nj3 — the cube roots in the 
figure are evenly spaced : 

^2*i/3j3 _ e 2ni — j ^4?rt/3j3 _ ^4 ni — j ^6m'/3^3 — ^6ni = | 

You see why the angle 8 ti/ 3 gives nothing new. It completes a full circle back to 2rc/3. 

The nth roots of l are e 2j:un , e 4ni ", ..., 1. There are n of them . 

They He at angles 2n/n, 4 k/ n, . . . ,2 n around the unit circle . 

SOLUTION OF DIFFERENTIAL EQUATIONS 

The algebra of complex numbers is now applied to the calculus of complex functions. 
The complex number is c t the complex function is e a . It will solve the equations 
y if = — 4y and y m = y, by connecting them to c 2 = - 4 and c 3 — 1. Chapter 16 does 
the same for all linear differential equations with constant coefficients — this is an 
optional preview. 

Please memorize the one key idea: Substitute y = into the differential equation 
and solve for c. Each derivative brings a factor c , so / = ctf* and y rt = c 2 e CI : 

d 2 y;dt 2 - - 4y leads to cV' = - 4e c \ which gives c 2 = - 4. (8) 

For this differential equation, c must be a square root of - 4. We know the candidates 
(c = 2 i and c = - 2 i). The equation has two “pure exponential solutions” e a : 

y = e 2t1 and y= e~ 2il . (9) 

Their combinations y = Ae 2lt + Be^ 2lt give all solutions. In Chapter 16 we will choose 
the two numbers A and B to match two initial conditions at t = 0. 

The solution y = e 2u = cos 2t + i sin 2f is complex. The differential equation is real. 
For real y* s, take the real and imaginary parts of the complex solutions: 

y^\ = cos 2f and >‘ iHia8inary = sin 2t. (10) 

These are the “pure oscillatory solutions.” When y — e 2tl travels around the unit 
circle, its imaginary part sin It moves up and down. (It is like the ball and its shadow 
in Section 1.4, but twice as fast because of 2f.) The real part cos It goes backward 
and forward. By the chain rule, the second derivative of cos 2f is —4 cos 2 1. Thus 
d 2 yjdi 2 = ~4 y and we have real solutions. 
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EXAMPLE 5 Find three solutions and then three real solutions to d 3 yfdt 3 = y. 

Key step : Substitute y = & l . The result is c 3 e pr = e ct . Thus c 3 “ 1 and c is a cube root 
of 1. The candidate c = 1 gives y = d {our first solution). The next c is complex: 

l/3 

c = e 2nii3 = - - + i yields y = e c: = e~ t/2 e^ 3t[2 . (11) 

The real part of the exponent leads to the absolute value \y\ = It decreases as r 
gets larger, so y moves toward zero. At the same time, the factor e iy/3u2 goes around 
the unit circle. Therefore y spirals in to zero {Figure 9*10)* So does its complex 
conjugate, which is the third exponential* Changing i to — 1 in (11) gives the third 
cube root of 1 and the third solution ^ r/2 . 

The first real solution is y = e l . The others are the two parts of the spiral: 

yr«i = e" ,/2 cos Jltjl and y lniasinary = <T'' 2 sin Jlt\2. (12) 

That is r cos 9 and t sin 9 * It is the ultimate use {until Chapter 16) of polar coordinates 
and complex numbers* We might have discovered cos 2 1 and sin 2f without help, for 
/' = — 4y A don’t think these solutions to y m = y would have been found. 


EXAMPLE 6 Find four solutions to d^yjdt* = y by substituting y = e ct * 

Four derivatives lead to c 4 = 1. Therefore c is i or —1 or — i or 1. The solutions are 
y = e l \ e~\ e~ l \ and e f . If we want real solutions, e il and e~ lt combine into cos t and 
sin t. In all cases = y. 




Fig* 9 *i0 Solutions move in the complex plane: >■" - — Ay and /" = y and y" H = y * 


9.4 EXERCISES 


Read-through questions 

The complex number 3 + 4 i has real part ° and imagi- 
nary part b Its absolute value is r = c and its com- 
plex conjugate is d Its position in the complex plane is 
at ( e l Its polar form is r cos 9 + ir sin 0 = l e l& . Its 
square is 0 + i h Its nth power is I e' n& . 

The sum of 1 + i and 1 — j is i The product of 1 + i 
and 1 — i is k In polar form this is y/le** 14, times 1 
The quotient (l + i)/(l— i) equals the imaginary number 

m The number (1 4- 0 s equals n An eighth root of 
1 is w = o The other eighth roots are p 


To solve d^yidr* = y t look for a solution of the form y — 
q Substituting and canceling ^ leads to the equation 

f There are 3 choices for c, one of which is 

(— l + i)/^2* With that choice \tf*\ = t The real 

solutions are Re = u and Im e ct — v 


In 1-6 plot eaeh number in the complex plane. 

1 2 + i and its complex conjugate 2 — i and their sum and 
product 

2 1 + i and its square (1 4- i) 2 and its reciprocal 1/(1 + i) 
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3 2e'* /6 and its reciprocal ie~ iKf6 and their squares 

4 The sixth roots of 1 (six of them) 

5 cos 3 ji/4 + i sin 3^/4 and its square and cube 

6 4e? K/3 and its square roots 

7 For complex numbers c = x + iy = re ** and their con- 
jugates c = x — fy = re"“ find all possible locations in the 
complex plane of (1) c + c (2) c — c (3) cc (4) cjc , 

8 Find x and y for the complex numbers x + iy at angles 
0 = 45°, 90°, 135* on the unit circle. Verify directly that the 
square of the first is the second and the cube of the first is the 
third, 

9 If c — 2 + i and d =* 4 + 3i find cd and c/d r Verify that the 
absolute value |cd| equals |c| times |d|, and |c/d| equals |c| 
divided by |d|. 

10 Find a solution x to e* = i and a solution to e u = 1/e, 
Then find a second solution. 

Find the sum and product of the numbers in 11-14. 

11 e m and e~ i0 t also e 2xi} 3 and e 4 * V3 

12 e? and e'* also and 

13 The sixth roots of 1 (add and multiply all six) 

14 The two roots of c 2 — 4c + 5 = 0 

15 If c = re w is not zero, what are c 4 and c" 1 and tT 4 ? 

lfi Multiply out (cos 0 + i sin Gf — e i26 t to find the real part 
cos 30 and the imaginary part sin 38 in terms of cos 8 and 
sm 0 . 

17 Plot the three cube roots of a typical number re 1 * Show 
why they add to zero. One cube root is 

18 Prove that the four fourth roots of rJ* multiply to give 


In 19-22, find all solutions of the form y = e*. 

19 y ff + y = 0 20 y"' + y = 0 

21 = 0 22 y" + 6/+5y = 0 

Construct two real solutions from the real and imaginary parts 
of e* (first find c): 

23 y* + 49y = 0 24 /-2/ + 2y = 0 

Sketch the path of y = e* as t increases from zero, and mark 

y = e p : 

25 c = 1 — z 26 c= — 1+f 27 c = nif 4 

28 What is the solution of dyjdt = iy starting from y 0 = 1? 

For this solution, matching real parts and imaginary parts of 
dyjdt = iy gives _____ and . 

29 In Figure 9,10b, at what time t does the spiral cross the 
real axis at the far left? What does y equal at that time? 

30 Show that cos 8 = + e~ { ®) and find a similar formula 

for sin 0. 

31 True or false , with an example to show why: 

(a) If c x + c 2 is real, the c*$ are complex conjugates. 

(b) If }c x \=2 and \c 2 \ = 4 then c x c 2 has absolute value 8. 

(c) If led = 1 and )c 2 | = 1 then |c t +■ c 2 l is (at least 1) (at 
most 2} (equal to 2). 

<d) If e 0 approaches zero as t -* oo, then (c is negative) (the 
real part of c is negative) ( \c\ is less than 1). 

32 The polar form of re 1 * times Re** is , The rectan- 
gular form is . Circle the terms that give 

rit cos(0 + <f > ). 

33 The complex number 1 /{re**) has polar form and 

rectangular form and square roots . 

34 Show that cos ix = cosh x and sin it = i sinh x, What is 
the cosine of i? 
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CHAPTER 10 


Infinite Series 


Infinite series can be a pleasure {sometimes). They throw a beautiful light on sin x 
and cos x. They give famous numbers like n and e. Usually they produce totally 
unknown functions — which might be good. But on the painful side is the fact that 
an infinite series has infinitely many terms. 

It is not easy to know the sum of those terms. More than that, it is not certain 
that there is a sum. We need tests, to decide if the series converges. We also need 
ideas, to discover what the series converges to . Here are examples of convergence , 
divergence , and oscillation: 

1 +i + i + — =2 1 + 1 + 1 + = oo 1 - 1 + 1 - 1— =? 

The first series converges. Its next term is 1/8, after that is 1/16 — and every step 
brings us halfway to 2. The second series (the sum of Ts) obviously diverges to infinity. 
The oscillating example (with l’s and — Ts) also fails to converge. 

All those and more are special cases of one infinite series which is absolutely the 
most important of all: 

The geometric series is 1 + x + x 1 + x 3 4- •** = - — 


This is a series of functions. It is a “power series.” When we substitute numbers for 
x, the series on the left may converge to the sum on the right. We need to know when 
it doesn’t. Choose x = \ and x = 1 and x = — 1: 


1+2 + (£)* + — is the convergent series. Its sum is 


1-i 


1 + 1 + 14 — is divergent. Its sum is — ^ ^ = - = oo 


! + (-!) + (-1) 2 + ■” is the oscillating series. Its sum should be 


1 

1 - (- 1 ) 


2 ' 
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The last sum bounces between one and zero, so at least its average is At x = 2 
there is no way that 1+2 + 4+84- — agrees with 1/(1 - 2). 

This behavior is typical of a power series — to converge in an interval of x’s and 
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to diverge when x is large. The geometric series is safe for x between -1 and 1. 
Outside that range it diverges. 

The next example shows a repeating decimal 1.111...: 


Set x = -j^. The geometric series is 1 + ^ + 



The decimal 1.111... is also the fraction 1/(1 — which is 10/9. Every 
fraction leads to a repeating decimal. Every repeating decimal adds up (through the 
geometric series) to a fraction. 

To get 3.333..., just multiply by 3. This is 10/3. To get 1.0101..., set x= 1/100. 
This is the fraction 1/(1 - ^ 5 ), which is 100/99. 


Here is an unusual decimal (which eventually repeats). I don’t really understand it: 

- 7 - = .004 115 226 337 448 ... 

243 


Most numbers are not fractions (or repeating decimals). A good example is n\ 


n 




1000 


5 

10000 + 


This is 3.1415... , a series that certainly converges. We happen to know the first billion 
terms (the billionth is given below). Nobody knows the 2 billionth term. Compare 
that series with this one, which also equals n: 


71 = 4 — 


1 + 1 - 

3 5 


4 

- + 
7 


That alternating series is really remarkable. It is typical of this chapter, because its 
pattern is clear. We know the 2 billionth term (it has a minus sign). This is not a 
geometric series, but in Section 10.1 it comes from a geometric series. 

Question Does this series actually converge? What if all signs are 4- ? 

Answer The alternating series converges to n (Section 10.3). The positive series 
diverges to infinity (Section 10.2). The terms go to zero, but their sum is infinite. 

This example begins to show what the chapter is about, Part of the subject deals 
with special series, adding to 10/9 or n or e*. The other part is about series in general, 
adding to numbers or functions that nobody has heard of. The situation was the 
same for integrals — they give famous answers like In x or unknown answers like 
J x x dx . The sum ofl + 1/8 + 1/27 H — is also unknown — although a lot of mathema- 
ticians have tried. 

The chapter is not long, but it is full. The last half studies power series. We begin 
with a linear approximation like 1 + x. Next is a quadratic approximation like 
1 + x + x 2 . In the end we match all the derivatives of /(x). This is the “ Taylor series 
a new way to create functions — not by formulas or integrals but by infinite series. 

No example can be better than 1/(1 - x), which dominates Section 10.1. Then we 
define convergence and test for it. (Most tests are really comparisons with a geometric 
series.) The second most important series in mathematics is the exponential series 
e x = 1 4- x + jx 1 4- £x 3 H — . It includes the series for sin x and cos x, because of the 
formula e iX = cos x + i sin x. Finally a whole range of new and old functions will 
come from Taylor series. 

In the end, all the key functions of calculus appear as “infinite polynomials” (except 
the step function). This is the ultimate voyage from the linear function y = mx 4- fc. 



368 


10 Infinite Series 



10.1 The Geometric Series 



We begin by looking at both sides of the geometric series: 


1 + X + x z + x 3 + 


1 

1 - X 


(1) 


How does the series on the left produce the function on the right? How does 1/(1 — jc) 
produce the series? Add up two terms of the series, then three terms, then n terms: 

] z 1 -y 3 1-x" 

l+x=-Z- l + x + x 2 = -^- l + + . (2) 

1 - X 1 - X 1 - X 


For the first, 1 + x times 1 - x equals 1 - x 2 by ordinary algebra. The second begins 
to make the point: 1 + x + x 2 times 1 - x gives 1 - x + x — x 2 + x 2 - x 3 . Between 
1 at the start and — x 3 at the end, everything cancels. The same happens in all cases: 
1 + + x* -1 times 1 — x leaves 1 at the start and — x n at the end. This proves 

equation (2) — the sum of n terms of the series. 

For the whole series we will push n towards infinity. On a graph you can see what 
is happening. Figure 10.1 shows n = 1 and n= 2 and n — 3 and n = oo. 



Fig. 10.1 Two terms, then three 
terms, then full series: 

1 + x + x 2 + ■■■ = — . 

1 — x 


1 + x + x 2 + ■■■ 

j ~x v /r 

1 — X 


X 

X — 


X 


2 


X 

X 


2 

2 


— X 


3 


The infinite sum gives a finite 
answer, provided x is between 
— 1 and 1 . Then x” goes to zero: 

\-x H 1 

1 - X 1 — x' 


Sow start with the function 1/(1 - x). How does it produce the series? One way is 
elementary but brutal, to do “long division” of 1 — x into 1 (next to the figure). 
Another way is to look up the binomial formula for (1 — x) _1 . That is cheating — we 
want to discover the series, not just memorize it. The successful approach uses cal- 
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cuius. Compute the derivatives of f(x) = 1/(1 — x): 

/'=( i~*r 2 r = 2(i-*r 3 r=6(i-xr 4 - (3) 

At x = 0 these derivatives arc 1, 2, 6, 24, .... Notice how - 1 from the chain rule keeps 
them positive. The nth derivative at x = 0 is n factorial: 

/( 0) = 1 m = 1 /"(0) = 2 /"'(0) = 6 - /<">(0) = n!. 

Now comes the idea. To match the series with 1/(1 — x), match ail those derivatives at 
x = 0. Each power x n gets one derivative right. Its derivatives at x = 0 are zero, except 
the /7th derivative, which is n! By adding all powers we get every derivative right — 
so the geometric series matches the function: 

1 + x 4 x 2 + x 3 + ,+t has the same derivatives at x = 0 as 1/(1 — x). 

The linear approximation is 1 4 x. Then comes i/"( 0)x 2 = x 2 . The third derivative 
is supposed to be 6, and x 3 is just what we need. Through its derivatives, the function 
produces the series. 

With that example, you have seen a part of this subject. The geometric series 
diverges if |x|^l. Otherwise it adds up to the function it comes from (when 
- 1 < x < l). To get familiar with other series, we now apply algebra or calculus — to 
reach the square of 1/(1 - x) or its derivative or its integral. The point is that these 
operations are applied to the series. 

The best I know is to show you eight operations that produce something useful. 
At the end we discover series for In 2 and n. 


T Multiply the geometric series by a or ax: 


a 4 ax 4 ax + ■ = ax 4 ax 4 ax 4 - t+ = — . (4) 

1 - x 1 - x 

The first series fits the decimal 3.333.... In that case a = 3. The geometric series for 
x = to gave 1-111*-. — 10/9, and this series is just three times larger. Its sum is 10/3. 

The second series fits other decimals that arc fractions in disguise. To get 12/99, 
choose a = 12 and x = 1/100: 

^ „ 12 12 12 12/100 12 
100 1 00 2 100 5 1 - 1/100 99 

Problem 13 asks about .8787... and .1231 23,... It is usual in prccalculus to write 
+ + ar 2 4 ■■■ = aj{ 1 — r). But we use x instead of r to emphasize that this is a 

/unction — which wc can now differentiate, 

2. The derivative of the geometric series 1 4 x 4 x 2 4 - is 1/(1 — x) 2 : 

,+2 * +3 * !+ 4 -' 3 + ”'=£( 4 )-( 4 ?- (5) 

At x-jo the left side starts with 1.23456789. The right side is 1/(1 - j^) 2 = l/(9/l0) 2 , 
which is 100/81, If you have a calculator, divide 100 by 81, 

The answer should also be near (1.1111111 1) 2 , which is 1.2345678987654321. 

3. Subtract 1 4 x 4 x 2 4 ■■ from I 4 2x 4 3x 2 4 as you subtract functions : 


x 4 2x 2 4 3x 3 4 


1 1 _ 

(l-x) 2_ (l-x)“(l-x) 2t 


( 6 ) 


Curiously, the same scries comes from multiplying (5) by x. It answers a question left 
open in Section 8.4 — the average number of coin tosses until the result is heads. This 
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is the sum l(pi ) + 2(p 2 ) + from probability, with x = {: 

»(i) + 2 + 3& 3 + - = —i— = 2. (7) 

(i -ir 

The probability of waiting until tbe nth toss is p n — The expected value is two 
tosses , 1 suggested experiments, but now tbis mean value is exact. 

4 . Multiply series : the geometric series times itself is 1/(1 - x) squared: 

(1 + x + x 2 + -)(1 + * + ** + ■■■) = 1+2 jc + 3jc 2 + -\ (8) 

The series on the right is not new! In equation (5) it was the derivative of y= 1/(1 — x). 
Now it is the square of the same y. The geometric series satisfies dyjdx = y 2 , so the 
function does too. We have stumbled onto a differential equation. 

Notice how the series was squared. A typical term in equation (8) is 3x 2 , coming 
from 1 times x 1 and x times x and x 2 times 1 on the left side. It is a lot quicker to 
square 1/(1 — x) — but other series can be multiplied when we don't know what func- 
tions they add up to. 

5. Solve dyjdx = y 2 from arty starting value— a new application of series: 

Suppose the starting value is y = 1 at x = 0. The equation y* = y 2 gives l 2 for the 
derivative. Now a key step: The derivative of the equation gives y" = 2yy T . At x = 0 
that is 2 • 1 • 1. Continuing upwards, the derivative of 2yy f is 2 yy ff + 2(y') 2 . At x = 0 
that is y ,tf = 4 + 2 = 6. 

All derivatives are factorials: 1, 2, 6, 24, .... We are matching the derivatives of the 
geometric series 1 + x + x 2 + x 3 + .... Term by term, we rediscover the solution to 
/ = y 2 . The solution starting from y( 0) = 1 is y = 1/(1 - x). 

A different starting value is — 1. Then y* - {— l) 2 = 1 as before. The chain rule gives 
y** — 2 yy' — —2 and then y m = 6. With alternating signs to match these derivatives, the 
solution starting from - 1 is 

y= - 1 + x-x 2 + x 3 -” = -1/(1 + x). (9) 

It is a small challenge to recognize the function on the right from the series on the 
left. The series has — x in place of x; then multiply by -1. The sum y = - 1/(1 + x) 
also satisfies / = y 2 . We can solve differential equations from all starting values by 
infinite series . Essentially we substitute an unknown series into the equation, and 
calculate one term at a time. 

6. The integrals of 1 + x + x 2 + and 1 - x + x 2 - are logarithms : 

1 1 f x dx 

x + -x 2 + -x 3 + = - — — = — In (1 — x) (10a) 

2 3 Jo 1 -x 

X ~\x 2 + ^c 3 - - = = + In (1 + X) (10b) 

2 3 Jo 1 + * 

The derivative of (10a) brings back the geometric series. For logarithms we find 1/n 
not 1/n! The first term x and second term j x 2 give linear and quadratic approxi- 
mations. Now we have the whole series. I cannot fail to substitute 1 and j, to find 
ln(l - 1) and ln(l + 1) and ln(l - *): 

x = 1: 1 +£+ 3 + z + -■ = -In 0= + oo (11a) 

x = i: + In 2 = .693 (lib) 

x-i: i + i + A + &+-=-M = ln 2 . ( 12 ) 
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The first series diverges to infinity. This harmonic series 1 + \ \ came into the 

earliest discussion of limits (Section 2*6)* The second series has alternating signs and 
converges to In 2* The third has plus signs and also converges to In 2* These will be 
examples for a major topic in infinite series — tests for convergence. 

For the first time in this book we are able to compute a logarithm! Something 
remarkable is involved. The sums of numbers in (11) and (12) were discovered from the 
sums of functions in {10). You might think it would be easier to deal only with numbers, 
to compute In 2, But then we would never have integrated the series for 1/(1 “ x) and 
detected (10)* It is better to work with x , and substitute special values like £ at the 
end* 

There are two practical problems with these series. For In 2 they converge slowly. 
For In e they blow up. The correct answer is In e = 1, but the series can’t find it* Both 
problems are solved by adding ( 10 a) to ( 10 b), which cancels the even powers: 

/ X 3 X ^ \ 1 + x 

T + y + y + "V = 111(1 +JC) " lnU " JC) “ ln r^* (13) 

At x = 5 , the right side is In f — in f = In 2. Powers of 3 are much smaller than powers 
of 1 or so In 2 is quickly computed. All logarithms can be found from the improved 
series (13)* 


7. Change variables in the geometric series (replace x by x 2 or — x 2 ): 

l + x 2 + x 4 + x 6 +- = li(l-x 2 ) (14) 

1 -x 2 + x 4 -* 6 + ■■■ = 1/(1 + x 2 ). (15) 

This produces new functions (always our goal)* They involve even powers of x. The 
second series will soon be used to calculate n * Other changes are valuable: 

\2 j 2 


x x 

- in place of x: t + - 4- 




- in place of x: 1 + - + + -■ = 


1 - (x/ 2 ) 2- x 

1 x 


(16) 


(17) 


1 -( 1 /x) x- 1 

Equation (17) is a series of negative powers x~ n . It converges when \x\ is greater than 
l. Convergence in (17) is for large x. Convergence in (16) is for |x| < 2. 


8 * The integral of l - x 2 -h x 4 — x 6 + ■■ yields the inverse tangent of x: 


x — - 


1 . 1 , 1 7 f dx 

-x 3 + -X 5 - -x 7 + - = _ 

3 5 7 ]\+x 2 


= tan x* 


( 18 ) 


We integrated (15) and got odd powers. The magical formula for tt (discovered by 
Leibniz) comes when x = L The angle with tangent 1 is nj 4: 


, 111 n 

1 — — + — — — H - **■ = — 

3 5 7 4 


(19) 


The first three terms give tt a 3.47 (not very close). The 5000th term is still of size 
*0001, so the fourth decimal is still not settled. By changing to x = 1/^/3, the astrono- 
mer Halley and his assistant found 71 correct digits of tt/6 (while waiting for the 
comet). That is one step in the long and amazing story of calculating n. The Chudnov- 
sky brothers recently took the latest step with a supercomputer — they have found 
more than one billion decimal places of n (see Science, June 1989). The digits look 
completely random, as everyone expected. But so far we have no proof that all ten 
digits occur of the time. 
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Historical note Archimedes located n above 3.14 and below 37 . Variations of his 
method (polygons in circles) reached as far as 34 digits — but not for 1800 years. Then 
Halley found 71 digits of jt/ 6 with equation (18), For faster convergence that series 
was replaced by other inverse tangents, using smaller values of x: 

- = tan' - + tan" - = 4 tan '--tan . (20) 

A prodigy named Dase, who could multiply 100-digit numbers in his head in 8 hours, 
finally passed 200 digits of n. The climax of hand calculation came when Shanks 
published 607 digits. I am sorry to say that only 527 were correct, (With years of 
calculation he went on to 707 digits, but still only 527 were correct.) The mistake was 
not noticed until 1945! Then Ferguson reached 808 digits with a desk calculator. 

Now comes the computer. Three days on an ENIAC (1949) gave 2000 digits. A 
hundred minutes on an IBM 704 (1958) gave 10,000 digits. Shanks (no relation) 
reached 100,000 digits. Finally a million digits were found in a day in 1973, with a 
CDC 7600. All these calculations used variations of equation (20). 

The record after that went between Cray and Hitachi and now IBM. But the 
method changed. The calculations rely on an incredibly accurate algorithm, based 
on the ‘"arithmetic-geometric mean iteration” of Gauss. It is also incredibly simple, 
all things considered: 

On + 1 = a " 1 b " + 1 = = A+ 1 ! ( 1 1 2V; - V> 

1 i \ k - 0 

The number of correct digits more than doubles at every step. By n = 9 we are far 
beyond Shanks (the hand calculator). No end is in sight. Almost anyone can go past 
a billion digits, since with the Chudnovsky method we don’t have to start over again. 

It is time to stop. You may think (or hope) that nothing more could possibly be 
done with geometric series. We have gone a long way from 1/(1 - x), but some 
functions can never be reached. One is e x (and its relatives sin x, cos x, sinh x, cosh x). 
Another is l - x (and its relatives l/ v /l - x\ sin ” sec” 1 x, .,,). The exponentials 
arc in 10.4, with series that converge for all x. The square roots are in 10,5, closer to 
geometric series and converging for [x| < 1. Before that we have to say what con- 
vergence means. 

The series came fast, but I hope you see what can be done (subtract, multiply, 
differentiate, integrate). Addition is easy, division is harder, all are legal. Some un- 
expected numbers are the sums of infinite series. 

Added in proof By e-mail I just learned that the record for n is back in Japan: 
2 30 digits which is more than F07 billion. The elapsed time was 100 hours (75 hours 
of CPU time on an NFC machine). The billionth digit after the decimal point is 9. 


10.1 EXERCISES 


Read- through questions 

The geometric series 1 + x + x 2 + ■- adds to q It con- 
verges provided |x| < b The sum of n terms is c 
The derivatives of the series match the derivatives of 1/(1 — x) 
at the point x = d where the nth derivative is e 
The decimal U 1 1 ... is the geometric series at x = f and 


equals the fraction g The decimal .666,,, multiplies this 
by h The decimal .999... is the same as < 

The derivative of the geometric scries is | = k 

This also comes from squaring the i series. By choosing 
-x = .01, the decimal 1.02030405 is close to m The 
differential equation dyidx = y~ is solved by the geometric 
series, going term by term starting from y(0) = n 
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The integral of the geometric series is o — P , At 
x = 1 this becomes the q series, which diverges. At x = 
f weiind In 2 = « . The change from x to —x produ- 

ces the series 1/{1 4- x) = t and ln(I + x} = u . 

In the geometric series, changing to x 2 or — x 2 gives 
1/(1 — x 2 ) = _y_ and 1/n 4- x 2 ) - * . Integrating the 

last one yields x — ^x 3 + = x . The angle whose 

tangent is x = 1 is tan -1 1 = y Then substituting x — I 
gives the series n = * 

1 The geometric series is 1 + x + x 2 + * ■* = G. Another way 

to discover G is to multiply by x . Then x + x 2 + x 3 + = 

xG 3 and this can be subtracted from the original series. What 
does that leave, and what is G? 

2 A basketball is dropped 10 feet and bounces back 6 feet. 
After every fall it recovers j of its height. What total distance 
does the ball travel, bouncing forever? 

3 Find the sums of $ + £ + ^ and 1 — | 4- ^ and 

10— 1 +.1 — .01... and 3.040404.... 

4 Replace x by 1 — x in the geometric series to find a series 
for 1 lx. Integrate to find a series for lnx. These are power 
series “around the point x = 1 ” What is their sum at x = 0? 

5 What is the second derivative of the geometric series, and 
what is its sum at x = i? 

6 Multiply the series (1 + x + x 2 + -'*)(1 — x + x 2 - ■■■) and 
find the product by comparing with equation (14). 

7 Start with the fraction 7. Divide 7 into 1.000... (by long 

division or calculator) until the numbers start repeating. 
Which is the first number to repeat? How do you know that 
the next digits will be the same as the first? 

Note about the fractions 1 jq, IG/tf, 100/q, ... All remainders are 
less than q so eventually two remainders are the same. By 
subtraction, q goes evenly into a power 10* minus a smaller 
power 10*“". Thus qc= 10*- 10*"* for some c and 1 jq has 
a repealing decimal: 

1 c cl 

q~ 10*-10*"" _ 10*1-10"* 

= 7o^( 1 + + To 55 + '“)■ 

Conclusion: Every fraction equals a repeating decimal. 

8 Find the repeating decimal for and read off c. What is 
the number n of digits before it repeats? 

9 From the fact that every q goes evenly into a power 10* 
minus a smaller power, show that all primes except 2 or 5 go 
evenly into 9 or 99 or 999 or 

10 Explain why .010010001... cannot be a fraction (the 
number of zeros increases). 

11 Show that .123456789101112... is not a fraction. 


12 From the geometric series, the repeating decimal 
1.065065.. . equals what fraction? Explain why every repeating 
decimal equals a fraction. 

13 Write ,878787... and ,123123... as fractions and as geo- 
metric series. 


14 Find the square of 1,111... 

as an infinite series. 

Find the functions which equal the sums 15-24. 

15 x + x 2 + x 5 + ■■ 

16 l — 2x + 4jc 2 

17 x 3 +x 6 + x 9 + 

18 ix-ijc' + ijc 1 -- 

19 In x + pn x) 3 +(ln x) 3 + -* 

20 x — 2x 2 + 3jc 3 - — 

_ 1 1 1 

21 — + 2 T 1 + “' 
x x A x J 

22 x + + 4- ■■ 

1+x (1+x) 2 

23 tan x— itan 3 x+ jtan s x- -• 

‘ 24 e* + e 2x 4- e 3 * H — 

25 Multiply the series for 1/(1 

— x) and 1/(1 +x) to find the 


coefficients of x, x 2 , x 3 and x". 

26 Compare the integral of 1 + x 2 + x* + to equation (13) 
and find Jdx/(l-x 2 ). 

27 What fractions are dose to .2468 and .987654321? 

28 Find the first three terms in the series for 1/(1 — x) 3 . 


Add up the series 29-34. Problem 34 comes from (18). 


„„ 2 2 2 

29 T + T? + 77 + " ' 

3 3 3 

30 .1 + ,02 + .003 H — 

31 .t+4(.01)+i{.001) + - 

32 .1 -i{.01) + i(.001) 

33 .l+i(.001)+i(.00001)+- 

34 [ L + _i 

■” 1 3,3 + 5.32 

35 Compute the nth derivative of 1 + 2x + 3x 2 4 - ■" at x = 0, 


Compute also the nth derivative of (1 — x)~ 2 . 

36 The differential equation dy/dx = y 2 starts from y(Q) = b. 
From the equation and its derivatives find y\ y ", /" at x = 0, 
and construct the start of a series that matches those deriva- 
tives. Can you recognize y(x)? 

37 The equation dyjdx = y 2 has the differential form dyjy 2 = 
dx. Integrate both sides and choose the integration constant 
so that y — b at x = 0. Solve for y(x) and compare with 
Problem 36. 

38 In a bridge game, what is the average number p of deals 

until you get the best hand? The probability on the first deal 
is Then p 2 = (j)(A) = (probability of missing on the 

first) times (probability of winning on the second). Generally 
p B = (J)"“ 1 (i). The mean value fi is pi + 2p 2 + 3p 3 + — 


39 Show that (Ifl n )(I6 n ) = Sn H h J1 is ridiculous. 

40 Find a series for In i by choosing x in (10b). Find a series 
for In 3 by choosing x in (13). How is In $ related to In 3, and 
which series converges faster? 
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41 Compute In 3 to its second decimal place without a calcu- 
lator (OK to check). 

42 To four decimal places, find the angle whose tangent is 

*=A- 

43 Two tennis players move to the net as they volley the ball. 
Starting together they each go forward 39 feet at 13 feet per 
second. The ball travels back and forth at 26 feet per second. 
How far does it travel before the collision at the net? (Look 
for an easy way and also an infinite series.) 

44 How many terms of the series 1— i+i— £+*'■ are 
needed before the first decimal place doesn’t change? Which 
power of i equals the 100th power of j? Which power I /a" 
equals I/2 100 ? 


45 If tany = i and tanz = i, then the tangent of y + z is 
(tan y + tan z)/(l — tan y tan z) — 1. If tan y = % and tan z — 

, again tan(y + z) = 1. Why is this not as good as 

equation (20), to find n/4? 

46 Find one decimal of n beyond 3.14 from the series for 
4 ta n ” 1 i and 4 tan “ 1 i- How many terms are needed in each 
series? 

47 (Calculator) In the same way find one decimal of n 
beyond 3,14159. How many terms did you take? 

48 From equation (10a) what is Ze^/n? 

49 Zeno’s Paradox is that if you go half way, and then half 
way, and then half way---, you will never get there. In your 
opinion, doesi + i + ^H — add to 1 or not? 
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This is the third time we have stopped the calculations to deal with the definitions. 
Chapter 2 said what a derivative is. Chapter 5 said what an integral is. Now we say 
what the sum of a series is — if it exists. In all three cases a limit is involved. That is 
the formal, careful, cautious part of mathematics, which decides if the active and 
progressive parts make sense. 

The series i ^ + **■ converges to 1. The series 1 + ^ diverges to infin- 

ity. The series 1 “ i + i — converges to In 2. When we speak about convergence or 
divergence of a series, we are really speaking about convergence or divergence of its 
“partialsums.” 

DEFINITION 1 The partial sum s n of the series a x + fl 2 + stops at a n \ 

s„ = sum of the first n terms = a 1 + a 2 + ' " + 

Thus s n is part of the total sum. The example i + i + i H — has partial sums 



Those add up larger and larger parts of the series — what is the sum of the whole 
series? The answer is: The series ? + * + , converges to 1 because its partial sums s tt 
converge to 1. The series a x + a 2 + a z -\- ... converges to s when its partial sums— 
going further and further out — approach this limit s. Add the a’s, not the s* s, 

DEFINITION 2 7 he sum of a series is the limit of its partial sums s n . 

We repeat: if the limit exists . The numbers s H may have no limit. When the partial 
sums jump around, the whole series has no sum , Then the series does not converge. 
When the partial sums approach s, the distant terms a n are approaching zero. More 
than that, the sum of distant terms is approaching zero. 

The new idea (E a„ = s) has been converted to the old idea (s„ s). 

EXAMPLE A The geometric series + x ^ H — converges to s = £, 

The partial sums 5 : , s 2i s 4 are .1, .11, ,111, .1111. They are approaching s = £. 
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Note again the difference between the series of d s and the sequence of s’s. The series 
1 + 1 + 1 + ■ ■ ' diverges because the sequence of s’s is 1 , 2 , 3, ....A sharper example is 
the harmonic series: 1 + i + 3 H — diverges because its partial sums 1 , 1 ^, ... eventu- 
ally go past every number s. We saw that in 2.6 and will see it again here. 

Do not confuse a n -> 0 with s n -*■ s. You cannot be sure that a series converges, just 
on the basis that a n -> 0. The harmonic series is the best example: a n = 1 /rr — ► 0 but 
still s n -► oc. This makes infinite series into a delicate game, which mathematicians 
enjoy. The line between divergence and convergence is hard to find and easy to cross. 
A slight push will speed up a n -► 0 and make the s n converge. Even though a n 0 
does not by itself guarantee convergence, it is the first requirement: 


10A If a series converges ($„ -+ 5 ) then its terms must approach zero (a„ -► 0). 

Proof Suppose s n approaches 5 (as required by convergence). Then also x 
approaches 5 , and the difference s n — s „- 1 approaches zero. That difference is a„. So 
a n -*0. 

EXAMPLE 1 (continued) For the geometric series 1 + * + x 2 + the test a„-> 0 is 
the same as x n -> 0. The test is failed if \x\ ^ 1, because the powers of x don't go to 
zero. Automatically the series diverges. The test is passed if — 1 < jc < 1 , But to prove 
convergence, we cannot rely on a n -* 0. It is the partial sums that must converge: 

1 - x n 1 

s a = 1 + x + ■ + x" 1 = and s n -► , This is 5 . 

1 - x 1 - x 

For other series, first check that a n -> 0 (otherwise there is no chance of con- 
vergence). The a n will not have the special form x" — so we need sharper tests. 

The geometric series stays in our mind for this reason. Many convergence tests are 
comparisons with that series , The right comparison gives enough information: 

If \a x \ < ^ and \a 2 \ < k anc * then a x + a 2 + ... converges faster than i + 4 + 

More generally, the terms in a x + a 2 + a$ + . . . may be smaller than 
ax + ax 2 + ax 2, + . . . . Provided x < 1, the second series converges. Then Y a n a lso 
converges. We move now to convergence by comparison or divergence by comparison. 
Throughout the rest of this section, all numbers a„ are assumed positive. 

COMPARISON TEST AND INTEGRAL TEST 

In practice it is rare to compute the partial sums s n = a x H — + a n . Usually a simple 
formula can’t be found. We may never know the exact limit s. But it is still possible 
to decide convergence — whether there is a sum — by comparison with another series 
that is known to converge. 


10B ( Comparison test) Suppose that 0 ^a n ^b„ and YK converges. Then 
Y converges. 

The smaller terms a„ add to a smaller sum: Y a * * s below Y^» a nd must converge. 
On the other hand suppose a n ^ c„ and Y c n = 00 ■ This comparison forces Y ~ 00 ■ 
A series diverges if it is above another divergent series. 

Note that a series of positive terms can only diverge “to infinity." It cannot oscillate, 
because each term moves it forward. Either the s„ creep up on 5 , passing every number 
below it, or they pass all numbers and diverge. If an increasing sequence s n is bounded 
above t it must converge . The line of real numbers is complete, and has no holes. 
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The harmonic series 1 + j + 3 H- \ + ... diverges to infinity, 

A comparison series is l + i + i + £ + F + F + i + The harmonic series is 

larger , But this comparison series is really 1 + i because j = | = §* 

The comparison series diverges. The harmonic series, above it, must also diverge. 

To apply the comparison test, we need something to compare with. In Example 2, 
we thought of another series. It was convenient because of those ^s. But a different 
series will need a different comparison, and where will it come from? There is an 
automatic way to think of a comparison series. It comes from the integral test. 

Allow me to apply the integral test to the same example. To understand the integral 
test , look at the areas in Figure 10*2. The test compares rectangles with curved areas. 




Fig* 10*2 Integral test : Sums and integrals both diverge (p = l) and both converge (p > l) r 


EXAMPLE 2 (again) Compare 1 + 1 + 7 + ... with the area under the curve y = l/.x* 


Every term a n = l/'n is the area of a rectangle. We are comparing it with a curved 
area c a . Both areas are between x = n and x = n + 1 , and the rectangle is above the 
curve. So a D > c n : 


rectangular area a„ = - exceeds curved area c. n 
n 


fn + 1 


dx 


n 


X 


Here is the point. Those c n *s look complicated, but we can add them up. The sum 
cl + ... + c n is the whole area, from 1 to n + 1. It equals ln(n + 1 ) — we know the 
integral of 1 /x. We also know that the logarithm goes to infinity. 

The rectangular area 1 + 1/2 ... + 1/n is above the curved area. By comparison 

of areas, the harmonic series diverges to infinity — a little faster than ln(n + 1 ). 

Remark The integral of 1/x has another advantage over the series with ^s. First, 
the integral test was automatic. From 1/n in the series, wc went to i/x in the integral* 
Second, the comparison is closer. Instead of adding only { when the number of terms 
is doubled, the true partial sums grow like In n. To prove that, put rectangles under 
the curve. 

Rectangles below the curve give an area below the integral. Figure 10.2b omits the 
first rectangle, to get under the curve. Then we have the opposite to the first 
comparison - the sum is now smaller than the integral: 

1 1 1 * n dx 

+ +... + _< — = lnn. 

2 3 n J j x 

Adding 1 to both sides, s rt is below 1 + In n. From the previous test, s n is above 
ln(n + 1). That is a narrow space— we have an excellent estimate ofs n . The sum of 1/n 



10.2 Convergence Tests: Positive Series 


377 


and the integral of 1/x diverge together. Problem 43 will show that the difference 
between s n and In n approaches “Euler’s constant/" which is V - .511 .... 

Main point : Rectangular area is s ff , Curved area is close. We are using integrals to 
help with sums {it used to be the opposite). 

Question If a computer adds a million terms every second for a million years, how 
large is the partial sum of the harmonic series? 

Answer The number of terms is u = 60 2 * 24 • 365 ■ 10 12 < 3.2 ■ I0 19 . Therefore In n 
is less than In 3,2 + 19 In 10 < 45. By the integral test s n < I + In n, the partial sum 
after a million years has not reached 46, 

For other series, t/x changes to a different function y(x), At x = n this function 
must equal a n . Also y(x) must be decreasing. Then a rectangle of height a n is above 
the graph to the right of x = n, and below the graph to the left of x = n. The series 
and the integral box each other in: left sum ^ integral ^ right sum. The reasoning is 
the same as it was for a n = \/n and y(x)- 1/x: There is finite area in the rectangles 
when there is finite area under the curve. 

When we can’t add the a\ we integrate y(x) and compare areas: 


10C (Integral test) If y(x ) is decreasing and y(n ) agrees with a n , then 
a, + a 2 + a 3 + and J y(x) dx both converge or both diverge. 

EXAMPLE 3 The "p-series" ^ converges if p> !. Integrate y = 


f 


dx 


so by addition 


1 

n=2 « P < J 1 X"‘ 


In Figure 10.2c, the area is finite if p > 1. The integral equals [x 1 ~ p j{\ - p)]“ , which 
is !/{p— 1). Finite area means convergent series. If 1/1 p is the first term, add 1 to the 
curved area: 

11.1. . . 1 _ p 


\p + 2” + y + 


i + 


p- 1 p- f 


The borderline case p- 1 is the harmonic series (divergent). By the comparison 
test, every p < 1 also produces divergence. Thus Xt j^Jn diverges by comparison with 
Jdx/ v // x (and also by comparison with Xl/n). Section 7.5 on improper integrals runs 
parallel to this section on “improper sums” (infinite series). 

Notice the special cases p = 2 and p = 3. The series 1 + £ + 4 + converges. Euler 
found tt 2 /6 as its sum. The series 1 + i + also converges. That is proved by 

comparing Xl/n 3 with XI jn 1 or with jdx/x 3 . But the sum for p — 3 is unknown. 


Extra credit problem The sum of the p-series leads to the most important problem 
in pure mathematics. The “zeta function” is Z(p) = TA jn p , so Z( 2) = ti 2 /6 and Z(3) is 
unknown. Riemann studied the complex numbers p where Z(p) = 0 (there are infi- 
nitely many). He conjectured that the real part of those p is always j. That has been 
tested for the first billion zeros, but never proved. 


COMPARISON WITH THE GEOMETRIC SERIES 


We can compare any new series a x + a 2 + •** with 1 + x + — . Remember that the 
first million terms have nothing to do with convergence. It is further out, as n -► oo, 
that the comparison stands or falls. We still assume that a n > 0. 
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10D ( Ratio test) If a„+ i fa n approaches a limit L < 1, the series converges. 
10E {Root test) If the nth root (a n ) 1/h approaches L < 1, the series converges. 


Roughly speaking, these tests make a n comparable with L n — therefore convergent. 
The tests also establish divergence if L > 1* They give no decision when L = 1. Unfor- 
tunately L — 1 is the most important and the hardest case. 

On the other hand, you will now see that the ratio test is fairly easy. 

EXAMPLE 4 The geometric series x + x 2 H — has ratio exactly x. The nth root is 
also exactly x. So L — x. There is convergence if x < 1 (known) and divergence if x > 1 
(also known). The divergence of 1 + 1 + is too delicate (!) for the ratio test and 
root test, because L= L 

EXAMPLE 5 The p-series has a„ = 1 in p and a n+l ja„ — n p ![n l) p . The limit as n -* oo 
is L = 1, for every p* The ratio test does not feel the difference between p = 2 (conver- 
gence) and p = 1 (divergence) or even p = — 1 (extreme divergence). Neither does the 
root test* So the integral test is sharper. 

EXAMPLE 6 A combination of p-series and geometric series can now be decided: 

xx 1 x n a x n + 1 n p 

— + — has ratio = — approaching L = x, 

\ p 2 P n p a n (n + \} p x n * 

It is |x| < 1 that decides convergence, not p. The powers x n are stronger than any n p . 
The factorials nl will now prove stronger than any x H . 

EXAMPLE 7 The exponential series e x = 1 + x + ^x 2 + fx 3 - ■ converges for all x. 
The terms of this series are x n !nl The ratio between neighboring terms is 

*" +l /(n+l)! X u . L L r rt 

= , which approaches L = 0 as n -> oc. 

x jn\ n + 1 ^ 

With x = 1, this ratio test gives convergence of ^ 1/n! The sum is e. With x = 4, the 
larger series ^ 4 n /nl also converges* We know this sum too — it is c 4 * Also the sum 
of x n n p jn\ converges for any x and p. Again L = 0 — the ratio test is not even close. 
The factorials take over , and give convergence. 

Here is the proof of convergence when the ratios approach L< 1, Choose x halfway 
from L to 1, Then x < 1, Eventually the ratios go below x and stay below: 

a* + 1 I&n < x a x + 2 I&n + 1 < x a * + 3 /a* + 2 <x 

Multiply the first two inequalities. Then multiply all three: 

% + i /a N <x a N + 2 /a N <x 2 + 

Therefore + 1 + a y + 2 T + 3 is less than a v (x + x 2 + x 3 + *«■)* Since x < 1, 
comparison with the geometric series gives convergence. 

EXAMPLE 8 The series ^ \jn n is ideal for the root test* The nth root is 1/n, Its 
limit is L = 0* Convergence is even faster than for e = ^ 1/n! The root test is easily 
explained, since (a„) L ' rt < x yields a n < x n and x is close to L < 1. So we compare with 
the geometric series* 
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SUMMARY FOR POSITIVE SERIES 


The convergence of geometric series and p-series and exponential series is settled, I 
will put these a n 's in a line, going from most divergent to most convergent. The 
crossover to convergence is after 1 /n: 


l + 1+ - <'''>£ 'n 

10A 10B and 10C 

(a„ -j* 0) (comparison and integral) 


n I 4" 1 1 
2" 2" nl n\ tf 

10D and 10E 

(ratio and root) 


You should know that this crossover is not as sharp as it looks. On the convergent 
side, l/rt(ln n) z comes before all those p-series. On the divergent side, l/n(ln n) and 
l/n(ln n)(ln In n) belong after l/n. For any divergent (or convergent) series, there is 
another that diverges (or converges) more slowly. 

Thus there is no hope of an ultimate all-purpose comparison test. But comparison 
is the best method available. Every series in that line can be compared with its 
neighbors, and other series can be placed tn between. It is a topic that is understood 
best by examples. 


EXAMPLE 9 Y diverges because Y~ diverges. The comparison uses In n < «. 

EXAMPLE 10 V— - — —7 ^ f— -7 < 00 Y— : — — T = 00 * 

^ n( In ri) 2 J x(ln x) 2 ^ «(ln n) J x{\n x) 

The indefinite integrals are — 1/In x and ln(ln x). The first goes to zero as x -+ 00 ; the 

integral and series both converge. The second integral ln(ln x) goes to infinity — very 
slowly but it gets there. So the second series diverges. These examples squeeze new 
series into the line, closer to the crossover. 

EXAMPLE 1 1 2 ^ i < — so ^ ^ + -j^ + <j- + ^ + ^ + *** (convergence). 

The constant 1 in this denominator has no effect — and again in the next example. 

1 1 ill 111 

EXAMPLE 12 _>_ so - + - + - + - > - + - + - + 

2n — 1 In 1 3 5 2 4 6 

Y 1 12n is 1/2 times Y 1 l n * 50 both series diverge. Two series behave in the same 
way if the ratios a„/b n approach L> 0. Examples 11-12 have n 2 /(n 2 -h I)-* l and 
2n/(2n — !)—►!, That leads to our final test: 


10F (Limit comparison test) If the ratio ajb n approaches a positive limit L, 
then Y a n afi d Y ^ either both diverge or both converge. 


Reason : a„ is smaller than 2 Lb n and larger than \Lb n , at least when n is large. So the 
two series behave in the same way. For example ^sin(7/^) converges for p> 1, 
not for p ^ !. It behaves like Y */ wP (here L= 7). The tail end of a series (large n) 
controls convergence. The front end (small n) controls most of the sum. 

There are many more series to be investigated by comparison. 
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Read-through questions 

The convergence of a i + a 2 F is decided by the partial sums 
s K = a If the s n approach $> then La* = b For the 

c series 1 4- x F -■* the partial sums are s„ = d In 

that case s„-* 1/(1 — jc) if and only if _®_. In all cases the 
limit $„ -+ 5 requires that a n -* i But the harmonic series 

a n = 1 jn shows that we can have<i n o and still the series 
h , 

The comparison test says that if 0 < a n ^ b K then * In 

case a decreasing y(x) agrees with a n at x = n, we can apply 

the ] test. The sum converges if and only if fc 
By this test the /sseries EI/h p converges if and only if p is 
i For the harmonic series (p= 1), s n = 1 F — h 1/n is 
close to the integral f(n) = _m_. 

The n test applies when a n+1 ja„-* L. There is con- 
vergence if o . divergence if p . and no decision if 
q . The same is true for the t test, when (a n ) 1/J * — L. 
For a geometric-p-series combination d n = x J, /n p , the ratio 
a A+ i/a„ equals « . Its limit is L= * so there is con- 

vergence if u . For the exponential e* = Hx n jn\ the limit- 
ing ratio is L = v . This series always w 

because n! grows faster than any x" or n F . 

There is no sharp line between * and r . But if 
Lb„ converges and ajb n -*> L, it follows from the * test 
that In* also converges. 

1 Here is a quick proof that a finite sum 1 F i F A F mm =s 

is impossible. Division by 2 would give £ + ? + £+ — = £s. 
Subtraction would leave 1 F A F A F it ' = Those last two 
series cannot both add to because . 

2 Behind every decimal s = ,abc.„ is a convergent series 

o/IQF fr/lOQ F F By a comparison test prove 

convergence. 

3 From these partial sums s n , find a n and also s = 

1 2n 

(a) s H = 1 - - (b) s n = 4n (c) s n = In 

n it f i 

4 Find the partial sums s n = n, +a 2 + 

(a) d^I/3" -1 (b) a„ = ln — ~ (c) o„ = n 

n F l 

5 Suppose 0<fl n <b fl and converges. What can be 
deduced about !£>*? Give examples. 

6 {a} Suppose b n + c n <a n (all positive) and La„ converges. 
What can you say about Eb n and Lc„? 

(b) Suppose a n <b n + c„ (all positive) and diverges. 
What can you say about Tb A and Ec,,? 


Decide convergence or divergence in 7-10 (and give a reason). 
7 tAu F F 3A73 + ■** 8 lAn F rAx + rh* + " h + 


EXERCISES 


9 F + tAj + tA?+ 

1® iir + TiF + F+ *" 

Establish convergence or divergence in 11-20 by a comparison 
test. 

11 Iv + 10 

12 

13S «+^ 

11 ?■/:, 


16 £;?“(;) 

17 

1. 

19 y 1 

^3"- 2* 

20 y 1 , 

^e n -n e 

For 21-28 find the limit L in 

the ratio test or root test. 

3" 

21 

22 ^ 

-2'Vj 

23 ZlT 


25 


27 t‘) 

28 Z? 


29 (+ — A) + (i — A) + (i — i) ts “ telescoping” because { and i 
cancel —A and — A* Add the infinite telescoping series 

30 Compute the sum s for other “telescoping series”: 

.(HMHMH)- 

(b) In | F In f Fin |f ' 

31 In the integral test, what sum is larger than J" y(x) dx and 
what sum is smaller? Draw a figure to illustrate. 

32 Comparing sums with integrals, find numbers larger and 
smaller than 



F 


1 . , 1 

+ r r and s„ * 1 F - F 

2r\ — 1 8 



33 Which integral test shows that 1 /e" converges? What 

is the sum? 


34 Which integral test shows that nje * converges? What 

is the sum? 
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Decide for or agairat convergence in 35-42* based on J y(x) dx, 

35 y-j- — 36 y-^— 

^n 2 + 1 ^3n + 5 

37 Y 38 Y *— ^is decreasing?^ 


39 Xn'/n* 
41 X^/ti" 


40 y — — 

2 n(ln n)(ln In n) 
42 y n /e " 3 


43 (a) Explain why D„ = ^1 + ^ — In n is positive 

by using rectangles as in Figure 10.2. 

(b) Show that D „ , , is less than D„ by proving that 

1 p'+'dx 
n + ljn ** 

(c) (Calculator) The decreasing D a y s must approach a limit. 
Compute them until they go below *6 and below .58 
(when?). The limit of the D n is Euler's constant y = .577.,.. 

44 In the harmonic series, use s n % .577 + In tt to show that 

s„= 1 + ^ + - needs more than 600 terms to reach s* > 7, 

2 n 

How many terms for s n > 10? 

45 (a) Show that 1 - \ + ^ - i- - = — |-r + + ^-by 

2 3 4 2n n + 1 2n 

adding 2^ + ^ + **■ + ^ to both sides. 

(b) Why is the right side close to In 2n~ln n? Deduce that 
1 — i + + — approaches In 2. 

46 Every second a computer adds a million terms of 
JM/(nlnn). By comparison with j dxf(x In x) t estimate the 
partial sum after a million years (see Question in text). 

1000 j 

47 Estimate J] -= by comparison with an integral, 

ioo n 

48 If converges (all a n > 0) show that converges. 


49 If X a n converges (all a n > 0) show that X sin a A converges. 
How could X sin a R converge when Xa„ diverges? 

50 The nth prime number p„ satisfies pjn In n -* 1. Prove that 

^ 1 1111 1 
^ = 2 + 3 + 5 + 7 + n + ‘" d,VerBeS - 

Construct a series Xa H that converges faster than l>b H but 
slower than Xc* (meaning aJb R -* 0, a R jc n -* co). 

51 b n = l/n\ c n = 1/n 3 52 b n = *(*)", = & 

53 = I/nl, c n = \jn R 54 b a = I/n% c n = 

In Problem 53 use Stirling’s formula ^/litn n n je?n\ -* L 

55 For the series i + i + i + A + A+ show that the 
ratio test fails. The roots (fl,,) 1 ^ do approach a limit L. Find 
L from the even terms a lk = 1/2*. Does the series converge? 

56 (For instructors) If the ratios a n + i (a R approach a positive 
limit L show that the roots (a„) 1/n also approach L 


Decide convergence in 57-66 and name your test. 


57 X 1 
^{In nT 


59 4; 

60 X, * 

^ln ( 10 "} 

61 Xln" + j 
^ n + 1 

62 

63 E (ln (t “ lall!,) 

M (test all p) 

n r 

65 £ 4 .- 2 - 

(test all p, 9 ) 


67 Suppose aJb R -* 0 in the limit comparison test. Prove that 
I a n converges if I b n converges. 

68 Can you invent a series whose convergence you and your 
instructor cannot decide? 


10.3 Convergence Tests: All Series ■■■ 

This section finally allows the numbers a„ to be negative. The geometric series 1 - 
£ + £ — £+•■■=£ is certainly allowed. So is the series jr = 4- | + |-1+---. If we 
change all signs to + , the geometric series would still converge (to the larger sum 2). 
This is the first test, to bring back a positive series by taking the absolute value |aj 
of every term. 

DEFINITION The series l,a„ is “absolutely convergent” if X \a tt \ is convergent. 
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Changing a negative number from a n to |a„| increases the sum. Main point: The 
smaller series Z a n is sure to converge if Z \a n \ converges. 


10G If Z|aJ converges then Z a n converges (absolutely). But Z a n might con- 
verge, as in the series for n y even if Z |aj diverges to infinity. 


EXAMPLE 1 Start with the positive series £ + £ + £ + Change any signs to minus . 
Then the new series converges (absolutely). The right choice of signs will make it 
converge to any number between -1 and 1. 

EXAMPLE 2 Start with the alternating series 1 + + ••• which converges to 

In 2. Change to plus signs. The new series 1+i + iH — diverges to infinity. The 
original alternating series was not absolutely convergent. It was only “ conditionally 
convergent." A series can converge (conditionally) by a careful choice of signs — even 
if Z|flJ = oo. 

If Z \a n \ converges then Z a n converges . Here is a quick proof. The numbers a n + \a n \ 
are either zero (if a n is negative) or 2\a n \. By comparison with Z 2|aJ, which converges, 
Z(a„ + |aj) must converge. Now subtract the convergent series Z \a n \. The difference 
Z a n also converges, completing the proof. All tests for positive series (integral, ratio, 
comparison, ...) apply immediately to absolute convergence, because we switch to 

kl. 

EXAMPLE 3 Start with the geometric series i + £ + tt+ — which converges to 
Change any of those signs to minus. Then the new series must converge (absolutely). 
But the sign changes cannot achieve all sums between - { and \. This time the sums 
belong to the famous (and very thin) Cantor set of Section 3.7. 

EXAMPLE 4 (looking ahead) Suppose Z a n x n converges for a particular number x. 
Then for every x nearer to zero, it converges absolutely. This will be proved and used 
in Section 10.6 on power series, where it is the most important step in the theory. 

EXAMPLE 5 Since Z 1/n 2 converges, so does Z(cosn)/n 2 . That second series has 
irregular signs, but it converges absolutely by comparison with the first series (since 
| cos n\ < 1). Probably Z(tan n)/n 2 does not converge, because the tangent does not 
stay bounded like the cosine. 


ALTERNATING SERIES 

The series 1— i + $ — i + ••• converges to In 2. That was stated without proof. This 
is an example of an alternating series , in which the signs alternate between plus and 
minus. There is the additional property that the absolute values 1, i, ... decrease 
to zero. Those two facts — decrease to zero with alternating signs — guarantee 
convergence. 


10H An alternating series a { - a 2 + a 3 - a 4 — converges (at least condition- 
ally, maybe not absolutely) if every a n4l < a n and a n 0. 


The best proof is in Figure 10.3. Look at a { - a 2 + a 3 . It is below a x , because a 3 (with 
plus sign) is smaller than a 2 (with minus sign). The sum of five terms is less than the 
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+a : 


Fig. 10.3 An alternating series converges when ihe absolute values decrease to zero. 


sum of three terms, because a 5 is smaller than a 4 . These partial sums $1, s 3 , s 5 , ... 
with an odd number of terms are decreasing , 

Now look at two terms a 1 —a 2 ^ then four terms, then six terms. Adding on a 3 - n 4 
increases the sum (because a 3 ^ n 4 ). Similarly s 6 is greater than s 4 (hecause it includes 
a 5 — a 6 which is positive). So the sums s 2 , s 4 , s e* — are increasing. 

The difference between s n _ l and s n is the single number ± a n . It is required by 10H 
to approach zero. Therefore the decreasing sequence s lT s 3 , ... approaches the same 
limit 5 as the increasing sequence s 2 , s 4) .... The series converges to s, which always 
lies between s n - x and s n , 

This plus-minus pattern is special but important. The positive series T*a n may not 
converge. The alternating series is E(— l)" + 1 a„. 

EXAMPLE 6 The alternating series 4-y + y -7 ” is conditionally convergent (to 
tt). The absolute values decrease to zero. Is this series absolutely convergent? No. 
With plus signs, 4(1 + j + 3 H — ) diverges like the harmonic series. 

EXAMPLE 7 The alternating series 1 — 1 + 1 — 1 + **■ is not convergent at all. Which 
requirement in I0H is not met ? The partial sums Si,s 3 j 5 5 ,... all equal 1 and 
s 2 , s 4 ,s 6 , ... all equal 0 — but they don't approach the same limit s. 

MULTIPLYING AND REARRANGING SERIES 

In Section 10.1 we added and subtracted and multiplied series. Certainly addition 
and subtraction are safe. If one series has partial sums s n -► s and the other has partial 
sums t n -* t, then addition gives partial sums s n + t n -► s + But multiplication is 
more dangerous, because the order of the multiplication can make a difference. More 
exactly, the order of terms is important when the series are conditionally convergent. 
For absolutely convergent series, the order makes no difference. We can rearrange 
their terms and multiply them in any order, and the sum and product comes out 
right: 


I i ' r . , ’ 7 7 ; , ■ ■ ■ - r 1 . ■ t V V ■ * " "' V ■ ■ H " " " ' ■ '■ 

: 1QI Suppose oopyt^pabsplateij', If ■ 4 ... is any reordering of the 

I,#*, jn tWoeworderS^atetf converges absolutely. ; 

; i- .■ ; ; ■ ■ ; ;; - : i : '■ i * r ; ■ : . ; ■ ■' ■ : ‘ 

1M - Jloppoaef faj *? ? jtfid pbj, = ; t;cohy^ri;a absolute^, Uteh the infinitely 
.ifianj'terfttfc ti$) in their product? qtcSd ffet ar^arderj to stl 


Rather than proving 101 and 10J, we show what happens when there is only condi- 
tional convergence. Our favorite is + converging conditionally to 
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In 2. By rearranging, it will converge conditionally to anything ! Suppose the desired 
sum is 1000. Take positive terms 1 T- j + ’■* until they pass 1000. Then add negative 
terms — j — i— *** until the subtotal drops below 1000. Then new positive terms 
bring it above 1000, and so on. All terms are eventually used, since at least one new 
term is needed at each step. The limit is s = 1000. 

We also get strange products, when series fail to converge absolutely: 


(' j^^rX 1 v2 + vf) 1 VJi)'"- 


On the left the series converge (conditionally). The alternating terms go to zero. On 
the right the series diverges. Its terms in parentheses don't even approach zero, and 
the product is completely wrong. 

I close by emphasizing that it is absolute convergence that matters. The most 
important series are power series t\ Like the geometric series (with all a n — 1) 
there is absolute convergence over an interval of x's. They give functions of x, which 
is what calculus needs and wants. 

We go next to the series for which is absolutely convergent everywhere. From 
the viewpoint of convergence tests it is too easy — the danger is gone. But from the 
viewpoint of calculus and its applications, e x is unconditionally the best. 


10.3 EXERCISES 


Read-through questions 

The series is absolutely convergent if the series q is 
convergent. Then the original series is also b . But 
the series Za n can converge without converging absolutely. 
That is called c convergence, and the series d is an 
example. 

For alternating series, the sign of each a n + 1 is • to the 
sign of a n . With the extra conditions that f and p , 
the series converges (at least conditionally). The partial sums 
Sj,s 3 , ... are h and the partial sums s 2 , s 4 , . .. are I 
The difference between s n and s n _i is \ Therefore the 
two series converge to the same number s. An alternating 
series that converges absolutely [conditionally] (not at all) is 

* [_!_] ( m l With absolute [conditional] con- 

vergence a reordering (can or cannot?) change the sum. 


13 Suppose converges absolutely. Explain why keeping 
the positive a ' s gives another convergent series. 

\4 Can converge absolutely if all a„ are negative? 

15 Show that the alternating series 1 — — i + i — i + 

does not converge, by computing the partial sums s 2 , .... 

Which requirement of 10H is not met? 

16 Show that $ — 1 + 4~5+'" does not converge. Which 
requirement of 10H is not met? 

17 (a) For an alternating series with terms decreasing to zero, 
why does the sum s always lie between s„ _ ! and s„? 

(b) Is s — s n positive or negative if s„ stops at a positive a„? 

18 Use Problem 17 to give a bound on the difference between 
s 5 = 1 — { + $ — £ -f i and the sum 5 = In 2 of the infinite 
series. 


Do the series 1-12 converge absolutely or conditionally ? 


i K-ir 1 


n + 3 




5 x(-ir +, 3v^/( n +i) 


7 £(-ir +l ln 
ii £(-ir + v / " 


2 j;(-ir-v>/»+3 

4K-.r'i 

6 X(-l)" +1 sin 2 n 

8I(-D“ +1 — 

^ n 

io x(-ir i 2 i/ * 

12 £(-ir +J (l -n 1 ^) 


19 Find the sum 1 — — + rr — — + *•• = s. The partial sum s 4 

2! 3! 4! 

is (above 5)(below s) by less than . 

20 Give a bound on the difference between s 10( > = 

_ 111 n 2 . 

21 Starling from ry + + -^ + ” - — , with plus signs, 

12 3 6 

show that the alternating series in Problem 20 has s — n 2 j 12. 

22 Does the alternating series in 20 or the positive series in 
21 give n 2 more quickly? Compare 1/101 2 — 1/102 2 4 — with 
1/101 2 + 1/102 2 + --. 





10.3 Convergence Tetls: All Series 


385 


23 If Za n does not converge show that LIjJ does not 34 Verify the Schwarz inequality {La H h n ) 3 if 

converge. a A = (if and b H = 


24 Find conditions which guarantee that a x + a 2 — a 3 + 
(** + a s — a£+ - will converge (negative term follows two 
positive terms). 

25 If the terms of In 2 = 1 — i + 4 — 4 + "" are rearranged into 
1 — { — i + i — i — -J-H show that this series now adds to 
i In 2. (Combine each positive term with the following nega- 
tive term.) 

26 Show that the series 1 + j i + i + ^ — J H — converges 
to | In 2. 

27 What is the sum of 1 + ^ i + | — J + £+*■■? 

28 Combine ! + *■■ + -— In n->y and 1 — £ + ^ ► In 2 

n 

to prove i+i + | — i-J-i + "* = hi2. 

29 (a) Prove that this alternating series converges: 


35 Under what condition does X(a n+1 — a n ) converge and 
what is its sum? 

36 For a conditionally convergent series, explain how the 
terms could be rearranged so that the sum is + oo. All terms 
must eventually be included, even negative terms. 

37 Describe the terms in the product (1 + i + i+ ’*■)(! + i + 
£+*■■) and find their sum. 

38 True or false : 

(a) Every alternating series converges. 

(b) Za n converges conditionally if Z\a H \ diverges. 

(c) A convergent series with positive terms is absolutely 
convergent. 

(d) If Za„ and Zb n both converge, so does Z(a„ + b n ). 


C 2 dx l _ m 
! X + 2 3 


*dx 1 
2 * + 3 



39 Every number x between 0 and 2 equals 1 +i + ■■■ 
with suitable terms deleted. Why? 


(b) Show that its sum is Euler's constant y, 

30 Prove that this series converges. Its sum is k/ 2. 



31 The cosine of B- 1 radian is 1 — r + — ■ . Compute 

21 4! 

cos 1 to five correct decimals (how many terms?). 

It* 7t 5 

32 The sine of 6 = n radians is n — — + . Compute 

sin it to eight correct decimals (how many terms?). 

33 If Zal and Zb 2 are convergent show that I is abso- 
lutely convergent. 

Hint (a ± b) 1 > 0 yields 2 \ab\ < a 2 + b 2 > 


40 Every number s between —1 and 1 equals ±%±i±i± — 
with a suitable choice of signs. (Add 1 = \ J J to get 
Problem 39.) Which signs give s = — 1 and s = 0 and s — ^? 

41 Show that no choice of signs will make ± i ± £ ± ff ± *" 
equal to zero. 

42 The sums in Problem 41 form a Cantor set centered at 
zero. What is the smallest positive number in the set? Choose 
signs to show that £ is in the set. 

*43 Show that the tangent of 0 = t(tt- l) is sin 1/(1 - cos 1). 
This is the imaginary part of s= — ln( 1 — e l ). From 
s = Ze Itf /n deduce the remarkable sum I (sin n)fn = {{n — 1). 

44 Suppose Za n converges and |x|<l. Show that Za„x * 
converges absolutely. 


10.4 The Taylor Series for e* sin x, and cos x 


This section goes back from numbers to functions. Instead of Za n = s it deals with 
Za n x" =/{*). The sum is a function of x. The geometric series has all a n - 1 (including 
a 0f the constant term) and its sum is/{x) = 1/(1 - xf The derivatives of 1 + x + x 1 + ■■■ 
match the derivatives of /. Now we choose the a n differently, to match a different 
function. 

The new function is e*. All its derivatives are e*. At x — 0, this function and its 
derivatives equal U To match these Us, we move factorials into the denominators. 
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Term by term the series is 




X X 
+ ~ + 
1 ! 2 ! 



(i) 


*7n! has the correct nth derivative (— 1). From the derivatives at x = 0, we have buih 
back the function'. At x = 1 the right side is 1 + 1 + i + £ + — and the left side is e “ 
2.71828.... At x- - 1 the series gives l~l + £ — £ + which is e -1 . 

The same term-by-term idea works for differential equations, as follows. 


EXAMPLE 1 Solve dy/dx — — y starting from y — 1 at x “ 0. 

Solution The zeroth derivative atx-Ois the function itself: y — 1. Then the equation 
y' “ — y gives / = — 1 and y* = — / = + 1. The alternating derivatives 
1, — 1, l t - 1, ... are matched by the alternating series for e~ x : 

y — 1 - x + yx 2 - £x 3 H — = e~* (the correct solution to f - - y). 


EXAMPLE 2 Solve d 2 y/dx 2 — — y starting from y — 1 and / = 0 (the answer is cos x). 


Solution The equation gives y” — — 1 (again at x — 0). The derivative of the equation 
gives y m = - y f = 0. Then y*" = — y" = + 1. The even derivatives are alternately + 1 
and -1, the odd derivatives are zero. This is matched by a series of even powers, 
which constructs cos x: 


« 1 2 1 4 1 * 

+ 4!* "S* + "■“ C0!x - 


The first terms 1 — £x 2 came earlier in the book. Now we have the whole alternating 
series. It converges absolutely for all x, by comparison with the series for e 1 (odd 
powers are dropped). The partial sums in Figure 10.4 reach further and further before 
they lose touch with cos x. 



HQ. 10.4 The partial sums 1 — x J /2 + x 4 /24 — ••• of the cosine series. 

If we wanted plus signs instead of plus-minus, we could average e 1 and e ~ x . The 
differential equation for cosh x is d 2 y/dx 2 = + y, to give plus signs: 

e 1 + e~ x )= 1 + ~x 2 + ^x 4 + ^x 6 + — (which is cosh x). 

TAYLOR SERIES 

The idea of matching derivatives by powers is becoming central to this chapter. The 
derivatives are given at a basepoint (say x = 0). They are numbers /(0),/'(0), .... The 
derivative / <n, (0) will be the nth derivative of a„x\ if we choose a„ to be /‘"’(Oj/n! 
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Then the series has the same derivatives at the basepoint as /(x): 


10K The Taylor series that matches /(x) and all its derivatives at x = 0 is 

m +f'(0)x + -f"(0)x 2 + 0)x 3 + - = I -4 i ^- 

2 o B =o n: 


The first terms give the linear and quadratic approximations that we know well. The 
x 3 term was mentioned earlier (but not used). Now we have all the terms — an “infinite 
approximation” that is intended to equal /(x). 

Two things are needed. First, the series must converge. Second, the function must 
do what the series predicts, away from x = 0. Those are true for e* and cos x and 
sin x; the series equals the function. We proceed on that basis. 

The Taylor series with special basepoint x = 0 is also called the “Maclaurin series .” 

EXAMPLE 3 Find the Taylor series for f(x) = sin x around x = 0. 

Solution The numbers /‘"’(G) are the values of/= sin x,f = cos x,f" = - sin x, ... 

at x = 0. Those values are 0, 1, 0, -1, 0, I AH even derivatives are zero. To find 

the coefficients in the Taylor series, divide by the factorials: 

sin x-x- &x 3 + rajx s . (2) 


EXAMPLE 4 Find the Taylor series for /(x) = (1 + x) s around x = 0. 


Solution This function starts at/(0) = 1, Its derivative is 5(1 + x) 4 , so/'(0) = 5. The 
second derivative is 5'4*(l+x) 3 , so /"(0) = 5-4, The next three derivatives are 
5 ■ 4 • 3, 5 ■ 4 • 3 • 2, 5 • 4 ■ 3 • 2 ■ 1. After that all derivatives are zero. Therefore the Taylor 
series stops after the x 5 term: 


5-4 , 5-4'3 , 

1 + 5x+ — x 2 + — — x 3 + 
2! 3! 


5 * 4 * 3 * 2 . 5-4-3-2-I , 

x 4 + x s . 


4! 


5! 


( 3 ) 


You may recognize 1, 5, 10, 10, 5, 1. They are the binomial coefficients, which appear 
in Pascal’s triangle (Section 2.2), By matching derivatives, we see why 0!, II, 2!, ... are 
needed in the denominators, 


There is no doubt that x = 0 is the nicest basepoint. But Taylor series can be con- 
structed around other points x = a. The principle is the same — match derivatives by 
powers — but now the powers to use are (x - a)". The derivatives / <n, (ti) are computed 
at the new basepoint x = a. 

The Taylor series begins with f(a) +/'(fl)(x — a). This is the tangent approximation 
at x = a. The whole “infinite approximation” is centered at a — at that point it has 
the same derivatives as /(x), 


10L The Taylor series for /(x) around the basepoint x = a is 

1 ® fin) („\ 

J\x) = f(a) +f'(a)(x ~ a) + r/"(a)(x -a) 2 + = £ J —~ (x - a)\ (4) 

2 n = 0 Hi 


EXAMPLE 5 Find the Taylor series for /(x) = ( 1 + x) 5 around x = a = 1 . 

Solution At x = 1, the function is (1 + l) 3 = 32. Its first derivative 5(1 4 x} 4 is 
5- 16 = 80. We compute the nth derivative, divide by nl, and multiply by (x - 1)": 

32 + 80(x - 1) + 80(x - l) 2 + 40{x - l} 3 + 10(x - l) 4 + (x - l) s . 


(5) 
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That Taylor series (which stops at n — 5) should agree with (1 + x) 5 . It does. We could 
rewrite 1 + x as 2 + (x - 1), and take its fifth power directly. Then 32, 16, 8, 4, 2, I will 
multiply the usual coefficients 1, 5,10,10,5,1 to give our Taylor coefficients 
32, 80, 80, 40, 10, I. The series stops as it will stop for any polynomial — because the 
high derivatives are zero. 

EXAMPLE 6 Find the Taylor series for /(*) — e* around the basepoint x = 1. 

Solution At x = 1 the function and all its derivatives equal e. Therefore the 
Taylor series has that constant factor (note the powers of x - 1, not x): 

e* = e+e(x-l)+l(x- l) 2 + |(x - l) 3 + (6) 

DEFINING THE FUNCTION BY ITS SERIES 

Usually, we define sinx and cosx from the sides of a triangle. But we could start 
instead with the series. Define sin x by equation (2). The logic goes backward, but it 
is still correct: 

First, prove that the series converges. 

Second, prove properties like (sin x)' = cos x. 

Third, connect the definitions by series to the sides of a triangle. 

We don't plan to do all this. The usual definition was good enough. But note first: 
There is no problem with convergence. The series for sin x and cos x and e* all have 
terms ± x n j n\. The factorials make the series converge for all x. The general rule for 
e* times e 31 can be based on the series. Equation (6) is typical: e is multiplied by 
powers of {x - 1). Those powers add to e*~ l . So the series proves that e x = ee x ~ 
That is just one example of the multiplication (e*)(e v ) = 

(l + x+y + ...^l + j>+^ + ...^-l+x+y+ j + xy+^ + .„. {7) 

Term by term, multiplication gives the series for e x + y . Term by term, differentiating 
the series for e* gives Term by term, the derivative of sin x is cos x: 



We don’t need the famous limit (sin x)/x — ► 1, by which geometry gave us the deriva- 
tive. The identities of trigonometry become identities of infinite series. We could even 
define n as the first positive x at which x - £x 3 + equals zero. But it is certainly 
not obvious that this sine series returns to zero — much less that the point of return 
is near 3.14, 

The function that will be defined by infinite series is e 10 . This is the exponential of 
the imaginary number i6 (a multiple of i — — 1). The result e tB is a complex number , 

and our goal is to identify it. (We will be confirming Section 9.4.) The technique is to 
treat i6 like all other numbers, real or complex, and simply put it into the series: 

DEFINITION e ie is the sum of 1 + (ifl) + + ~(i0) 3 + (9) 

Now use i 2 = - 1. The even powers are i 4 = + I, i 6 = - 1, i® = + 1, .... We are 
just multiplying - 1 by - 1 to get I. The odd powers are i 3 = - i, t 5 = + i, .... There- 
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fore e ie splits into a real part (with no i’s) and an imaginary part (multiplying i): 

«*- (' - ^ + S® 4 - -) + ‘(® 4' 4’ ' -)• <‘°> 


y = r sin 8 



You recognize those series. They are cos 0 and sin 0. Therefore: 

Euler's formula is e w = cos 0 + i sin 0 . Note that e 2 * 1 — L 

The real part is x — cos 0 and the imaginary part is y = sin 0. Those coordinates pick 
out the point e iB in the “complex plane.” Its distance from the origin (0, 0) is r = 1, 
because (cos 0) 2 + (sin 0) 2 — 1. Its angle is 0, as shown in Figure 10,5. The number 
“1 is e in , at the distance r— 1 and the angle n. It is on the real axis to the left of 
zero. If e* 9 is multiplied by r = 2 or r = i or any r 5 * 0, the result is a complex number 
at a distance r from the origin: 

Complex numbers : re i6 - r { cos 0 + i sin 0) = r cos 0 4- ir sin 0 = x 4- j>. 

With e* e , a negative number has a logarithm. The logarithm of -l is imaginary 
(it is rir, since e in ” — 1). A negative number also has fractional powers. The fourth 
root of —1 is (— 1) 1/4 = e 1 * 14 . More important for calculus: The derivative of x 5/4 is 
$x 1/4 . That sounds old and familiar, but at x = - 1 it was never allowed. 

Complex numbers tie up the loose ends left by the limitations of real numbers . 


The formula e iB = cos 0 + i sin 0 has been called “one of the greatest mysteries of 
undergraduate mathematics.” Writers have used desperate methods to avoid infinite 
series. That proof in (10) may be the clearest (I remember sending it to a prisoner 
studying calculus) but here is a way to start from d/dx(e iX ) = ie? x . 

A different proof of Euler's formula Any complex number is e iX = r{ cos 0 4- i sin 0) 
for some r and 0. Take the x derivative of both sides, and substitute for ie tx : 


(cos 0 + j sin 0]drjdx 4- r^-sin 0 + i cos 0)d9/dx — ir { cos 0 + i sin 0). 

Comparing the real parts and also the imaginary parts, we need drjdx = 0 and 
d9/dx — 1. The starting values r — 1 and 0 “0 are known from e i0 = 1. Therefore r is 
always 1 and 0 is x. Substituting into the first sentence of the proof, we have Euler’s 
formula e iB - l(cos 0 4- i sin 0). 


10.4 EXERCISES 


Read-through questions 

The o series is chosen to match f[x) and all its b 
at the basepoint. Around x = 0 the series begins with 
/(0) 4 e x 4 d x 1 . The coefficient of x h is For 

f(x) ~ e x this series is | For f{x) = cos x the series is 
q . For /(x) — sin x the series is h . If the signs were 
all positive in those series, the functions would be cosh _x and 
i Addition gives cosh x + sinh x — 1 . 

In the Taylor series for f{x) around x = a, the coefficient of 
(x — af is b K = k . Then b„{x — a) 71 has the same [ as 
/ at the basepoint. In the example f(x) = x 2 , the Taylor coeffi- 
cients are b 0 = m , hi = n . b 2 = ° . The series 
b 0 4 b t {x — a) 4 h 2 (x — a) 2 agrees with the original P . 
The series for e 1 around x = a has b K = Q . Then the 
Taylor series reproduces the identity e x = ( * H » ) . 


We define e x y sin x, cos x, and also e l 6 by their series. The 
derivative d/dx(l +x + £x z 4- ■**) = 1 4x4 * <i translates to 
t . The derivative of 1 — £x 2 4 is u . Using i 2 = 
- 1 the series 1 + *0 4 i(*0) 2 -I — splits into e* = v . Its 
square gives e 2lg = » . Its reciprocal is e~ t$ = * . 

Multiplying by r gives re tB = v 4 i 1 . which connects 

the polar and rectangular forms of a A number. The 
logarithm of e te is p 


1 Write down the series for e 2x and compute all derivatives 
at x = 0. Give a series of numbers that adds to e 2 . 

2 Write down the series for sin 2x and check the third 
derivative at x = 0. Give a series of numbers that adds to 
sin 2n = 0. 
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In 3-8 find the derivatives of /(x) at x = 0 and the Taylor series 
(powers of x) with those derivatives. 

3/M = e* 4/(x)=I/(l+x) 

5 f(x)= 1/(1 —2x) 6 /(x) = cosh x 

7 f[x) = In (1 — x) 8/{x) = in(l + *) 

Problems 9-14 solve differential equations by series. 

9 From the equation dyldx = y — 2 find all the derivatives 

of y at x-0 starting from >{0) = h Construct the infinite 

series for y, identify it as a known function, and verify that 
the function satisfies y* = y — 2. 

10 Differentiate the equation y' = cy + s {c and s constant) 
to find all derivatives of y at x = 0. If the starting value is 
y 0 = 0, construct the Taylor series for y and identify it with 
the solution of y f = cy + s in Section 6,3. 

11 Find the infinite series that solves / = — y starting from 
y = 0 and /= 1 at x = 0. 

12 Find the infinite series that solves / = y starling from y = 

1 at x = 3 (use powers of x-3), Identify y as a known 
function. 

13 Find the infinite series (powers of x) that solves y" = 
2 y' — y starting from y = 0 and y r = 1 at x = 0. 

14 Solve y" = y by a series with y = 1 and / = 0 at x = 0 and 
identify y as a known function. 

15 Find the Taylor series for f{x) = (1 + x) z around x = a = 
0 and around x = a — 1 (powers of x — 1). Check that both 
series add to (1 + x) 2 . 

16 Find all derivatives o f/(x) = x 3 at x = a and write out the 
Taylor series around that point. Verify that it adds to x 3 , 

17 What is the series for (1 -x) 5 with basepoint a = 1? 

18 Write down the Taylor series for / ^ cos x around x = 2n 
and also for / = cos (x — 2n) around x = 0. 

In 19-24 compute the derivatives of / and its Taylor series 
around x=\ r 

19/(x)=l/x 20/(x) = l/(2-x) 

21 /(x) = In x 22 f(x ) = x 4 

23 f(x) = e' x 24 f(x) = e lx 

In 25-33 write down the first three nonzero terms of the Taylor 
series around x = 0, from the series for cos x, and sin x. 


25 xe ** 

26 cos y/x 

27 (I — cos x)/x : 


29 p — dx 

30 sin x 2 

X 

Jo X 


31 e ** 

32 ij* = e* lnb 

33 e* cos x 


*34 For x <0 the derivative of x" is still nx" 

d d , 41x1 

dx ^ dx ( = le 

What is 4|x|/4x? Rewrite this answer as nx" -1 , 

35 Why doesn’t f{x) = *fx have a Taylor series around x = 
0? Find the first two terms around x = 1. 

36 Find the Taylor series for 2 X around x = 0. 


In 37-44 find the first three terms of the Taylor series around 

x = 0. 

38 f[x) = sin _1 x 


37 /(x) = tan -1 x 

39 f(x) = tan x 

41 f{x) = c ,lni 

43 /(x) = cos 2 x 44 /(x) = sec 2 x 

45 From e^ = cos 0 + i sin 0 and (T^costf-isintf, add 
and subtract to find cos 9 and sin 6 . 


40/(x) = ln(cos x) 
42 f(x) = tanh -1 x 


46 Does (e i& ) 2 equal cos 2 0 + i sin 2 0 or cos 9 2 4- i sin 9 1 ? 

47 Find the real and imaginary parts and the 99th power of 
e l * t e iltf4 , and 

48 The three cube roots of 1 are I, e 2 *^ 3 , e 4ft,73 + 

(a) Find the real and imaginary parts of e lKif3 . 

(b) Explain why (e 2 *‘ /3 ) 3 = 1. 

(c) Check this statement in rectangular coordinates. 

49 The cube roots of -1 =e iK are e** /3 and _____ and 
. Find their sum and their product, 

50 Find the squares of 2e iKf 3 = 1 4- ^/3 i and 4e‘*' 4 — 
2^/2 + (2^/2 in both polar and rectangular coordinates, 

51 Multiply e ts = cos s -F i sin s times e ti = cos t + i sin t to 
find formulas for cos(s 4- f) and sin(s + £)< 

52 Multiply e is times e~ u to find formulas for cos(s — f) and 
sin(s — t). 

53 Find the logarithm of i. Then find another logarithm of i. 
(What can you add to the exponent of without changing 
the result?) 

54 (Proof that e is irrational) If e = p\q then 


N = p\ 



1 - 


i I_ 
1! + 2! 



would be an integer, (Why?) The number in brackets— the 
distance from the alternating series to its sum Ije — is less 
than the last term which is 1 jp\ Deduce that \N\ < 1 and reach 
a contradiction, which proves that e cannot equal p/q. 

55 Solve dyjdx = y by infinite series starting from y = 2 at 
x — 0. 
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10.5 Power Series 

This section studies the properties of a power series. When the basepoint is zero, the 
powers are x n . The series is Ha n x n . When the basepoint is x~a t the powers are 
(x - a) n > We want to know when and where (and how quickly) the series converges 
to the underlying function. For e* and cosx and sin x there is convergence for all 
x—but that is certainly not true for 1/(1 -x). The convergence is best when the 
function is smooth. 

First I emphasize that power series are not the only series. For many applications 
they are not the best choice. An alternative is a sum of sines, /(x) = E6 n sin nx. That 
is a iL Fourier sine series”, which treats all x’s equally instead of picking on a basepoint. 
A Fourier series allows jumps and corners in the graph— it takes the rough with the 
smooth. By contrast a power series is terrific near its basepoint, and gets worse as 
you move away. The Taylor coefficients a n are totally determined at the base - 
point — where all derivatives are computed. Remember the rule for Taylor series: 

a„ — {mb derivative at the basepoint)/n! =f {n \a)/n\ (I) 

A remarkable fact is the convergence in a symmetric interval around x = a. 


10M A power series Ha n x n either converges for all x, or it converges only at 
the basepoint x = 0, or else it has a radius of convergence r : 

X a n x n converges absolutely if |x| < r and diverges if |x| > r. 


The series Sx fl /n! converges for all x {the sum is e*). The series In!x fl converges for 
no x (except x = 0). The geometric series Ex' 1 converges absolutely for jx| < 1 and 
diverges for jx| > 1. Its radius of convergence is r — 1. Note that its sum 1/(1 — x) is 
perfectly good for |x|> 1 — the function is all right but the series has given up. If 
something goes wrong at the distance r, a power series can't get past that point. 

When the basepoint is x = a , the interval of convergence shifts over to |x - a\ < r. 
The series converges for x between a — r and a + r (symmetric around a )♦ Wc cannot 
say in advance whether the endpoints a ± r give divergence or convergence (absolute 
or conditional). Inside the interval, an easy comparison test will now prove con- 
vergence. 

PROOF OF 10M Suppose Za„X n converges at a particular point X . The proof will 
show that Za n x n converges when [x] is less than the number \X\. Thus convergence 
at X gives convergence at all closer points x (I mean closer to the basepoint 0). Proof: 
Since £ a n X n converges, its terms approach zero. Eventually < 1 and then 

\a„x n \ = \a n X n \\x/Xr<mr 

Our series La n x n is absolutely convergent by comparison with the geometric series 
for \x/X\, which converges since | xjX\ < L 

EXAMPLE 1 The series Tnx n (4 n has radius of convergence r = 4. 

The ratio test and root test are best for power series. The ratios between terms 
approach x/4 (and so does the nth root of nx 74*): 

in + l)x rt+1 jnx rt x n + 1 . x 

4**1 / ~jT = 4 — approaches L = ~. 

The ratio test gives convergence if L < 1, which means |x| < 4. 
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EXAMPLE 2 


The sine series x — 



— ■" has r — oo (it converges everywhere). 


The ratio of x n + z /{n -f 2)! to x n jn\ is x z /(n + 2)(n -F 1). This approaches L = 0. 


EXAMPLE 3 The series £(x - 5)"/n z has radius r = 1 around its basepoint a= 5. 

The ratios between terms approach L = x — 5. (The fractions n 2 /(n + l) 2 go toward 
1 .) There is absolute convergence if |x - 5| < 1 . This is the interval 4 < x < 6, symmet- 
ric around the basepoint. This series happens to converge at the endpoints 4 and 6, 
because of the factor 1 jn 2 . That factor decides the delicate question — convergence at 
the endpoints — but all powers of n give the same interval of convergence 4 < x < 6. 


CONVERGENCE TO THE FUNCTION; REMAINDER TERM AND RADIUS r 


Remember that a Taylor series starts with a function /(*)♦ The derivatives at the 
basepoint produce the series. Suppose the series converges: Does it converge to 
the function ? This is a question about the remainder R n (x) =f(x) — s„(x), which is the 
difference between /and the partial sum s n = a 0 + — + 0„(x - of. The remainder R ft 
is the error if we stop the series , ending with the nth derivative term a„(x — a ) n . 


ION Suppose / has an (n + l)st derivative from the basepoint a out to x, Then 
for some point c in between (position not known) the remainder at x equals 

*u*) =/m - *<*) =r +n (c)(* - «r> + 1)! (2) 


The error in stopping at the nth derivative is controlled by the (n T l)st derivative. 

You will guess, correctly, that the unknown point c comes from the Mean Value 
Theorem. For n= 1 the proof is at the end of Section 3.8. That was the error e(x) in 
linear approximation: 

R i(x) =/(x) -f(a) -f(a)(x - a) = jf"(c)(x - a} 2 . 

For every n T the proof compares R n with (x - a) ri + l . Their (h + l)st derivatives are 
/ (fl + u and (n + 1)! The generalized Mean Value Theorem says that the ratio of K, r to 
(x — a) fl + l equals the ratio of those derivatives, at the right point c. That is equation 
(2). The details can stay in Section 3.8 and Problem 23. because the main point is 
what we want. The error is exactly like the next term /x - af +i y except that the 
(n + l)st derivative is at c instead of the basepoint a . 


EXAMPLE 4 When /is e x , the (n + I)st derivative is e x . Therefore the error is 


/ x rt \ x fl + 

l n = e* — [ 1 + x + — + — r I = e c - 

" l nlj (n + 




(3) 


At x “ 1 and n = 2, the error is e- (1 + 1 + ,218. The right side is e c l6. The 

unknown point is c = In (.218 - 6) — .27. Thus c lies between the basepoint a = 0 and 
the error point x= 1, as required. The series converges to the function, because 

R n -+ 0 . 

In practice, n is the number of derivatives to be calculated. We may aim for an 
error ]RJ below 10 -6 , Unfortunately, the high derivative in formula (2) is awkward 
to estimate (except fore*). And high derivatives in formula (1) are difficult to compute. 
Most real calculations use only a few terms of a Taylor series. For more accuracy we 
move the basepoint closer, or switch to another series. 
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There is a direct connection between the function and the convergence radius r, 
A hint came for f[x) = 1/(1 — x). The function blows up at x = 1 — which also ends 
the convergence interval for the series* Another hint comes for /= 1 jx 7 if we expand 
around x = a = 1: 

- = 1 - 7 ? — ; = 1 + a - X) + (1 - xf + (4) 

This geometric series converges for |1 - x\ < 1* Convergence stops at the end point 
x — 0— exactly where 1 jx blows up* The failure of the function stops the convergence 
of the series . But note that 1/(1 + x 2 ), which never seems to fail, also has convergence 
radius r = 1 : 


1/(1 + x 1 )— 1 - x 2 + x 4 - x 6 H — converges only for |x| < 1* 

When you see the reason, you will know why r is a “radius.” There is a circle, and 
the function fails at the edge of the circle* The circle contains complex numbers as 
well as real numbers* The imaginary points i and - i are at the edge of the circle* 
The function fails at those points because 1/(1 + i 2 ) = oo. 

Complex numbers are pulling the strings, out of sight. The circle of convergence 
reaches out to the nearest “singularity” of /{*), real or imaginary or complex* For 
1/(1 + x 2 ), the singularities at i and —i make r = 1* If we expand around a = 3, the 
distance to i and — i is r = v /To* If we change to In (1 + x), which blows up at 
x = - 1, the radius of convergence of x - ^ x 2 + *x 3 is r= 1* 


1/(1 + i 2 ) = « 




fig. 10.6 Convergence radius r is distance from basepoint a to nearest singularity. 


THE BINOMIAL SERIES 

We close this chapter with one more series* It is the T&ylor series for (1 + x)*\ around 
the basepoint x = 0. A typical power is p — where we want the terms in 

v /l + x=l + ix + a 2 x 2 +"-. 

The slow way is to square both sides, which gives 1 + x + (2 a 2 + i)* 2 on the right. 
Since 1 + x is on the left, a 2 = - i is needed to remove the x 1 term. Eventually a 3 
can be found. The fast way is to match the derivatives of/={l — x) 1/2 : 

r = 4u + *r 1,2 r=(i)(- m + r = tix- ix- so + *r s/2 . 
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At x — 0 those derivatives are f . Dividing by 1!, 2!, 3! gives 



These are the binomial coefficients when the power is p-j. 

Notice the difference from the binomials in Chapter 2. For those, the power p was 
a positive integer. The series (1 + x) 2 = 1 + 2x + x 2 stopped at x 2 . The coefficients for 
p = 2 were 1,2, 1, 0, 0, 0, .... For fractional p or negative p those later coefficients are 
not zero, and we find them from the derivatives of (1 + xY : 

(i + xy rti + jr 1 rtp-i)(i+xr 2 s^=jip-\y-{p-n^\){\^xY-\ 
Dividing by Of, If, 2!, n\ at x = 0, the binomial coefficients are 

, . P(P- 1) /’■'(O) p(p- l)-(p-«+l) 

' p - — n! • (5) 


For p = n that last binomial coefficient i$ nl/nl = 1. It gives the final x n at the end of 
(1 + x ) n . For other values of p, the binomial series never stops. // converges for |x| < 1: 


(l+x) p = 1 + px + ~ P ^ —x 2 + 


V P(P~ l)-(p~n+ 1) 
» - o n! 


( 6 ) 


When p = 1, 2, 3, ... the binomial coefficient p\jn\(n — p)l counts the number of ways 
to select a group of n friends out of a group of p friends . If you have 20 friends, you 
can choose 2 of them in (20)(19)/2 - 190 ways. 

Suppose p is not a positive integer. What goes wrong with (1 + x)* 7 , to stop the 
convergence at |x| = 1? The fa ilure is at x - -1. If p is negative, (1 + xY blows up. 
If p is positive, as in y/\ + x, the higher derivatives blow up. Only for a positive 
integer p = n does the convergence radius move out to r = oo. In that case the series 
for (1 + xf stops at x", and/ never fails. 

A power series is a function in a new form. It is not a simple form, but sometimes 
it is the only form. To compute / we have to sum the series. To square/ we have to 
multiply series. But the operations of calculus — derivative and integral — are easier. 
That explains why power series help to solve differential equations, which are a rich 
source of new functions. (Numerically the series are not always so good.) I should 
have said that the derivative and integral are easy for each separate term a n x n — and 
fortunately the convergence radius of the whole series is not changed. 


If f{x) — 'La fl x n has convergence radius r, so do its derivative and its integral: 
dfjdx = E/i^x*’ 1 and f f(x)dx - E< 2 „x" + 1 /{n + 1) also converge for |x| < r. 

EXAMPLES The scries for 1/(1 — x) and its derivative 1/(1 - x) 2 and its integral 
- ln(l - x) all have r = 1 (because they all have trouble at x — 1). The series are Ex* 
and Enx"‘ l and Ex" + l /(n+ 1). 


EXAMPLE 6 We can integrate e* 1 (previously impossible) by integrating every term 
in its series: 




1 +x 2 + — x* + 

2 ! 




This always converges (r= oo). The derivative of e x 2 was never a problem. 
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10.5 EXERCISES 


Read-through questions 

If |x| < \X\ and ^a„X" converges, then the series La fl x" also 
There is convergence in a b interval around the 
c For E(2x) n the convergence radius is r = d For 
Lx"/ n - the radius is r = e For £(x — 3) n there is con- 
vergence for \x — 31 < t Then x is between a and 

h 

Starting with /(x), its Taylor series 'La n x* has a„ = i 

With basepoint a, the coefficient of (x — af is \ The 

error after the x" term is called the it R.fxf. It is equal to 
* where the unknown point c is between m . Thus 
the error is controlled by the n derivative. 

The circle of convergence reaches out to the first point 
where f(x) fails. For /=4/(2 — x), that point is x = o 
Around the basepoint a = 5, the convergence radius would be 
r = p For sin x and cos x the radius is r — g . 

The series for X + x is the r series with p Its 
coefficients are a„ = & Its convergence radius is t 

Its square is the very short series 1 + x. 

In 1-6 find the Taylor series for f(x) around x = 0 and its radius 
of convergence r. At what point do es/(x) blow up? 

1 /(x) = 1/(1 - 4x) 2 f(x) = 1/(1 - 4.x 2 ) 

3 f(x) = e x ~* 4 f(x) = tan x (through x 2 ) 

5 f{x) = ln(<? + x) 6 f(x) = 1/(1 + 4x 2 ) 


15 From the series for (1 — cos x)fx 2 find the limit as x -►O 
faster than rHopital’s rule. 

16 Construct a power series that converges for 0 < x < In* 


17-24 are about remainders and 25-36 are about binomials. 


17 If the cosine series stops before x e /8! show from (2) that 
the remainder R 7 is less than x 0 /8! Does this also follow 
because the series is alternating? 

18 If the sine series around x = 2n stops after the terms in 
problem 10, estimate the remainder from equation (2). 

19 Estimate by (2) the remainder R n = x * + 1 + x n+2 + in 
the geometric series. Then compute R n exactly and find the 
unknown point c for n = 2 and x = 

20 For — ln(l — x) = x + i* 2 + ix J + Jt 3> use equation (2) to 

show that at x = 

21 Find R n in Problem 20 and show that the series converges 
to the function at x = j (prove that -* 0), 


22 By estimating R„ prove that the Taylor series for e x around 
x = 1 converges to e* as n -► oo. 

23 (Proof of the remainder formula when n = 2) 

(a) At x = a find ^21 ^2* Ri ’■ 

(b) At x = a evaluate gfx) = {x — a) 3 and g\ g" f g ,f \ 


(c) What rule gives 


RtW-RAa) 

g{x)-g{a} 


Riic i) 0 
g'{Cl) ‘ 


Find the interval of convergence and the function in 7-10, 
7c -C 


1 /w=l 




9 /(*)=!— 7(x-ar + ' 

0 n + 1 

10 /M = (a- - 2 71) - | X 


11 Write down the Taylor series for (e* — l)/x, based on the 
series for e x . At x — 0 the function is 0/0, Evaluate the series 
at x — 0. Check by THopitaFs Rule on ( e x — l)/x. 

12 Write down the Taylor series for xe x around x = 0. Inte- 
grate and substitute x = 1 to find the sum of l/n!(n + 2). 

13 If /(x) is an even function, so /(— x) = /(x), what can you 
say about its Taylor coefficients in/= 'La n x Ii 2 


14 Puzzle out the sums of the following series: 
(aJjc + jc 2 — x 3 -t-x 4 + x 5 — * 6 + ■■■ 


(c> (jc- l)-i(x- + l) 3 - ■■■ 


(d) In 


~ R-2( a ) 

g'(c l )-g\a) 


R&2) 

g”i.c 2 ) 


and 


^2(^2) — Rifa ) ^2 M 


where are c v and c 2 and c? 


f’ic) 

(e) Combine (a-b-c-d) into the remainder formula (2), 


24 All derivatives of f(x) = e~ 1,xi are zero at x = 0, including 
/(0) = 0. What is /(.l)? What is the Taylor series around 
x = 0? What is the radius of convergence? Where does the 
series converge to /(x)? For x = I and n = I what is the 
remainder estimate in (2)? 


25 (a) Find th e first three terms in the binomial series for 
1/^1 -x 2 , 

(b) Integrate to find the first three terms in the Taylor 
series for sin - 1 x. 

26 Show that the binomial coefficients in tf^/t — x = 
are a n = 1 - 3 -5--(2n — l)/2"ft! 


27 For p = — l and p = — 2 find nice formulas for the bino- 
mial coefficients. 


28 Change the dummy variable and add lower limits to make 
X*W l = £* (n + l)x". 
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29 In (1 — x) 1 = Lx n the coefficient of x n is the number of 
groups of n friends that can be formed from 1 friend (not 
binomial repetition is allowed!). The coefficient is 1 and 
there is only one group — the same friend n times, 

(a) Describe all groups of n friends that can be formed 
from 2 friends. (There are n + 1 groups,) 

(b) How many groups of 5 friends can be formed from 3 
friends? 

30 (a) What is the coefficient of x n when 1 +xTx 2 + ”’ 
multiplies 1 + x + x 2 + ■? Write the first three terms, 

(b) What is the coefficient of x 5 in (Ex ft ) 3 ? 

31 Show that the binomial series for 1 + 4x has integer 
coefficients. (Note that x n changes to (4*)". These coefficients 
are important in counting trees, paths, parentheses..,) 

32 In the series for l/ v ' 1 — 4x, show that the coefficient of x rt 
is {2n)\ divided by frr!) 2 . 

Use the binomial series to compute 33-36 with error less than 
1/1000. 

33 (15)' 4 34 ( 1001) 1 3 

35 (I,!) 1 - 1 36 e Jl00 ° 

37 F rom see x = l /[ l — ( 1 — cos x)] find the Taylor series of 
sec x up to x t . What is the radius of convergence r (distance 
to blowup point)? 

38 From sec 2 x = 1 / [ 1 — sin 2 x] find the Taylor series up to 
x 2 . Check hy squaring the secant series in Problem 37. Check 
by differentiating the tangent series in Problem 39, 


39 (Division of series) Find tan x by long division of sin x/ 
cos x: 



40 (Composition of series) If /= u 0 + a^x + a 2 x 2 + and 
g = biX + b 2 x 2 + - j find the 1, x 9 x 2 coefficients of f(g(x)}. 
Test on /= 1/(1 + x), g = x/(l x), with /(g(x)) = 1 — x. 

41 (Multiplication of series) From the series for sin x and 
1/(1 — x) find the first four terms for / = sin x/(l — x), 

42 (Inversion of series) Uf=a l x + a 2 x 2 + ■■ find coefficients 
b z in g = bi* -\- b 2 x 2 + ■■ so that /’(^(x)) = x. Compute 

b lt b 2 forf=c*-\ t x=f '^W+xi. 

43 From the multiplication (sin x)(sin x) or the derivatives of 
f(x) = sin 2 x find the first three terms of the series. Find the 
first four terms for cos 2 x by an easy trick, 

44 Somehow find the first six nonzero terms for f=( 1 —x)j 
(L-x 3 ), 

45 Find four terms of the series for 1 jyfl— x. Then square 
the series to reach a geometric series, 

46 Compute e~ xi dx to 3 decimals by integrating the 
power series. 

47 Compute sin 2 J dt to 4 decimals by power series. 

48 Show that Ijc"/n converges at x = — 1, even though its 
derivative Ex" -1 diverges. How can they have the same 
convergence radius? 

49 Compute lim (sin x — tan x)/x 3 from the series, 

x -* 0 

50 If the nth root of a„ approaches L > 0, explain why Ia n x" 
has convergence radius r= I/L. 

51 Find the convergence radius r around basepoints a = 0 
and a= 1 from the blowup points of (l + tan x)/(J + x 2 ). 
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CHAPTER 11 


Vectors and Matrices 


This chapter opens up a new part of calculus. It is multidimensional calculus , because 
the subject moves into more dimensions. In the first ten chapters, all functions 
depended on time t or position x — but not both. We had /(f) or y(x). The graphs 
were curves in a plane. There was one independent variable (x or £) and one dependent 
variable ( y or /). Now we meet functions /(x, t) that depend on both x and r. Their 
graphs are surfaces instead of curves. This brings us to the calculus of several variables . 

Start with the surface that represents the function /(x,r) or/(x,y) or f(x,y,.t). I 
emphasize functions, because that is what calculus is about. 

EXAMPLE 1 /(x, t) = cos (x - f) is a traveling wave (cosine curve in motion). 

At f = 0 the curve is /=cosx. At a later time, the curve moves to the right 
{Figure 11.1), At each f we get a cross-section of the whole x-£ surface. For a wave 
traveling along a string, the height depends on position as well as time. 

A similar function gives a wave going around a stadium. Each person stands up 
and sits down. Somehow the wave travels. 

EXAMPLE 2 /{x, y) = 3x + y + l is a sloping roof (fixed in time). 

The surface is two-dimensional — you can walk around on it. It is flat because 
3x + y + 1 is a linear function. In the y direction the surface goes up at 45°. If y 
increases by 1, so does/. That slope is 1. In the x direction the roof is steeper (slope 3). 
There is a direction in between where the roof is steepest (slope ^10). 

EXAMPLE 3 /(x, y, £) = cos(x — y — t) is an ocean surface with traveling waves. 

This surface moves. At each time £ we have a new x-y surface. There are three 
variables, x and y for position and £ for time. I can’t draw the function, it needs four 
dimensions! The base coordinates are x, y, t and the height is / The alternative is a 
movie that shows the x-y surface changing with £. 

At time £ = 0 the ocean surface is given by cos (x — y). The waves are in straight 
lines. The line x — y = 0 follows a crest because cos 0=1. The top of the next wave 
is on the parallel line x — y — 2n f because cos In = 1. Figure 11.1 shows the ocean 
surface at a fixed time. 

The line x - y~t gives the crest at time £. The water goes up and down (like people 
in a stadium). The wave goes to shore , but the water stays u i the ocean. 
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Vectors and Matrices 



Fig. 11.1 Moving cosine with a small optical illusion — the darker 
bands seem to go from top to bottom as you turn. 


c = 3.v + y + I 



Of course multidimensional calculus is not only for waves. In business, demand is 
a function of price and date. In engineering, the velocity and temperature depend on 
position * and time t. Biology deals with many variables at once (and statistics is 
always looking for linear relations like z - x 4 - 2y\ A serious job lies ahead, to carry 
derivatives and integrals into more dimensions. 
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In a plane, every point is described by two numbers. We measure across by x and 
up by y. Starting from the origin we reach the point with coordinates (x, y). I want 
to describe this movement by a vector — the straight line that starts at (0, 0) and ends 
at (x, y). This vector \ has a direction, which goes from (0, 0) to (x ? y) and not the 
other way. 

In a picture, the vector is shown by an arrow. In algebra, v is given by its two 
components. For a column vector , write x above y: 


v = 


_x 

V 


(v and y are the components of v). 


a) 


Note that v is printed in boldface; its components x and y are in lightface.f The 
vector — v in the opposite direction changes signs. Adding v to — v gives the zero 
vector {different from the zero number and also in boldface): 


— x 


x — X 


0 


and v - v = 






j ” y_ 


0 


Notice how vector addition or subtraction is done separately on the x’s and y s: 


PI 


r-n 




+ 


= 


1 


2 


3 


( 3 ) 


tAnother way to indicate a vector is v. You will recognize vectors without needing arrows. 
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Rg. 11,3 Parallelogram for v + w, stretching for 2v, signs reversed for -v. 


The vector v has components v : = 3 and = 1. (I write for the first component 
and v 2 for the second component- I also write x and y, which is fine for two com- 
ponents,) The vector w has w t = — 1 and w 2 = 2. To add the vectors, add the com- 
ponents. To draw this addition, place the start of w at the end of v, Figure 11,3 shows 
how w starts where v ends, 

VECTORS WITHOUT COORDINATES 

In that head-to-tail addition of v 4 w, we did something new. The vector w was moved 
away from the origin. Its length and direction were not changed! The new arrow is 
parallel to the old arrow — only the starting point is different. The vector is the same 
as before. 

A vector can be defined without an origin and without x and y axes. The purpose 
of axes is to give the components — the separate distances x and y. Those numbers 
are necessary for calculations. But x and y coordinates are not necessary for head- 
to-tail addition v4 w, or for stretching to 2\, or for linear combinations 2v4 3w. 
Some applications depend on coordinates, others don’t. 

Generally speaking, physics works without axes — it is “coordinate-free," A velocity 
has direction and magnitude, but it is not tied to a point, A force also has direction 
and magnitude, but it can act anywhere — not only at the origin. In contrast, a vector 
that gives the prices of five stocks is not floating in space. Each component has a 
meaning— there are five axes, and we know when prices are zero. After examples 
from geometry and physics (no axes), we return to vectors with coordinates. 

EXAMPLE 1 (Geometry) Take any four-sided figure in space. Connect the midpoints 
of the four straight sides. Remarkable fact : Those four midpoints lie in the same plane. 
More than that, they form a parallelogram. 

Frankly, this is amazing. Figure 1 1 .4a cannot do justice to the problem, because it 
is printed on a flat page. Imagine the vectors A and D coming upward, B and C go 
down at different angles. Notice how easily we indicate the four sides as vectors, not 
caring about axes or origin. 

I will prove that V = W. That shows that the midpoints form a parallelogram. 

What is V? It starts halfway along A and ends halfway along B. The small triangle 
at the bottom shows V = ^A 4 ^B. This is vector addition — the tail of ^B is at the 
head of \ A. Together they equal the shortcut V, For the same reason W = -r ^D. 
The heart of the proof is to see these relationships. 

One step is left. Why is ? A 4 equal to jC 4- ^D? In other words, why is A 4 B 
equal to C 4 D? (I multiplied by 2.) When the right question is asked, the answer 
jumps out, A head-to-tail addition A 4 B brings us to the point R , Also C 4 D brings 
us to R . The proof comes down to one line: 

A+B=FR = C4D. Then V = j A 4 equals W = }C + ^D. 
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Fig. 11.4 Four midpoints form a parallelogram (V = W), Three medians meet at P. 


EXAMPLE 2 (Also geometry) In any triangle, draw lines from the corners to the 
midpoints of the opposite sides. To prove by vectors: Those three lines meet at a point. 
Problem 38 finds the meeting point in Figure 1 1.4c. Problem 37 says that the three 
vectors add to zero. 

EXAMPLE 3 (Medicine) An electrocardiogram shows the sum of many small vectors, 
the voltages in the wall of the heart. What happens to this sum — the heart vector 
V — in two cases that a cardiologist is watching for? 

Case I. Part of the heart is dead (infarction). 

Case 2. Part of the heart is abnormally thick (hypertrophy^ 

A heart attack kills part of the muscle. A defective valve, or hypertension, overworks 
it. In case 1 the cells die from the cutoff of blood (loss of oxygen). In case 2 the heart 
wall can triple in size, from excess pressure. The causes can be chemical or mechanical. 
The effect we see is electrical. 

The machine is adding small vectors and “projecting” them in twelve directions The 
leads on the arms, left leg, and chest give twelve directions in the body. Each graph 
shows the component of V in one of those directions. Three of the projections 
two in the vertical plane, plus lead 2 for front-back™ produce the “mean QRS vector'" 
in Figure 11.5. That is the sum V when the ventricles start to contract. The left 
ventricle is larger, so the heart vector normally points down and to the left. 



Fig. 11.5 V is a sum of small voltage vectors, at the moment of depolarization. 
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Rg. 11.6 Changes in V show dead muscle and overworked muscle. 


We come soon to projections, but here the question is about V itself How does 
the ECG identify the problem? 

Case 1; Heart attack The dead cells make no contribution to the electri- 
cal potential. Some small vectors are missing- Therefore the sum V 
turns away from the infarcted part. 

Case 2; Hypertrophy The overwork increases the contribution to the 
potential. Some vectors are larger than normal. Therefore V turns 
toward the thickened part. 

When V points in an abnormal direction, the ECG graphs locale the problem. The 
P, Q y R, 5, T waves on separate graphs can all indicate hypertrophy, in different 
regions of the heart. Infarctions generally occur in the left ventricle, which needs the 
greatest blood supply. When the supply of oxygen is cut back, that ventricle feels it 
first. The result can be a heart attack (= myocardial infarction = coronary occlusion). 
Section 1 1.2 shows how the projections on the ECG point to the location. 

First come the basic facts about vectors — components, lengths, and dot products. 

COORDINATE VECTORS AND LENGTH 


To compute with vectors wc need axes and coordinates. The picture of the heart is 
"coordinate-free," but calculations require numbers. A vector is known by its compo- 
nents. The unit rectors along the axes are i and j in the plane and i, j, k in space: 



Notice how easily we moved into three dimensions! The only change is that vectors 
have three components. The combinations of i and j (or i, j, k) produce all vectors v 
in the plane (and all vectors V in space): 




402 


1 1 Vectors and Matrices 


Those vectors are also written v = (3, 1) and V = (1,2, - 2). The components ofthe vector 
are also the coordinates of a point. (The vector goes from the origin to the point.) This 
relation between point and vector is so dose that we allow them the same notation: 
P = (x, y, z) and v = (x, y, z) = xi + yj + zk. 

The sum v + V is totally meaningless. Those vectors live in different dimensions. 

From the components we find the length. The length of (3, 1) is \/i 2 + 1 2 — ,/To. 
This comes directly from a right triangle. In three dimensions, V has a third com - 
ponent to be squared and added. The length of V = (x, y, z) is |V] = ,/x 2 4- y 2 + z 2 . 
Vertical bars indicate length, which takes the place of absolute value. The length 
of v = 3i + j is the distance from the point (0, 0) to the point (3, 1): 


|v| = ,/pf + v\ = yio |V| = yt 2 + 2 2 + (-2) 2 = 3. 

A unit vector is a vector of length one. Dividing v and V by their lengths produces 
unit vectors in the same directions: 


v 

M 


p/x/To" 

and — = 

1/3" 

2/3 

|_i/ v To_ 

|V| 

-2/3 


are unit vectors. 


1 1A Each nonzero vector has a positive length |v|. The direction of v is given 
by a unit vector u = v/|v|. Then length times direction equals v. 


A unit vector in the plane is determined by its angle 9 with the x axis: 
cos 9 


cos 9 

u = 

sin 9 


= (cos 0)i + (sin 0}j is a unit vector: |uj 2 = cos 2 f? + sin 2 # = 1. 


In 3-space the components of a unit vector are its “direction cosines”: 

U = (cos x)i + (cos P)\ + (cos y}k: <x, ft, y = angles with x, y, z axes. 

Then cos 2 a + cos 2 /? + cos 2 y = 1. We are doing algebra with numbers while we are 
doing geometry with vectors. It was the great contribution of Descartes to see how 
to study algebra and geometry at the same time. 




Fig. 11.7 Coordinate vectors i, j, k. Perpendicular vectors v • w = (6)(1) + (— 2)(3) - 0. 


THE DOT PRODUCT OF TWO VECTORS 

There are two basic operations on vectors. First, vectors are added (v + w). Second, 
a vector is multiplied by a scalar (7v or - 2w). That leaves a natural question — how 
do you multiply two vectors? The main part of the answer is — you don’t. But there 
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is an extremely important operation that begins with two vectors and produces a 
number. It is usually indicated by a dot between the vectors, as in v • w, so it is called 
the dot product . 


DEFINITION 1 The dot product multiplies the lengths |v| times |w| times a cosine : 
v ■ w = |v| |w| cos 0, 0 = angle between v and w. 


EXAMPLE 


has length 3, 


has length ,/8, the angle is 45°. 


The dot product is |v||w| cos 8 = (3)(.y8)( l/y/2), which simplifies to 6. The square 
roots in the lengths are “canceled” by square roots in the cosine. For computing v * w. 
a second and much simpler way involves no square roots in the first place. 


DEFINITION 2 The dot product v • w multiplies component by component and adds: 


V* W = + t'2 W 2 


3 

0 



= (3)(2) + (0)(2) = 6. 


The first form |v| |w| cos 0 is coordinate-free. The second form o,w, + y 2 w 2 computes 
with coordinates. Remark 4 explains why these two forms are equal. 


1 1B The dot product or scalar product or inner product of three-dimensional 
vectors is 

V- W= |V||W| cos 8= F, + V 2 W 2 + V 3 W 3 . (4) 

If the vectors are perpendicular then 8 = 90° and cos 0 = 0 and V • W = 0. 







1 


r4 


2 


-1 

2 

- 

5 

— 32 (not perpendicular) 

2 

- 

2 

3 


6 


-1 


2 


= 0 ( perpendicular ). 


These dot products 32 and 0 equal |V| |W| cos 8. In the second one, cos 8 must be 
zero. The angle is 7t/2 or — x/2 — in either case a right angle. Fortunately the cosine 
is the same for 8 and - 0, so we need not decide the sign of 8. 

Remark 1 When V = W the angle is zero but not the cosine! In this case cos 8 = 1 

and V • V = |V| 2 . The dot product of\ with itself is the length squared: 

V • V = ( F, , V 2 , F 3 )-(F„ v 2 , F 3 ) = v\+ F 2 + F| = |V| 2 . (5) 

Remark 2 The dot product of i — ( 1, 0. 0) with j = (0. 1, 0) is i • j = 0. The axes are 

perpendicular. Similarly i • k = 0 and j • k = 0. Those are unit vectors: i • i = j • j = 
k * k = 1. 


Remark 3 The dot product has three properties that keep the algebra simple: 

1. V • W = W • V 2. (cV) • W = c(V • W) 3. (U + V) • W = U • W + V • W 

When V is doubled (c = 2) the dot product is doubled. When V is split into i, j, k 
components, the dot product splits in three pieces. The same applies to W, since 
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Hg. H.8 Length squared = (V — W) ■ (V — W), from coordinates and the cosine law. 


V • W = W • V. The nine dot products of i, j, k are zeros and ones, and a giant splitting 
of both V and W gives back the correct V • W: 

V ■ W = + Vj * W 2 \ + K 3 k * W^k + six zeros = V 2 Wx + V 2 W 2 + V 3 W Z . 

Remark 4 The two forms of the dot product are equal , This comes from computing 
| V — W| 2 by coordinates and also by the l iaw of cosines”: 

with coordinates: |V - W| 2 = (V 2 - W x ) 2 + (V 2 - W 2 ) 2 + (K 3 - W 3 f 

from cosine law: |V - W| 2 = |V| 2 + |W| 2 - 2|V| |W| cos 8 . 

Compare those two lines* Line 1 contains V\ and V\ and V\. Their sum matches 
|V| 2 in the cosine law. Also W\ + W\ + W\ matches |W| 2 . Therefore the terms contain- 
ing ^ 2 are the same (you can mentally cancel the -2). The definitions agree : 

“ Wi + V 2 W 2 + V 3 W 3 ) equals - 2|V| |W| cos 0 equals - 2V • W* 

The cosine law is coordinate-free. It applies to all triangles (even in n dimensions). 
Its vector form in Figure 1T8 is |V - W| 2 = |V| 2 - 2V- W + |W| 2 . This application to 
V'W is its brief moment of glory. 


Remark 5 The dot product is the best way to compute the cosine of 8: 


cos 8 = 


V-W 

|V|]Wf 


( 6 ) 


Here are examples of V and W with a range of angles from 0 to k: 


i and 3i have the same direction cos 8= 1 0 = 0 

i ■ (i + ]) = 1 is positive cos 8 — 1/^/2 8 = ti/4 

i and j are perpendicular: i • j = 0 cos 8 = 0 8 -n/2 

i - i + j) = — 1 is negative cos 8 = — 1 j^/2 8 = 3tt/4 

i and - 3i have opposite directions cos 8 = — 1 8 = n 


Remark 6 The Cauchy-Schwarz inequality |V* W| ^ |V| [W| comes from |cos 8\ ^ L 

The left side is |V| |W| |cos 0|. It never exceeds the right side |V||W|, This is a key 
inequality in mathematics, from which so many others follow: 

Geometric mean *Jxy ^ arithmetic mean ^(x + y) {true for any x^O and y^O). 

Triangle Inequality |V + W| ^ |V| + |W| ( |V|, |W[, ]V + W| are lengths of sides). 

These and other examples are in Problems 39 to 44. The Schwarz inequality 
|V-W|£ |V[ |W| becomes an equality when |cos 8\ = 1 and the vectors are , 
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HA EXERCISES 


Read-through questions 

A vector has length and a . If v has components 6 and 
— 8, its length is Ivl = b and its direction vector is u = 
c The product of [v| with u is d This vector goes 
from (0, 0) to the point x = * y = f A combination 

of the coordinate vectors i = 0 and j = h produces 

v = ] 1 -f J j. 

To add vectors we add their k . The sum of (6, — 8) and 
(li 0} is < To see v + i geometrically, put the m of i 
at the n of v. The vectors form a o with diagonal 
v + i. (The other diagonal is p .1 The vectors 2v and — v 
are Q and r Their lengths are i and t 

In a space without axes and coordinates, the tail of V can 
be placed u . Two vectors with the same v are the 
same. If a triangle starts with V and continues with W, the 
third side is w The vector connecting the midpoint of V 
to the midpoint of W is x t That vector is Y the third 
side. In this coordinate-free form the dot product is V * W = 

i 

Using components, V- W= * and (1,2,1)- 
(2, —3, 7) = > . The vectors are perpendicular if c . 

The vectors are parallel if D . V "V is the same as E , 

The dot product of U + V with W equals F The angle 
between V and W has cos 0 = ft . When V - W is negative 
then 0 is H . The angle between i + j and i + k is I 

The Cauchy- Schwarz inequality is J and for V = i + j 
and W = i + k it becomes 1 < K . 

In 1-4 compute V + W and 2V — 3W and |V|* and V -W and 

cos 0. 

1 V = (l, 1, 1) ( W,(-1, -1, -1) 

2 V = i + j s W = j — k 

3 V — i — 2j + k t W = i -h j — 2k 

4 V = (l, 1, 1, Hff = (U3,4) 

5 (a) Find a vector that is perpendicular to (t^, v 2 ). 

(b) Find two vectors that are perpendicular to (u, , u 2 , 

6 Find two vectors that are perpendicular to (1, 1, 0) and to 
each other. 

7 What vector is perpendicular to all 2-dimen$ional vectors? 
What vector is parallel to all 3-dimensional vectors? 

8 In Problems 1-4 construct unit vectors in the same direc- 
tion as V. 

9 If v and w are unit vectors, what is the geometrical mean- 
ing of v * w? What is the geometrical meaning of (v - w)v? Draw 
a figure with v = i and w = (3/5)1 + (4/5]j, 

10 Write down all unit vectors that make an angle 0 with the 
vector {1, 0). Write down all vectors at that angle. 


11 True or false in three dimensions: 

L If both U and V make a 30° angle with W, so does 
U + V. 

2. If they make a 90° angle with W, so does U + V, 

3. If they make a 90° angle with W they are perpendicular: 

u-v = o. 

12 From W = (1, 2, 3) subtract a multiple of V = (1, 1, 1) so 
that W — cV is perpendicular to V. Draw V and W and 
W-cV 

13 (a) What is the sum V of the twelve vectors from the center 
of a clock to the hours? 

(b) If the 4 o’clock vector is removed, find V for the other 
eleven vectors. 

(c) If the vectors to 1, 2, 3 are cut in half, find V for the 
twelve vectors. 

14 (a) By removing one or more of the twelve dock vectors, 
make the length |V| as large as possible. 

(b) Suppose the vectors start from the top instead of the 
center (the origin is moved to 12 o’clock, so y lt = 0). What 
is the new sum V*? 

15 Find the angle POO by vector methods if P = (l, 1,0), 
O = (0, 0, 0},Q = (1, 2, -2). 

16 (a) Draw the unit vectors Ui=(cos0, stn0) and u 2 = 
(cos0, sin 0). By dot products find the formula for 
cos (0 — 0). 

(b) Draw the unit vector u^ from a 90° rotation of u 2 . By 
dot products find the formula for sin (8 + 0). 

17 Describe all points (x, y) such that v = xi + y\ satisfies 

(a)|v| = 2 (b) |v - i| = 2 

(c) v *i = 2 (d) v - i — ]v| 

18 (Important) If A and B are non-parallel vectors from the 
origin, describe 

(a) the endpoints of fB for all numbers t 

(b) the endpoints of A 4- iB for all f 

(c) the endpoints of sA + rB for all s and t 

(d) the vectors v that satisfy v "A — v <# B 

19 (a) If v + 2w = i and 2v 4- 3w — j find v and w. 

(b) If v = i -t- j and w = 3i -h 4j then i = v + 

w. 

20 If P = (0, 0) and ii — (0, 1) choose Q so the angle PQR is 

9Cf\ All possible Q’s lie in a . 

21 (a) Choose d so that A = 2i 4- 3j is perpendicular to 
B = 9i + d\. 

(b) Find a vector C perpendicular to A = i + j + k and 
B = i-k. 




406 


1 1 Vector* and Matrices 


22 If a boat has velocity V with respect to the water and the 

water has velocity W with respect to the land, then . 

The speed of the boat is not |V| + |W| hut , r . 

23 Find the angle between the diagonal of cube and (a) an 
edge (b) the diagonal of a face (c) another diagonal of the 
cube. Choose lines that meet, 

24 Draw the triangle PQR in Example 1 (the four-sided figure 

in space). By geometry not vectors, show that PR is twice as 
long as V, Similarly |PR| = 2|W|. Also V is parallel to W 
because both are parallel to L . So V = W as before, 

25 (a) If A and B are unit vectors, show that they make equal 
angles with A + B, 

(b) If A, B, C are unit vectors with A + B + C = 0, they 

form a triangle and the angle between any two 

is , 

26 (a) Find perpendicular unit vectors I and J in the plane 
that are different from i and j. 

(b) Find perpendicular unit vectors I, J, K different from 
i,j, k. 

27 If I and J are perpendicular, take their dot products with 
A = al + hJ to find a and b , 

28 Suppose 1 = (i + j )!yfl and J = (i — jV^/2. Check I • J = 0 
and write A = 2i + 3j as a combination al + bJ. (Best method: 
use a and b from Problem 27, Alternative: Find i and j from 
I and J and substitute into A.) 

29 (a) Find the position vector OP and the velocity vector 
PQ when the point P moves around the unit circle (see figure) 
with speed 1, (b) Change to speed Z 

30 The sum (A * i) 2 + (A • j) 2 + (A • k) 2 equals - 

31 In the semicircle find C and D in terms of A and B, Prove 
that C • D = 0 (they meet at right angles). 


35 The vector from the earth's center to Seattle is oi + hj + ck, 

(a) Along the circle at the latitude of Seattle, what two 
functions of a, b, c stay constant? k goes to the North Pole, 

(b) On the circle at the longitude of Seattle — the 
meridian — what two functions of a, b t c stay constant? 

(c) Extra credit: Estimate a, b,c in your present position. 
The 0° meridian through Greenwich has b = 0. 

36 If | A + B| 2 = |A| 2 4- |B| 2 , prove that A is perpendicular to B. 

37 In Figure 11.4, the medians go from the comers to the 
midpoints of the opposite sides. ExpressM], M 2 ,Mj in terms 
of A, B, C. Prove that M x + M 2 + M 3 = 0, What relation 
holds between A, B, C? 

38 The point $ of the way along is the same for all three 

medians. This means that A -F = - Prove 

that those three vectors are equal. 

39 (a) Verify the Schwarz inequality [V • W|< ]V| |W| for V = 
i + 2j + 2k and W = 2i + 2j + k. 

(b) What does the inequality become when V = (^/x, ^fy) 
and W=(V^, J~xf. 

40 By choosing the right vector W in the Schwarz inequality, 
show that (Vj + V 2 + F 3 ) 2 ^ 3(Kj + V\ + V\). What is W? 

41 The Schwarz inequality for ai + bj and ci + dj says that 
(i a 2 + b 2 )(c 2 + d 2 } ^ (ac + bd) 2 . Multiply out to show that the 
difference is ^0. 

42 The vectors A, B, C form a triangle if A + B + C = 0. The 

triangle inequality |A + Bj< |A| + ]B| says that any one side 
length is [ess than . The proof comes from Schwarz: 

|A + B| 2 =A-A + 2A-B + B-B 

« |A| 2 + + |B| 2 = ( |A| + |B|) 2 . 




32 The diagonal PR has \PR\ 2 = (A + B)(A + B) = A- A + 
A*B + B*A + B*B. Add IQS) 2 from the other diagonal to 
prove the parallelogram law: |PR| 2 + |QS| 2 = sum of squares 
of the four side lengths, 

33 If (1, 2, 3), (3, 4, 7), and (2, 1, 2) are comers of a parallelo- 
gram, find all possible fourth corners. 

34 The diagonals of the parallelogram are A + B and 

. If they have the same length, prove that A • B = 0 

and the region is a . 


43 True or false, with reason or example: 

(a) |V + W| 2 is never larger than |V| 2 + |W| 2 

(b) In a real triangle |V + W| never equals |V| + |W] 

(c) V • W equals WV 

(d) The vectors perpendicular to i + j + k lie along a line, 

44 If V = i + 2k choose W so that V • W = | V| |W| and 
|V + WJ=JV| + |W|. 
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45 A methane molecule has a carbon atom at (0, 0, 0) and 
hydrogen atoms at (1, 1, -1), (1, -1, 1), (— 1, 1, 1), and 
(-1, -1, -1). Find 

(a) the distance between hydrogen atoms 

(b) the angle between vectors going out from the carbon 
atom to the hydrogen atoms. 


46 (a) Find a vector V at a 45 c angle with i and j, 

(b) Find W that makes a 60* angle with i and j, 

(c) Explain why no vector makes a 30° angle with i and j, 



11.2 Planes and Projections 



The most important “curves” are straight lines. The most important functions are 
linear. Those sentences take us back to the beginning of the book — the graph of 
mx + b is a line. The goal now is to move into three dimensions, where graphs are 
surfaces. Eventually the surfaces will be curved. But calculus starts with the flat 
surfaces that correspond to straight lines: 

What are the most important surfaces? Planes. 

What are the most important functions? Still linear. 

The geometrical idea of a plane is turned into algebra, by finding the equation of a 
plane . Not just a general formula, but the particular equation of a particular plane. 

A line is determined by one point (x 0 , y 0 ) and the slope m. The point-slope equation 
is y — y 0 = m(x — x 0 ). That is a linear equation, it is satisfied when y = y 0 an ^ x = x o* 
and dyjdx is m . For a plane, we start again with a particular point — which is now 
(x 0j y 0 , z 0 )* But the slope of a plane is not so simple. Many planes climb at a 45° 
angle — with “slope 1” — and more information is needed. 

The direction of a plane is described by a vector N. The vector is not in the plane, 
but perpendicular to the plane. In the plane, there are many directions. Perpendicular 
to the plane, there is only one direction. A vector in that perpendicular direction is 
a normal vector . 

The normal vector N can point “up” or “down”. The length of N is not crucial (we 
often make it a unit vector and call it n). Knowing N and the point P 0 = (x 0l y 0 , z 0 ), 
we know the plane (Figure 11.9). For its equation we switch to algebra and use the 
dot product — which is the key to perpendicularity. 

N is described by its components ( a t b, c). In other words N is a\ + b] + ck. This 
vector is perpendicular to every direction in the plane. A typical direction goes from 



Fig. 11.9 The normal vector to a plane. Parallel planes have the same N. 
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P 0 to another point P = (x, y, z) in the plane. The vector from P 0 to P has components 
(x — x 0f y — y 0 , z — z 0 ). This vector lies in the plane, so its dot product with N is zero: 

11C The plane through P 0 perpendicular to N = (a, b , c) has the equation 
(a,fr,cHx-x 0 , y-y<>, z-zo) =0 or 
a{x - x 0 ) + b(y - y 0 ) + c(z - z 0 ) = 0. (1 ) 

The point P lies on the plane when its coordinates x, y, z satisfy this equation. 


EXAMPLE 1 The plane through P 0 = (1, 2, 3) perpendicular to N = (1, 1, 1} has the 
equation (x — 1) + (y - 2) + (z - 3) “ 0. That can be rewritten as x + y + z = 6. 

Notice three things. First, P 0 lies on the plane because 1 + 2 + 3 = 6. Second, N = 
( 1 , 1 , 1 ) can be recognized from the x, y, z coefficients in x + y + z = 6, Third, we could 
change N to (2, 2, 2) and we could change P 0 to (8, 2, - 4)“ because N is still perpen- 

dicular and P 0 is still in the plane: 8 + 2 - 4 = 6. 

The new normal vector N = (2, 2, 2) produces 2(x ™ 1) + 2{y - 2) + 2(z — 3) = 0. 
That can be rewritten as 2x + 2y + 2z = 12. Same normal direction, same plane. 

The new point P 0 = (8, 2, - 4) produces {x — 8) + (y “ 2) + (z + 4) = 0. That is 
another form of x + y + z = 6. All we require is a perpendicular N and a point P 0 in 
the plane. 

EXAMPLE 2 The plane through (1, 2, 4) with the same N=<1, 1,1) has a different 

equation: {x — 1) + (y — 2) H- (z — 4) = 0. This is x + y + z = 7 (instead of 6), These 

planes with 7 and 6 are parallel . 

Starting from a(x - x 0 ) + fr(y - y 0 ) ^ c '( z “ z o) = 0, we often move ax 0 + 6y 0 + cz 0 
to the right hand side — and call this constant d : 


1 1 D With the P 0 terms on the right side, the equation of the plane is N • P = d\ 
ax + fry + ci — ax 0 + fry 0 + cz 0 = d. (2) 

A different d gives a parallel plane ; d — 0 gives a plane through the origin . 


EXAMPLE 3 The plane x - y -f 3z = 0 goes through the origin (0, 0, 0). The normal 
vector is read directly from the equation: N = ( 1 , - 1 , 3). The equation is satisfied hy 
P 0 = (1, 1,0) and P = (1, 4, 1), Subtraction gives a vector V = (0, 3, 1) that is in the 
plane, and N • V = 0. 

The parallel planes x — y H- 3z = d have the same N but different d y s. These planes 
miss the origin because d is not zero (x = 0, y = 0, z = 0 on the left side needs d = 0 
on the right side). Note that 3x - 3y + 9z = - 15 is parallel to both planes. N is 
changed to 3N in Figure 11.9, but its direction is not changed. 

EXAMPLE 4 The angle between two planes is the angle between their normal vectors . 

The planes x — y+3z = 0 and 3y + z = 0 are perpendicular, because (1, — 1 , 3) • 
(0,3, 1) = 0. The planes z — 0 and y = 0 are also perpendicular, because (0,0, 1)* 
(0, 1,0) = 0, (Those are the xy plane and the xz plane.) The planes x + y = 0 and 
x + z = 0 make a 60' angle, because cos 60 u = (1, 1, 0) - (1, 0, \)j^/2^/i = 

The cosine of the angle between two planes is |N X • N 2 [/]N,[ |N Z |* See Figure 1 M0, 
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Fig. 11.10 Angle between planes wangle between normals. Parallel and perpendicular to a 
line. A line in space through P 0 and Q. 


Remark I We gave the “point-slope'' equation of a line (using m), and the “point- 
normal'’ equation of a plane (using N). What is the normal vector N to a line? 

The vector V = (1, m) is parallel to the line y = mx + h. The line goes across by I 
and up by m. The perpendicular vector is N — (— m, 1), The dot product N*V is 
- m + m — 0. Then the point-normal equation matches the point-slope equation: 

- m(x — x a ) + \{y — y 0 ) = 0 is the same as y — y 0 = m(x — x 0 ). (3) 

Remark 2 What is the point-slope equation for a plane? The difficulty is that a 
plane has different slopes in the x and y directions. The function /(x, y) = 
m(x — x 0 ) + M{y— y 0 ) has two derivatives m and M. 

This remark has to stop. In Chapter 13, “slopes" become “partial derivatives." 

A LINE m SPACE 

In three dimensions, a line is not as simple as a plane. A line in space needs two 
equations, Bach equation gives a plane, and the line is the intersection of two planes. 

The equations .x + y + z = 3 and lx A- 3 y + z = 6 determine a line. 

Two points on that line are P 0 = (U L 1) and Q - (3, 0. 0). They satisfy both equations 
so they lie on both planes. Therefore they are on the line of intersection. The direction 
of that line, subtracting coordinates of from Q, is along the vector V = 2i — j — k. 

The line goes through P 0 = (1 , ! . 1 1 in the direction of V = 2i — j — k. 

Starting from (x 0 , y 0 , z 0 ) = (1, 1, 1), add on any multiple fV. Then x = l + 2 1 and 
y— I— t and 2 = 1 — t. Those are the components of the vector equation 
P = P 0 + rV — which produces the line. 

Here is the problem. The line needs two equations — or a vector equation with a 
parameter f. Neither form is as simple as ax + by + cz = d . Some books push ahead 
anyway, to give full details about both forms. After trying this approach, I believe 
that those details should wait. Equations with parameters are the subject of 
Chapter 12, and a line in space is the first example. Vectors and planes give plenty 
to do here — especially when a vector is projected onto another vector or a plane. 

PROJECTION OF A VECTOR 

What is the projection of a vector B onto another vector A? One part of B goes along 
A — that is the projection. The other part of B is perpendicular to A. We now compute 
these two parts, which are P and B - P. 
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In geometry, projections involve cos 9. In algebra, we use the dot product (which 
is closely tied to cos 0). In applications, the vector B might be a velocity V or a force 

F: 

An airplane flies northeast, and a 100-mile per hour wind blows due 
east. What is the projection of V = (100, 0) in the flight direction A? 

Gravity makes a ball roll down the surface 2x + 2y -I- z = 0. What are 
the projections of F = (0, 0, - mg) in the plane and perpendicular to 
the plane? 

The component of V along A is the push from the wind (tail wind). The other 
component of V pushes sideways (crosswind). Similarly the force parallel to the 
surface makes the ball move. Adding the two components brings back V or F. 



Fig. 11.11 Projections along A of wind velocity V and force F and vector B. 


We now compute the projection of B onto A. Call this projection P. Since its 
direction is known — P is along A — we can describe P in two ways: 

1) Give the length of P along A 

2) Give the vector P as a multiple of A. 

Figure 11.11b shows the projection P and its length. The hypotenuse is |B|. The 
length is |P| = |B| cos 9. The perpendicular component B - P has length |B| sin 9. The 
cosine is positive for angles less than 90°. The cosine (and P!) are zero when A and 
B are perpendicular. |B| cos0 is negative for angles greater than 90°, and the pro- 
jection points along — A (the length is |B| |cos 0|). Unless the angle is 0° or 30° or 45° 
or 60° or 90°, we don’t want to compute cosines — and we don’t have to. The dot 
product does it automatically: 

AB 

|A| |B| cos 9 = A • B so the length of P along A is |B| cos 9 = — 77 *. (4) 

|A| 

Notice that the length of A cancels out at the end of (4). If A is doubled, P is 
unchanged. But if B is doubled, the projection is doubled. 

What is the vector P? Its length along A is A • B/|A|. If A is a unit vector, then 
|A| = 1 and the projection is A • B times A. Generally A is not a unit vector, until we 
divide by |A|. Here is the projection P of B along A: 

P - </«■*.* of Pita,-, ,-ecor) - (7) (j£j) - £j? A- 


( 5 ) 
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EXAMPLE 5 For the wind velocity V = (100, 0) and flying direction A = (1, I), find P. 
Here V points east, A points northeast. The projection of V onto A is P: 

, , AV 100 „ AV 100 

length |P| = -j^- = -j*' vector P = A = — (1, 1) = (50, 50). 


rr itu 


EXAMPLE 6 Project F = (0 T 0, — mg) onto the plane 
The projection of F along N is not the answer. But compute that first: 


P= y N= _!"i (2 2i) 

|N| 3 |N| 2 9 1 h 


P is the component of F perpendicular to the plane. It does not move the ball. The 
in-plane component is the difference F - P. Any vector B has two projections, along 
A and perpendicular: 


A * B 

The projection P — 2 A is perpendicular to the remaining component B — P. 

|A| 


EXAMPLE 7 Express B = i - j as the sum of a vector P parallel to A = 3i + j and a 
vector B - P perpendicular to A. Note A ■ B = 2. 


Solution 


_ AB t 2 6 2 , 

|A| 2 A 10 A 10 ,+ 10 J ' 


Then B-P = 


12 . 

10 1 I0 J 


Check. P ■ (B - P) = (-ntK'ns) — (reX'ro) = 0. These projections of B are perpendicular. 
Pythagoras: |P| 2 + |B- P| 2 equals |B| 2 . Check that too: 0.4 + 1.6 = 2.0. 

Question When is P = 0? Answer When A and B are perpendicular. 


EXAMPLE 8 Find the nearest point to the origin on the plane x + 2y + 2z = 5. 

The shortest distance from the origin is along the normal vector N. The vector P to 
the nearest point (Figure 1 1.12) is r times N, for some unknown number (. We find t 
by requiring P= (N to lie on the plane. 

The plane x + 2y + 2z = 5 has normal vector N = (1, 2, 2). Therefore P = rN = 
(t, 2 (, 2t). To lie on the plane, this must satisfy x + 2y + 2z = 5: 

r + 2(2() + 2(2t) = 5 or 9f = 5 or t = §. (6) 

Then P = = (§, ^, ^). That locates the nearest point. The distance is ||N| = 3. 

This example is important enough to memorize, with letters not numbers: 


HE On the plane ax + by + cz = d, the nearest point to (0, 0, 0) is 


(da, db, dc) 
a 2 + b 2 + c 1 ' 


The distance is 


\£ 

yfc? + b 2 + c 2 


(?) 


The steps are the same. N has components a, b, c. The nearest point on the plane is 
a multiple (ta, tb, tc). It lies on the plane if a(ta) + b(tb) + c((c) = d. 

Thus r = dj(a 2 + b 2 + c 2 ). The point (ta, tb, tc) = tN is in equation (7). The distance 
to the plane is |tN| = |<f|/|N|. 
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/ \p = /N |p| = 


5 


V ! +4 + 4 3 



V 1 +4 + 4 


^ plane x + 2y + 2z = 5 / P - Q + f N 

Fig. 11.12 Vector to the nearest point P is a multiple fN, The distance is in (?) and (9). 


Question How far is the plane from an arbitrary point Q = (x lT y lT Zj)? 

Answer The vector from Q to P is our multiple t N. In vector form P = Q + f N. This 
reaches the plane if P * N = d< and again we find t: 

(Q + tN) - N - d yields t = (d- Q- N)/|N[ 2 . (8) 

This new term Q-N enters the distance from Q to the plane: 

distance — [r N] = \d — Q * N|/|N| = | d — ax x — by l — cz^/^ui 2 + b 2 + c 2 . (9) 

When the point is on the plane, that distance is zero— because ax x + by l + cz l = d. 
When Q is i T 3j + 2k, the figure shows Q - N = 11 and distance = 2. 

PROJECTIONS OF THE HEART VECTOR 

An electrocardiogram has leads to your right arm left arm-left leg. You produce the 
voltage . The machine amplifies and records the readings. There are also six chest 
leads, to add a front-back dimension that is monitored across the heart. We will 
concentrate on the big “Einthoven triangle/' named after the inventor of the ECG. 

The graphs show voltage variations plotted against time. The first graph plots the 
voltage difference between the arms. Lead II connects the left leg to the right arm. 
Lead III completes the triangle, which has roughly equal sides (especially if you are 
a little lopsided). So the projections are based on 60" and 120° angles. 

The heart vector V is the sum of many small vectors — all moved to the same 
origin, V is the net effect of action potentials from the cells — small dipoles adding to 
a single dipole. The pacemaker (S-zl node) starts the impulse. The atria depolarize 
to give the P wave on the graphs. This is actually a P loop of the heart vector — the 



LEAD il 



LEAD oV R 


LEAD III 



Fig. A The graphs show the component of the moving heart vector along each lead. These 
figures are reproduced with permission from the Cl BA Collection of Medical Illus- 
trations by Frank H. Net ter, M,D. Copyright 1978 CIBA-GEIGY, all rights reserved. 
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graphs only show its projections. The impulse reaches the AV node, pauses, and 
moves quickly through the ventricles* This produces the QRS complex— the large 
sharp movement on the graph* 

The total QRS interval should not exceed 1/10 second (2£ spaces on the printout), 
V points first toward the right shoulder. This direction is opposite to the leads, so 
the tracings go slightly down. That is the Q wave, small and negative* Then the heart 
vector sweeps toward the left leg* In positions 3 and 4, its projection on lead I 
{between the arms) is strongly positive. The R wave is this first upward deflection in 
each lead. Closing the loop, the S wave is negative (best seen in leads I and aVR). 

Question 1 How many graphs from the arms and leg are really independent? 
Answer Only two! In a plane, the heart vector V has two components* If we know 
two projections, we can compute the others* (The ECG does that for us.) Different 
vectors show better in different projections* A mathematician would use 90° angles, 
with an electrode at your throat* 

Question 2 How are the voltages related? What is the aVR lead? 

Answer Project the heart vector V onto the sides of the triangle: 

The lead vectors have L, — L,i H- L m = 0 — they form a triangle. 

The projections have V T - V„ + V m = V- I fl -V-L n + V-L ni = 0. 

TheaVR lead is — — ^Ljj. It is pure algebra (no wire). By vector addition it points 

toward the electrode on the right arm* Its length is if the other lengths are 2* 
Including aVL and aVF to the left arm and foot, there are six leads intersecting at 
equal angles * Visualize them going out from a single point (the origin in the chest). 



R 



Rg.B Heart vector goes around the QRS loop. Projections are spikes on the ECG. 

Question 3 If the heart vector is V = 2\ — j, what voltage differences are recorded? 
Answer The leads around the triangle have length 2. The machine projects V: 

Lead I is the horizontal vector 2i So V * = 4* 

Lead II is the —60° vector i - So V *L ]t = 2 + 

Lead III is the — 120° vector - i — ^/3j. So V *L in = — 2 + y/3. 

The first and third add to the second* The largest R waves are in leads I and II* In 
aVR the projection of V will be negative {Problem 46), and will be labeled an S wave* 
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Question 4 What about the potential (not just its differences). Is it zero at the center? 
Answer It is zero if we say so. The potential contains an arbitrary constant C. (It is 
like an indefinite integral. Its differences are like definite integrals.) Cardiologists 
define a “central terminal*’ where the potential is zero. 

The average of V over a loop is the mean heart vector H. This average requires 
J \dt , by Chapter 5. With no time to integrate, the doctor looks for a lead where the 
area under the QRS complex is zero. Then the direction of H (the axis) is perpendicu- 
lar to that lead. There is so much to say about calculus in medicine. 


11.2 EXERCISES 


Read-through questions 

A plane in space is determined by a point P 0 = (x 0l yo, z 0 ) 
and a q vector N with components (o,f>, c). The point 
P = (x, y, z) is on the plane if the dot product of N with u 
is zero. (That answer was not P\) The equation of this plane 
is d c ) + M d ) + c(_£_) = 0. The equation is also 
written as ax + by + cz — d t where d equals * . A parallel 

plane has the same B and a different h . A plane 
through the origin has d — I 

The equation of the plane through P 0 = (2, 1, 0) perpendic- 
ular to N = (3, 4, 5) is 1 A second point in the plane is 
P — (0, 0. fc 1. The vector from P 0 to P is I and it is 
m to N. (Check by dot product.) The plane through P Q = 
(2, 1, 0) perpendicular to the z axis has N = n and equa- 
tion o . 

The component of B in the direction of A is p . where 
0 is the angle between the vectors. This is A • B divided by 
q . The projection vector P is |B| cos 0 times a * 
vector in the direction of A. Then P = ( |B| cos 0)(A/|A|) sim- 
plifies to * . When B is doubled, P is * When A is 

doubled, P is u If B reverses direction then P v If 
A reverses direction then P * . 

When B is a velocity vector, P represents the * . When 

B is a force vector, P is Y . The component of B perpen- 
dicular to A equals i . The shortest distance from (0, 0, 0) 
to the plane ax + by + cz = d is along the * vector. The 
distance is b and the closest point on the plane is P = 
c . The distance from Q = (x 1 ,yi,z 1 ) to the plane is 
D 


Find two points P and P a on the planes 1-6 and a normal 
vector N. Verify that N • (P — P 0 ) = 0. 

1 x + 2y + 3z = 0 2 x + 2y + 3z = 6 3 the yz plane 

4 the plane through (0, 0, 0) perpendicular to i + j — k 

5 the plane through (1, I, 1) perpendicular to i + j — k 

6 the plane through (0, 0, 0) and (1, 0, 0) and (0, 1, 1). 


Find an x — y — z equation for planes 7-10. 

7 The plane through P 0 ^(1, 2, —1) perpendicular to N = 

i+j 

8 The plane through P 0 = (1, 2, —1) perpendicular to N = 
i + 2 j - k 

9 The plane through (1, 0, 1) parallel to x + 2y 4- z = 0 

10 The plane through (x 0 , yo, ^o) parallel to x + y + z = 1. 

11 When is a plane with normal vector N parallel to the 
vector V? When is it perpendicular to V? 

12 (a) If two planes are perpendicular (front wall and side 
wall), is every line in one plane perpendicular to every line 
in the other? 

(b) If a third plane is perpendicular to the first, it might 
be (parallel) (perpendicular) (at a angle) to the second. 

13 Explain why a plane cannot 

(a) contain (1, 2, 3) and (2, 3, 4) and be perpendicular to 
N = i +j 

(b) be perpendicular to N = i + j and parallel to V = i + k 

(c) contain (1, 0, 0), (0, 1, 0), (0, 0, 1), and (1, 1, 1) 

(d) contain (1,1, — 1) if it has N = i + j — k (maybe it can) 

(e) go through the origin and have the equation 

+ by + = 1. 

14 The equation 3x + 4y + 7z — r = 0 yields a hyperplane in 
four dimensions. Find its normal vector N and two points /\ 
Q on the hyperplane. Check (P — Q) * N = 0. 

15 The plane through (x, y, z) perpendicular to a\ + bj + ck 

goes through (0, 0, 0) if l . The plane goes through 

(x 0 , y 0 ,z 0 ) if . 

16 A curve in three dimensions is the intersection of 

surfaces. A line in four dimensions is the intersection of 
t hyperplanes. 

17 (angle between planes) Find the cosine of the angle 
between x + 2y + 2z“0 and (a) x + 2z = Q (b) x + 2z = 5 

(c) x =0. 
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18 N is perpendicular to a plane and V is along a line. Draw 
the angle 0 between the plane and the line, and explain why 
V*N/|V| |N| is sin 0 not cos 0, Find the angle between the xy 
plane and V ^=i + j + *j2k. 

In 19-26 find the projection P of D along A, Also find |P|. 
19A={4,2,4),B = a~l,0) 

20 A = (1, —1, 0), B = (4, 2, 4) 

21 B = unit vector at 60° angle with A 

22 B = vector of length 2 at 60° angle with A 

23 B = - A 24 A — i +j, B = i + k 

25 A is perpendicular to x — y -f z = 0, B = i+j. 

26 A is perpendicular to x — y + z = 5, B = i + j + 5k. 

27 The force F = 3i — 4k acts at the point (1, 2, 2). How much 
force pulls toward the origin? How much force pulls vertically 
down? Which direction does a mass move under the force F? 

28 The projection of B along A is P = . The projec- 
tion of B perpendicular to A is . Check the dot 

product of the two projections. 

29 P = (*, y, z) is on the plane ax + by + cz = 5 if P * N = 

|P|]N| cos 0 = 5. Since the largest value of cos 0 is 1, the small- 
est value of |P| is . This is the distance between 

30 If the air speed of a jet is 500 and the wind speed is 50, 
what information do you need to compute the jet’s speed over 
land? What is that speed? 

31 How far is the plane x + y — z — 1 from (0, 0, 0) and also 
from (1, 1, —I)? Find the nearest points. 

32 Describe all points at a distance 1 from the plane 
x + 2y + 2z = 3. 

33 The shortest distance from Q = (2,1,1) to the plane 

x+y + z — 0 is along the vector .. The point P — 

Q + tN *= (2 + 1 7 1 + f, 1 + r) lies on the plane if f = . 

Then P = and the shortest distance is r , 

(This distance is not |P|.) 

34 The plane through (1, U 1) perpendicular to N = 

i + 2j + 2k is a distance from (0, 0, 0). 

35 (Distance between planes) 2x — 2y + z = 1 is parallel 

to 2x —2 y + z = 3 because , Choose a vector Q on 

the first plane and find I so that Q + tN lies on the second 
plane. The distance is |r N| = ♦ 


36 The distance between the planes x + y + 5z = 1 and 

+ 2y + z = 1 is zero because , 

In Problems 37-41 all points and vectors are in the xy plane, 

37 The line 3 jc + 4y = 10 is perpendicular to the vector N = 

. On the line, the closest point to the origin is P — 

fN. Find t and P and [P|, 

38 Draw the line x + 2y — 4 and the vector N = i + 2j. The 
closest point to Q = (3, 3) is P — Q + tN, Find f. Find P. 

39 A new way to find P in Problem 37: minimize x 2 + y 2 - 

x 2 + — jx) 2 . By calculus find the best x and y. 

40 To catch a drug runner going from (0, 0) to (4, 0) at 8 
meters per second, you must travel from (0, 3) to (4, 0) at 

meters per second. The projection of your velocity 

vector onto his velocity vector will have length , 

41 Show by vectors that the dis tance fr om (x ir yi) to the line 
ax + by — d is \d — ax i — byJ/^/a 2 + b 2 . 

42 It takes three points to determine a plane. So why does 
ax + by + cz — d contain four numbers a, b, c, d? When does 
ex +/y + gz= 1 represent the same plane? 

43 (projections by calculus) The dot product of B — fA with 
itself is |B| 2 — 2fA -B + t 2 |A| 3 . (a) This has a minimum at 

t = (b) Then fA is the projection of . A 

figure showing B, fA, and B — rA is worth 1000 words, 

44 From their equations, how can you tell if two planes are 
(a) parallel (b) perpendicular (c) at a 45° angle? 


Problems 45-48 are about the ECG and heart vector 

45 The aVR lead is — iLi — iW Find the aVL and aVF 
leads toward the left arm and foot. Show that 
aVR + aVL + aVF = 0. They go out from the center at 120° 
angles. 

46 Find the projection on the aVR lead of V =2i~j in 
Question 3. 

47 If the potentials are = t (right arm) and <p LA — 2 and 
<P LL = — 3, find the heart vector V. The differences in potential 
are the projections of V, 

48 IfV is perpendicular to a lead L, the reading on that lead 

is If JV(f)d£ is perpendicular to lead L, why is the 

area under the reading zero? 
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11.3 Cross Products and Determinants 



After saying that vectors are not multiplied, we offered the dot product. Now we 
contradict ourselves further, by defining the cross product. Where A * B was a number, 
the cross product A x B is a vector . It has length and direction: 

The length is |A| |B| Jsin 8\, The direction is perpendicular to A and B. 

The cross product (also called vector product) is defined in three dimensions only. 
A and B lie on a plane through the origin. A x B is along the normal vector N, 
perpendicular to that plane. We still have to say whether it points “up’’ or “down” 
along N. 

The length of A x B depends on sin 0, where A • B involved cos 0. The dot product 
rewards vectors for being parallel (cos 0=1). The cross product is largest when A is 
perpendicular to B (sin nj2 - 1). At every angle 

|A • B| 2 + |A x B| 2 = |A| 2 |Bj 2 cos 2 0 + JA| 2 |B| 2 sin 2 0 = |A| 2 |B|\ (1) 

That will be a bridge from geometry to algebra. This section goes from definition to 
formula to volume to determinant. Equations (6) and (14) are the key formulas for 
A x B. 

Notice that A x A - 0. (This is the zero vector, not the zero number.) When B is 
parallel to A, the angle is zero and the sine is zero. Parallel vectors have A x B = 0. 
Perpendicular vectors have sin 0 = 1 and |A x B| = |A| |B| = area of rectangle with 
sides A and B. 

Here are four examples that lead to the cross product A x B. 

EXAMPLE 1 (From geometry) Find the area of a parallelogram and a triangle. 

Vectors A and B, going out from the origin, form two sides of a triangle. They produce 
the parallelogram in Figure 11.13, which is twice as large as the triangle. 

The area of a parallelogram is base times height (perpendicular height not sloping 
height). The base is |A|. The height is |B||sin0|. We take absolute values because 
height and area are not negative. Then the area is the length of the cross product: 

area of parallelogram = |A| |B| |sin 0\ = |A x B|. (2) 



base | A[ 

area [ A [ [ B [ | sin 0 | = | A x B | 



Rg. 11.13 Area | A x B| and moment |R x F[. Cross products are perpendicular to the page. 


EXAMPLE 2 (From physics) The torque vector T = R x F produces rotation. 

The force F acts at the point (jc, y, z). When F is parallel to the position vector R = 
xi + yj + zk, the force pushes outward (no turning ). When F is perpendicular to R, 
the force creates rotation . For in-between angles there is an outward force |F| cos 9 
and a turning force |F| sin 9 . The turning force times the distance ]R| is the moment 
|R| |F[ sin 6. 
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The moment gives the magnitude and sign of the torque vector T=RxF. The 
direction of T is along the axis of rotation, at right angles to R and F. 

EXAMPLE 3 Does the cross product go up or down? Use the right-hand rule. 

Forces and torques are probably just fine for physicists. Those who are not natural 
physicists want to see something turo.f We can visualize a record or compact disc 
rotating around its axis — which comes up through the center. 

At a point on the disc, you give a push. When the push is outward (hard to do), 
nothing turns. Rotation comes from force '‘around" the axis. The disc can turn either 
way — depending on the angle between force and position, A sign convention is 
necessary, and it is the right-hand rule : 

A x B points along your right thumb when the fingers curl from A toward B. 

This rule is simplest for the vectors i, j, k in Figure 1 1.14— which is all we need. 

Suppose the fingers curl from i to j. The thumb points along k. The x-y-z axes 
form a “right-handed triple." Since ji| — 1 and |j| = 1 and sin jt/2 = 1, the length of i x j 
is 1. The cross product is ixj = k. The disc turns counterclockwise — its angular 
velocity is up — when the force acts at i in the direction j. 

Figure 1 1.14b reverses i and j. The force acts at j and its direction is i. The disc 
turns clockwise (the way records and compact discs actually turn). When the fingers 
curl from j to i. the thumb points down. Thus j x i = - k. This is a special case of an 
amazing rule: 

The cross product is andcommutative: B x A = - (A x B). (3) 

That is quite remarkable. Its discovery by Hamilton produced an intellectual revolu- 
tion in I9lh century algebra, which had been totally accustomed to AB = BA. This 
commutative law is old and boring for numbers (it is new and boring for dot pro- 
ducts). Here we see its opposite for vector products A x B. Neither law holds for 
matrix products. 


i x j = k 



EXAMPLE 4 A screw goes into a wall or out. following the right-hand rule. 

The disc was in the xy plane. So was the force. (We are not breaking records here.) 
The axis was up and down. To see the cross product more completely we need to 
turn a screw into a wall. 

Figure 1 1.14b shows the xz plane as the wail. The screw is in the y direction. By 
turning from x toward z we drive the screw into the wall — which is the negative y 
direction. In other words i x k equals minus j. We turn the screw clockwise to make 
it go in. To take out the screw, twist from k toward i. Then k x i equals plus j. 


tEverybody is a natural mathematician. That is the axiom behind this book. 
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To summarize : k x i = j and j x k - j have plus signs because kij and jki are in the 
same “cyclic order ' as ijk. (Antic yctic is minus.) The z-x-y and y-z-x axes form right- 
handed triples like x-y-z . 

THE FORMULA FOR THE CROSS PRODUCT 


We begin the algebra of A x B. It is essential for computation, and it comes out 
beautifully. The square roots in |A| |Bj |sin 0| will disappear in formula (6) for A x B, 
(The square roots also disappeared in A ■ B, which is |A| |B[ cos 0 . But |A| |Bj tan 0 
would be terrible.) Since A x B is a vector we need to find three components . 

Start with the two-dimensional case. The vectors a x i T a 2 \ and b x \ + b 2 \ are in the 
xy plane. Their cross product must go in the z direction. Therefore A x B= ? k 
and there is only one nonzero component. It must be |A||B| sin 0 {with the correct 
sign), but we want a better formula. There are two clean ways to compute A x B, 
either by algebra (a) or by a bridge ( b ) to the dot product and geometry: 


(а) {a l i + a 2 H x (h l i + h 2 |) = a 1 h 1 ix i + a l b 2 i x j + a 2 b x \ x l + a 2 b 2 j x j. (4) 
On the right are 0. a x b 2 k, -a 2 b x k, and 0. The cross product is (a x b 2 - a 2 b x )k, 

(б) Rotate B = b x \ + b 2 \ clockwise through 90 into B* = b 2 i ~ hi j- Its length is 
unchanged (and B ■ B* = 0), Then |A| |B| sin 0 equals J A| |B*j cos 0, which is A 4 B*; 


1 A 1 1 B| sin 0 = A * B* 



= a l b 2 -a 2 b l . 


(5) 


11F In the xy plane, A x B equals (a l 6 2 — ^ 2^1 )^- The parallelogram with 
sides A and B has area I a x b 2 - a 2 b x \. The triangle OAB has area b 2 - a 2 bi\. 


EXAMPLE 5 For A = i T- 2j and B = 4i T 5j the cross product is(l - 5 — 2 - 4)k = — 3k. 
Area of parallelogram = 3, area of triangle = 3/2. The minus sign in A x B = — 3k is 
absent in the areas. 

Note Splitting A x B into four separate cross products is correct, but it does not 
follow easily from |A] |B| sin Method (a) is not justified until Remark 1 below. An 
algebraist would change the definition of A x B to start with the distributive law 
(splitting rule) and the anticommutative law: 

A x (B + C) = (A x B) + {A x C) and A x B = - (B x A). 

THE CROSS PRODUCT FORMULA (3 COMPONENTS) 

We move to three dimensions. The goal is to compute ail three components of A x B 
(not just the length). Method (a) splits each vector into its i, j. k components, making 
nine separate cross products: 

{a 1 i + a 2 j + tf 3 k) x (b : i+ b 2 j T fc 3 k) — «|6 2 {i x i) + £j 1 fj 2 (i x j) + seven more terms. 

Remember ixi = jxj = kxk = 0. Those three terms disappear. The other six terms 
come in pairs, and please notice the cyclic pattern: 

FORMULA A x B = (n 2 fe 2 — a 2 b 2 ) i + Aj 3 )| + (a x b 2 - {6) 

The k component is the 2 x 2 answer. w r hen = b 2 ^ 0. The i component involves 
indices 2 and 3, j involves 3 and 1, k involves 1 and 2. The cross product formula is 
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written as a “determinant” in equation (14) below — many people use that form to 
compute A x B. 

EXAMPLE 6 (i + 2j + 3k) x (4i + 5j + 6k) = (2 • 6 — 3 - 5)i + (3 -4 — 1 • 6)j + (1 -5 — 2-4)k. 
The i, j, k components give A x B = - 3i 4* 6j - 3k. Never add the - 3, 6, and - 3. 

Remark 1 The three-dimensional formula (6) is still to be matched with A x B from 
geometry. One way is to rotate B into B* as before, staying in the plane of A and B. 
Fortunately there is an easier test. The vector in equation (6) satisfies all four geo- 
metric requirements on A x B: perpendicular to A, perpendicular to B, correct length , 
right-hand rule. The length is checked in Problem 16 — here is the zero dot product 
with A: 


A *(A x B) = a x (a 2 b 3 - a 3 b 2 ) + a 2 (a 3 b l - a^) + a 3 (a l h 2 - a 2 b x ) = 0. (7) 

Remark 2 (Optional) There is a wonderful extension of the Pythagoras formula 
a 2 4- b 2 = c 2 . Instead of sides of a triangle, we go to areas of projections on the yz, xz, 
and xy planes. 3 2 + 6 2 + 3 2 is the square of the parallelogram area in Example 6. 

For triangles these areas are cut in half. Figure 1 1.15a shows three projected trian- 
gles of area Its Pythagoras formula is (?) 2 + (£) 2 + (£) 2 = (area of PQR) 2 . 

EXAMPLE 7 P = ( 1 , 0, 0), Q = (0, 1,0 ), P = (0, 0, 1 ) lie in a plane. Find its equation. 

Idea for any P,Q, R. Find vectors A and B in the plane. Compute the normal N = A x B. 

Solution The vector from P to Q has components — 1, 1,0. It is A = j — i (subtract 
to go from P to Q). Similarly the vector from P to R is B = k - i. Since A and B are 
in the plane of Figure 1 1.15, N = A x B is perpendicular: 

( j - i) x (k - i) = ( j x k) - (i x k) - ( j x i) + (i x i) = i + j + k. (8) 

The normal vector is N = i + j + k. The equation of the plane is 1.x + 1>'+ 1 z = (l. 
With the right choice d = 1, this plane contains P, Q, R. The equation isx + y + z = 1. 

EXAMPLE 8 What is the area of this same triangle PQR ? 

Solution The area is half of the cross-product length |A x B| = |i + j + k| = N /3. 



Fig. 11.15 Area of PQR is v /3/2. N is PQ x PR. Volume of box is |A • (B x C)|. 


DETERMINANTS AND VOLUMES 

We are close to good algebra. The two plane vectors a t i + a 2 j and 6,i + b 2 \ are the 
sides of a parallelogram. Its area is a a 6 2 “ a 2 ^i' possibly with a sign change. There 
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is a special way to write these four numbers — in a square matrix." There is also a 
name for the combination that leads to area. It is the "determinant of the matrix 


The matrix is 


a i a 2 

b ] b 2 


its determinant is 


a x a 2 

b t b 2 


“ a j b 2 i ■ 


This is a 2 by 2 matrix (notice brackets) and a 2 by 2 determinant (notice vertical 
bars). The matrix is an array of four numbers and the determinant is one number: 


Examples of determinants: 


2 i 


2 1 


1 0 


II 

CTv 

1 

■J^ 

II 


= 0 , 


4 3 


2 1 


0 I 


- 1 , 


The second has no area because A = B, The third is a unit square (A = i, B = j). 

Now move to three dimensions, where determinants are most useful. The parallelo- 
gram becomes a parallelepiped. The word “box” is much shorter, and we will use it, 
but remember that the box is squashed . (Like a rectangle squashed to a parallelogram, 
the angles are generally not 90°. ) The three edges from the origin are A = (u, , a 2 , a 3 ), 
B = (^, 62 , 63 ), C = (c u c 2i c 2 y Those edges are at right angles only when A-B = 
A - C = B ■ C = 0. 

Question : What is the volume of the box ? The right-angle case is easy — it is length 
times width times height. The volume is |A| times |B| times |C|, when the angles are 
90°, For a squashed box (Figure 11,15) we need the perpendicular height, not the 
sloping height. 

There is a beautiful formula for volume, R and C give a parallelogram in the base, 
and |B x C] is the base area. This cross product points straight up. The third vector 
A points up at an angle— its perpendicular height is |A| cos 0 . Thus the volume is 
area |B x C| times |A| times cos 0 . The volume is the dot product of A with B x C. 


1 1G The triple scalar product is A * (B x C). Volume of box = | A ■ (B x C)j. 


Important: A*(B x C) is a number, not a vector. This volume is zero when A is in 
the same plane as B and C (the box is totally flattened). Then B x C is perpendicular 
to A and their dot product is zero. 

Useful facts : A ■ (B x C) = (A x B) * C = C M A x B) = B * (C x A). 

All those come from the same box, with different sides chosen as base — but no change 
in volume. Figure 11.15 has B and C in the base but it can be A and B or A and t 
The triple product A ■ (C x B) has opposite sign , since C x B = — (B x C). This order 
ACB is not cyclic like ABC and CAB and BCA. 

To compute this triple product A MB x C), we take BxC from equation ( 6 ): 


A*(B x C) — ^3^2) ."h b l Cj) + o 2 {h l c 2 b 2 Ci). ( 9 ) 

The numbers a 2 > multiply 2 by 2 determinants to give a 3 by 3 determinant! 
There are three terms with plus signs (like a 1 b 2 c 1 ). The other three have minus signs 
(like — a x b 2 c 2 ), The plus terms have indices 123, 231, 312 in cyclic order. The minus 
terms have anticyclic indices 132, 213, 321, Again there is a special way to write the 
nine components of A, B, C — as a “3 by 3 matrix.” The combination in (9), which 
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gives volume, is a “3 by 3 determinant:” 


matrix = 

a l a 2 a 3 

b\ b 2 b 2 

, determinant = A - (B x Q - 

^2 ^3 

b\ b 2 £>3 


Ci c 2 c 3 


Ci C 2 C3 


A single number is produced out of nine numbers, by formula (9). The nine numbers 
are multiplied three at a time, as in — except this product is not allowed. Each 
row and column must be represented once. This gives the six terms in the determinant: 


dl G 2 

b 1 b 2 ^3 
Cl C 2 C 2 


+ Qjb^Ci + Q 2 biC 2 
” ^1^3 C 2 ” ” ^3^2 C 1 


( 10 ) 


The trick is in the + signs. Products down to the right are 11 plus": 


2 1 I 
1 2 t 
1 1 2 


2 * 2*2 + 1 - 1 * 1 + 1 * 1*1 8 + 1+1 

= 4. 

— 2 * 1*1 — 1 * 1*2 — 1 - 2-1 - 2 - 2-2 


With practice the six products like 2 -2 -2 are done in your head. Write down only 
8 + 1 + 1— 2 — 2 — 2 = 4. This is the determinant and the volume. 

Note the special case when the vectors are i, j, k. The box is a unit cube: 


volume of cube = 


1 0 0 
0 1 0 
0 0 1 


1+0 + 0 

= 1. 

- 0 - 0-0 


If A. B. C lie in the same plane, the volume is zero. A zero determinant is the test 
to see whether three vectors lie in a plane, Here row A = row B— row C: 

0 ■ 1 *1 + 1 ■ 0 * (— 1 ) + (— 1 ) • (— 1 ) • 0 

= 0 . ( 11 ) 

- 0 - 0 - 0 - 1 -(- 1)- 1 -(- 1)- 1 -(- 1 ) 

Zeros in the matrix simplify the calculation. All three products with plus signs — 
down to the right — are zero. The only two nonzero products cancel each other. 

If the three - l's are changed to + 1 ’s, the determinant is — 2. The determinant can 
be negative when all nine entries are positive! A negative determinant only means 
that the rows A, B, C form a “left-handed triple.” This extra information from the 
sign — right-handed vs. left-handed — is free and useful, but the volume is the absolute 
value. 

The determinant yields the volume also in higher dimensions. In physics, four 
dimensions give space-time. Ten dimensions give superstrings. Mathematics uses all 
dimensions. The 64 numbers in an 8 by 8 matrix give the volume of an eight- 
dimensional box — with 8! = 40,320 terms instead of 3! = 6. Under pressure from my 
class I omit the formula. 


0 1 -1 

-1 1 0 

-1 0 1 



Question When is the point (x, y> z) on the plane through the origin containing B 
and C? For the vector A = xi + jrj + zk to lie in that plane, the volume A • (B x C) 
must be zero. The equation of the plane is determinant — zero , 

Follow this example for B = j - i and C = It - i to find the plane parallel to B and C: 


x y 2 
-1 1 0 
-1 0 1 


x - 1 - 1 + y - 0 - (— 1 ) + 2 - 0 • (- 1 ) 
— x*0-0 — 1 * ( — 1) — r * 1 - ( — 1) 


( 12 ) 


This equation is x + y + z = 0. The normal vector N = B x C has components 1,1,1. 


THE CROSS PRODUCT AS A DETERMINANT 


There is a connection between 3 by 3 and 2 by 2 determinants that you have to see. 
The numbers in the top row multiply determinants from the other rows: 


°2 

b 2 hj hj 
Cl «2 Cj 



b± 

k 


*1 

k 

+ a 3 

*1 

k 


, £z 

£1 

a 2 

Cl 

C 3 

Cl 




(13) 


The highlighted product ai(b 2 c 3 - b 3 c 2 ) gives two of the six terms. Alt six products 
contain an a and b and c from different columns. There are 3! = 6 different orderings 
of columns 1 , 2 , 3. Note how a 3 multiplies a determinant from columns 1 and 2. 

Equation (13) is identical with equations (9) and (10). We are meeting the same six 
terms in different ways. The new feature is the minus sign in front of a 2 — and the 
common mistake is to forget that sign. In a 4 by 4 determinant, a u ~a 2 , a 3 , -a 4 
would multiply 3 by 3 determinants. 

Now comes a key step. We write A x B as a determinant. The vectors i, j, k go in 
the top row, the components of A and B go in the other rows. The “ determinant ” is 
exactly A x B: 


A x B = 

i 

j 

k 

= i 

a 2 



fli 03 

+ k 


a 2 


a 2 

Cj 

k 

k 

- j 

k k 

b 1 

k 


k 

b 2 











This time we highlighted the j component with its minus sign. There is no great 
mathematics in formula (14) — it is probably illegal to mix i, j, k with six numbers but 
it works. This is the good way to remember and compute A x B. In the example 
(j - i) x (k - i) from equation ( 8 ), those two vectors go into the last two rows: 

i 1 k 

^1 1 0 

zl 0 1 

The k component is highlighted, to see a 3 b 2 - a 2 b i again. Note the change from 
equation (11), which had 0, 1, - 1 in the top row. That triple product was a number 
(zero). This cross product is a vector i + j + k. 


1 0 


-1 0 


1 


”i 


+ k 


0 1 

-1 1 


zl 0 
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Review question 1 With the i, j, k row changed to 3, 4, 5, what is the determinant? 
Answer 3-lH-4-lH-5*l = 12. That triple product is the volume of a box. 

Review question 2 When is A x B = 0 and when is A • (B x C) = 0? Zero vector, 
zero number. 

Answer When A and B are on the same line. When A, B, C are in the same plane. 

Review question 3 Does the parallelogram area |A x B| equal a 2 by 2 determinant? 
Answer If A and B lie in the xy plane, yes . Generally no . 

Review question 4 What are the vector triple products (AxB)xC and 
Ax(BxC)? 

Answer Not computed yet. These are two new vectors in Problem 47. 


Review question 5 Find the plane through the origin containing A = i + j + 2k and 
B = i + k. Find the cross product of those same vectors A and B, 

Answer The position vector P = xi + yj + zk is perpendicular toN = AxB: 


P*(AxB) = 


X 

y 

z 


» j 

k 

1 

l 

2 

= x + _y — z = 0. A x B = 

1 1 

2 

1 

0 

1 


1 0 

1 


H.3 EXERCISES 


Read-through questions 

The cross product A x B is a Q whose length is b . 
Its direction is c to A and B. That length ts the area of 
a d . whose base is |A| and whose height is When 

A = ^1 + a 2 \ and B = b x i + b 2 j, the area is f . This equals 
a 2 by 2 a . In general |A * B[ 2 + |A x B| 2 — h . 

The rules for cross products are Ax A= I and 
A x B = — f J 1 and Ax(BlC) = AxB+ fc . In 
particular AxB needs the * -hand rule to decide its 
direction. If the lingers curl from A towards B (not more than 
180°), then m points n . By this rule i x j = o 

and i X It = P and j x k = q . 

The vectors aj + + a 3 k and bj + b 2 j + b 2 k have cross 

product t [ + » | + t k. The vectors A = 

i + j + k and B = i + j have AxB= u . {This is also the 
3 by 3 determinant v .1 Perpendicular to the plane con- 
taining (0, 0, 0), (1, 1, 1), (1, 1, 0) is the normal vector N = 
w . The area of the triangle with those three vertices is 
* . which is half the area of the parallelogram with fourth 

vertex at y . 

Vectors A, B, C from the origin determine a Its vol- 
ume |A '(_a_)| comes from a 3 by 3 There are six 

terms, C with a plus sign and d with minus. In every 
term each row and C is represented once. The rows 
(1, 0, 0), (0, 0, 1), and (0, 1, 0) have determinant — f . That 
box is a s . but its sides form a H -handed triple in 
the order given. 


If A, B, C lie in the same plane then A • (B x Q is I . 

For A = xl + yj + zY the first row contains the letters J . 

So the plane containing B and C has the equation K = 
0. When B = I + j and C ^ k that equation is L . B x C is 
M . 

A 3 by 3 determinant splits into n 2 by 2 determinants. 
They come from rows 2 and 3, and are multiplied by the 
entries in row 1. With i, j, k in row 1, this determinant equals 
the o product. Its j component is P , including the 
Q sign which is easy to forget. 


Compute the cross products 1-8 from formula (6) or the deter- 
minant (14). Do one example both ways. 

1 (ixj)xlt 2 (i x j) x i 

3 (2i + 3j) x (i + k) 4 (2i + 3j + k) x (2i + 3j - k) 

5 (2i + 3j + k) x (i - j - k) 6 (i + j — k) x (i - \ + k) 

7 (i + 2j + 3k) x (4i — 9j) 

8 (i cos 0 -hj sin 0) x (i sin 0 — j cos 0) 

9 When are |A x B| = |A| |B| and |A*(B x C)| = |A| |B| |C|? 

10 True or false: 

(a) A x B never equals A * B. 

(b) If A x B = 0 and A * B = 0, then either A = 0 or B = 0. 

(c) If A x B — A x C and A ^ 0, then B = C. 
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In 1 1—16 find |A x B| by equation (1) and then by computing 
A x B and its length. 

11 A = i+j + k*B = i 12 A = i+j,B = i-j 

13 A= -B 14 A = i+j s B = j + k 

15 A = a } \ + a^j, B = b A \ + b 2 j 

16 A = {a ly a 2 , fl 3 ), B = (b l , b 2 , b 3 ) 


In Problem 16 (the general case), equation (1) proves that the 
length from equation (6) is correct. 

17 True or false, by testing on A = i, B = j, C = k: 

(a) A x (A x B) = 0 (b)A-(BxC) = (A x B)C 

(c) A • (B x C) = C - (B x A) 

(d) (A - B) x (A + B) = 2(A x B). 

18 (a) From A x B = — (B x A) deduce that Ax A = 0. 

(b) Split (A + B) x (A + B) into four terms, to deduce that 
(A x B) = — (B x A). 


What are the normal vectors to the planes 19-22? 

19 (2, 1, ())•(*, y, z) = 4 20 3x + 4z = 5 



x y 

z 



x y z 

21 

1 1 

0 

= 2 

22 

1 1 1 


0 1 

1 



[ 1 2 


Find N and the equation of the plane described in 23-29. 

23 Contains the points (2, 1, 1), (1, 2, 1), (l, 1, 2) 

24 Contains the points (0, 1, 2), (l s 2, 3), (2, 3, 4) 

25 Through (0, 0, 0), (1, 1, 1), fab, c) [What ifa = b = c?] 

26 Parallel to i + j and k 

27 N makes a 4 5°' angle with i and j 

28 N makes a 60 c angle with i and j 

29 N makes a 90 ' angle with i and j 

30 The triangle with sides i and j is as large as the 

parallelogram with those sides. The tetrahedron with edges 

i, j, k is _ as large as the box with those edges. Extra 

credit: In four dimensions the "simplex” with edges i, j, k, I 
has volume = . 

31 If the points (x, y, z), (1, 1, 0), and (1, 2, 1) lie on a plane 
through the origin, what determinant is zero? What equation 
does this give for the plane? 

32 Give an example of a right-hand triple and left-hand triple. 
Use vectors other than just i, j, k. 


33 When B = 3i + j is rotated 90° clockwise in the xy plane 

it becomes B* = __ * When rotated 90° counterclock- 
wise it is , When rotated 180 J it Is 

34 From formula (6) verify that B • (A x B) = 0, 

35 Compute 


1 2 3 


2 1 0 


1 0 2 

2 3 4 


1 2 1 

> 

0 3 0 

3 4 6 


0 1 2 


2 0 1 


36 Which of the following are equal to A x B? 

(AtB)xB, (— B) x (— A), |A||Bl|sin0[, (A + C)x(B-C), 
ftA - B) x (A + B). 

37 Compare the six terms on both sides to prove that 


fli 

b , 

Cl 






bi 

c 2 

= 

bi 

b 2 

h 

^3 

bs 

c 3 


Cl 

Cl 

C J 


The matrix is "transposed " — same determinant. 

38 Compare the six terms to prove that 
Ui a 2 a ^ 
b i b 2 b$ 

Cy c 2 c 3 

This is an “expansion on row 2.” Note minus signs. 

39 Choose the signs and 2 by 2 determinants in 
fli <*2 
hi b 2 hs 

ci 

40 Show that (AxB) + (BxC) + (Cx A) is perpendicular to 
B — A and C — B and A — C. 


= + f : 


«2 

b 2 b 2 


±c 2 


± C 3 


a 2 a^ 

+ ^2 


\-bi 

O i o 2 

Ci c$ 


c i C* 


C, c 2 


Problems 41-44 compute the areas of triangles. 

41 The triangle PQR in Example 7 has squared area 
{Jlilf = <*) 2 + (§) 2 + (i) 2 , from the 3D version of Pythagoras 
in Remark 2. Find the area of PQR when P = (a, 0, 0), Q = 
(0, b, 0), and R = (0, 0, c). Check with A|A x B|. 

42 A triangle in the xy plane has corners at (a, , b, ), ( a 2i b 2 ) 
and (u 3s b 3 ). Its area A is half the area of a parallelogram. 
Find two sides of the parallelogram and explain why 

A = il(ai- Oi)(i>3 - a, )(b 2 - &i }|. 

43 By Problem 42 find the area A of the triangle with corners 
(2, 1) and (4, 2) and (1, 2). Where is a fourth corner to make a 
parallelogram? 
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44 Lifting the triangle of Problem 42 up to the plane z = 1 
gives comers (a x , b u 1 \{a z ,b 2 , b^ 1). The area of the 
triangle times £ is the volume of the upside-down pyramid 
from (0, 0, 0) to these comers. This pyramid volume is £ the 
box volume, so £ (area of triangle) = £ (volume of box): 


area of triangle = - 


ui b x 
@2 ^2 
*3 *>3 


1 

1 

1 


Find the area A in Problem 43 from this determinant. 


45(1) The projections of A = a x \ + a 2 j + a 3 k and B = 
M + b 2 \ + h 3 k onto the xy plane are . 

(2) The parallelogram with sides A and B projects to a 

parallelogram with area * 

(3) General fact: The projection onto the plane normal to 
the unit vector d has area (A x B)* n* Verify for n = k. 

46 (a) For A = i + j - 4k and B = - i + j, compute (AxB)-i 
and (A x B) • j and (A xB)*k. By Problem 45 those are 
the areas of projections onto the yz and xz and xy planes, 
(b) Square and add those areas to find |A x B| 2 . This is 
the Pythagoras formula in space (Remark 2). 


47 (a) The triple cross product (A x B) x C is in the plane of 
A and because it is perpendicular to the cross product 


(b) Compute (A x B) x C when A = a{i + a 2 j + d 3 k, B = 
fiii + M + W^C-L 

(c) Compute (A • C)B — (B • C)A when C = i* The answers 
in (b) and (c) should agree. This is also true if C = j or C = 
k or C = c,i + c 3 j + c 3 k, That proves the tricky formula 

(AxB)xC = (A'C)B-(B'QA (+) 

48 Take the dot product of equation (*) with D to prove 

(A x BMC x D) = (A * C)(B *D) — (B * C)(A - D). 

49 The plane containing P = (0, 1, 1) and Q = (I, 0, 1) and 
R = (1,1,0) is perpendicular to the cross product N = 

* Find the equation of the plane and the area of 

triangle PQR. 

50 Let P- (1,0, -i\Q = (hU 1), * = (2,2, 1). Choose 5 so 
that PQRS is a parallelogram and compute its area* Choose 
T, U , V so that OPQRSTUV is a box (parallelepiped) and 
compute its volume. 


11.4 Matrices and Linear Equations 


We are moving from geometry to algebra* Eventually we get back to calculus, where 
functions are nonlinear — but linear equations come first. In Chapter 1, y = mx + b 
produced a line. Two equations produce two lines. If they cross, the intersection point 
solves both equations — and we want to find it. 

Three equations in three variables x, y, z produce three planes. Again they go 
through one point (usua//y). Again the problem is to find that intersection point 
— which solves the three equations. 

The ultimate problem is to solve n equations in n unknowns* There are n hyper- 
planes in /i-dimensional space, which meet at the solution* We need a test to be sure 
they meet* We also want the solution* These are the objectives of linear algebra , which 
joins with calculus at the center of pure and applied mathematics*! 

Like every subject, linear algebra requires a good notation* To state the equations 
and solve them, we introduce a “ matrix ” The problem will be Au - d. The solution 
will be ii = A l i. It remains to understand where the equations come from, where 
the answer comes from, and what the matrices A and A ~ 1 stand for* 

TWO EQUATIONS IN TWO UNKNOWNS 

Linear algebra has no reason to choose one variable as special* The equation y - y 0 = 
m(x — x 0 ) separates y from x. A better equation for a line is ax + by = d. (A vertical 


t Linear algebra dominates some applications while calculus governs others* Both are essential. 
A fuller treatment is presented in the author’s book Linear Algebra and Its Applications 
(Harcourt Brace Jovanovich, 3rd edition 1988), and in many other texts. 
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line like x = 5 appears when b = 0. The first form did not allow slope m = oo.) This 
section studies two lines: 

a ^ + b l y = d 1 (1) 

a 2 x + b 2 y^d 2 . 

By solving both equations at once, we are asking (jc, y) to lie on both lines. The 
practical question is: Where do the lines cross? The mathematician’s question is: Does 
a solution exist and is it unique? 

To understand everything is not possible. There are parts oflife where you never 
know what is going on {until too late). But two equations in two unknowns can have 
no mysteries. There are three ways to write the system — by rows, by columns, and 
by matrices. Please look at all three, since setting up a problem is generally harder 
and more important than solving it. After that comes the concession to the real world: 
we compute x and y. 

EXAMPLE 1 How do you invest $5000 to earn S400 a year interest, if a money market 
account pays 5% and a deposit account pays 10%? 

Set up equations by rows : With x dollars at 5% the interest is .05*. With y dollars at 
10% the interest is .10y. One row for principal, another row for interest: 

* + v = 5000 

( 2 ) 

.05_x+.l0y = 400. 

Same equations by columns: The left side of (2) contains x times one vector plus y 
times another vector. The right side is a third vector. The equation by columns is 


1 

+ >' 

1 



5000 

.05 

.10 


400 


Same equations by matrices: Look again at the left side. There are two unknowns x 
and y, which go into a vector u. They are multiplied by the four numbers 1, .05, 1, 
and .10, which go into a two by two matrix A, The left side becomes a matrix times 
a vector: 



X 


5000] 

1 

O 

o 
li 



1 

o 

o 

j 


Now you see where the “rows’" and “columns" came from. They are the rows and 
columns of a matrix. The rows entered the separate equations (2). The columns 
entered the vector equation (3). The matrix-vector multiplication Au is defined so 
that all these equations are the same: 


Au by rows: 


Au by columns: 


J 

i 

i," 

i 

i 

i 

& 

b 2 

i 

i 


i 

a 

zx- 

1 

1 

* 

1 

J*i b 2 J 

1 

1 


a 1 x + b x y 


a 2 x + b 2 y 





+ y 


b i 

b 2 


(each row is 
a dot product) 

(combination of 
column vectors) 


A is the coefficient matrix. The unknown vector is u. The known vector on the right 
side, with components 5000 and 400, is d. The matrix equation is Au = d. 
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Fig. 11.16 Each row of Au = d gives a line. Each column gives a vector. 


This notation Au = d continues to apply when there are more equations and more 
unknowns. The matrix A has a row for each equation (usually m rows). It has a column 
for each unknown (usually n columns). For 2 equations in 3 unknowns it is a 2 by 3 
matrix (therefore rectangular). For 6 equations in 6 unknowns the matrix is 6 by 6 
(therefore square). The best way to get familiar with matrices is to work with them. 
Note also the pronunciation: “matri sees” and never “matrixes/' 

Answer to the practical question The solution is x = 2000, y=3000. That is the 
intersection point in the row picture (Figure 1 1.16). It is also the correct combination 
in the column picture. The matrix equation checks both at once, because matrices 
are multiplied by rows or by columns. The product either way is d: 


1 

1 " 

~2000~ 

2000 + 3000 


[5000] 

05 

.10 

3000 

(.05)2000 + {.10)3000 


1 

O 

O 

TT 


Singular case In the row picture, the lines cross at the solution. But there is a case 
that gives trouble. When the lines are parallel , they never cross and there is no solution. 
When the lines are the same, there is an infinity of solutions: 

2x + y = 0 2x + y = 0 

parallel lines same line (5) 

2x + y — 1 Ax + 2y = 0 

This trouble also appears in the column picture. The columns are vectors a and b. 
The equation Au = d is the same as xa + yb = d. We are asked to find the combination 
of a and b (with coefficients x and y) that produces d. In the singular case a and b lie 
along the same line (Figure 11.17). No combination can produce d, unless it happens 
to lie on this line. 




Fig. 11.17 Row and column pictures: singular (no solution) and nonsingular (x = y = 1). 
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The investment problem is notfsingular, and 2000 a + 3000 b equals d. We also drew 
Example 2: The matrix A multiplies u - (1, 1) to solve x + 2 y = 3 and x — y = 0: 



1 

2 

i 


4 +2 


“3" 


T 


2 


‘3 

•1u — 

1 

-1 



1-1 


0 

By columns 

l 

+ 

-1 


0 


The crossing point is (1, 1) in the row picture. The solution is x—l t y—l in the 
column picture (Figure 11.17b). Then 1 times a plus 1 times b equals the right side d. 

SOLUTION BY DETERMINANTS 


Up to now we just wrote down the answer. The real problem is to find x and y w hen 
they are unknown. We solve two equations with letters not numbers: 

a l x + b l y = d l 


a 2 x + b 2 y = d 2 . 

The key is to eliminate x . Multiply the first equation by a 2 and the second equation 
by a x . Subtract the first from the second and the x's disappear: 

(aib 2 - a 2 b i )y = (a i d 1 -a 2 d l ). (6) 

To eliminate >\ subtract 6, limes the second equation from b 2 times the first: 


(Mi -b x a 2 )x = (b 2 d , -b x d 2 ). 


(7) 


What you see in those parentheses are 2 by 2 determinants! Remember from 
Section 1 1.3: 


The determinant of 


a x 

a 2 


h x 

h 


is the number 


r 

*2 


b l 


= a i b 2 -a 2 h l . 


This number appears on the left side of (6) and (7). The right side of (7) is also a 
determinant — but it has d * s in place of as. The right side of (6) has *fs in place of 
b's. So x and y are ratios of determinants, given by Cramer's Rule: 


11H Cramer *s Rule 


The solution is x = 


d, 

*>1 


a \ 

d l 


d 2 

b 2 


a 2 

d 2 



b i 

t y 




“2 

*>2 


a 2 

h 



The investment example is solved by three determinants from the three columns: 


1 1 


5000 1 


1 5000 


= .05 


= 100 


.05 .10 


400 .10 


.05 400 


Cramer's Rule has x- 1 00/.05 = 2000 and y~ 15Q/.05 = 3000. This is the solution. 
The singular case is when the determinant of A is zero — and we can't divide by it. 


Ilf Cramer's Rule breaks down when del A — 0— which is the singular case. 
Then the lines in the row r picture are parallel, and one column is a multiple of 
the other column. 
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EXAMPLE 3 The lines 2x + y = 0, 2x + y = 1 are parallel. The determinant is zero: 


2 f 



0" 

has det A = 

2 1 

2 1 

J _ 


l 


2 1 


The iines in Figure 11.17a don’t meet. Notice the columns: [f] is a multiple of [[]. 

One final comment on 2 by 2 systems. They arc small enough so that all solution 
methods apply. Cramer’s Rule uses determinants . Larger systems use elimination 
(3 by 3 matrices are on the borderline), A third solution {the same solution!) comes 
from the inverse matrix A~\ to be described next. But the inverse is more a symbol 
for the answer than a new way of computing it, because to find A~ l we still use 
determinants or elimination. 


THE INVERSE OF A MATRIX 


The symbol A~ l is pronounced “A inverse” It stands for a matrix — the one that 
solves Au = d. I think of A as a matrix that takes u to d. Then A~ 1 is a matrix that 
takes d back to u. If An = d then u = A ~ l d (provided the inverse exists). This is exactly 
like functions and inverse functions: g(x) = y and x = g~ *(y). Our goal is to find A " 1 
when we know A. 

The first approach will be very direct. Cramer’s Rule gave formulas for x and y, 
the components of u. From that rule we can read off /l -1 , assuming that D — 
a l b 2 ~ a 2 b l is not zero. D is det A and we divide by it: 


Cramer, u = — 

l 

o- 

1 

tj 

1 

This is A 1 d = — 

D 

L“Mi +a t d 2 J 

D 


b 2 

-a 2 



( 8 ) 


The matrix on the right (including I/D in all four entries) is A~ l . Notice the sign 
pattern and the subscript pattern. The inverse exists if D is not zero — this is impor- 
tant. Then the solution comes from a matrix-vector multiplication, A~ : times d. We 
repeat the rules for that multiplication: 


DEFINITION A matrix M times a vector v equals a vector of dot products: 

(row l)‘v 


My = 


row I 
row 2 


(row 2) * v 


( 9 ) 


Equation (8) follows this rule with M = A 1 and v = d. Look at Example 1: 



1 1 " 

1 

.10 -l" 


2 -20 

A = 


, det A = .05, A = — 


— 



o 

L/l 

o 

.05 

-.05 1 


o 1 

1 

1 


There stands the inverse matrix. It multiplies d to give the solution u; 


2 - 20~| I" 5C 

tool 

(2) (5000) - (20) (400)” 


2000" 

- 1 2oJ ^ 

100 

(-1)(5000) + {20)(400) 


3000 


The formulas work perfectly, but you have to see a direct way to reach A " *d. Multiply 
both sides of An = & by A~ l . The multiplication “cancels” A on the left side, and 
leaves u = A ~ 1 d. This approach comes next. 
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MATRIX MULTIPLICATION 


To understand the power of matrices, we must multiply them. The product of A' 1 
with An is a matrix times a vector. But that multiplication can be done another way. 
First A ” 1 multiplies A y a matrix times a matrix. The product A ~ l A is another matrix 
(a very special matrix). Then this new matrix multiplies u. 

The matrix-matrix rule comes directly from the matrix-vector rule. Effectively, a 
vector v is a matrix V with only one column. When there are more columns, M times 
V splits into separate matrix-vector multiplications, side by side: 


DEFINITION A matrix M times a matrix V equals a matrix of dot products: 


MV = 


row 1 
row 2 




EXAMPLE 4 


"1 2] ["5 

6" 


1 -5 + 2-7 

t-6 + 2-8 


_3 4JL7 

8_ 


3 ■ 5 + 4- 7 

3 ■ 6 + 4 ■ 8 



(row 1) * Vj (row I) 

(row 2) * Vj (row r 2) 

19 22 
43 50 


EXAMPLE 5 Multiplying A 1 times A produces the “identity matrix' 


A~ l A = ■ 


;] 

[: 


b 2 

-*r 



(iibz Ojb] 

0 

_^ a 2 

a i_ 

r«i b, 


0 

— a 2 b i + dib 2 

D 

1 — 

tv) 


D 


l o 
o 1 


(10) 


( 11 ) 


This identity matrix is denoted by /. It has l’s on the diagonal and 0’s off the diagonal. 
It acts like the number 1 . Every vector satisfies l u = u. 


11 J (Inverse matrix and identity matrix) A A ~ 1 = I and A i A = I and /ti = u: 

:] --t: I [: !;]-[;] » 

Note the placement of a , b, c $ d. With these letters D is ad — be. 


The next section moves to three equations. The algebra gets more complicated (and 
4 by 4 is worse). It is not easy to write out A -1 . So we stay longer with the 2 by 2 
formulas, where each step can be checked. Multiplying Au = d by the inverse matrix 
gives A~ t Au = A~ l i ~ and the left side is / u = u. 



Fig. 11.18 Rotate v forward into Av. Rotate d backward into A l d + 
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EXAMPLE 6 




cos 0 
sin 0 


-sin 9 
cos 9 


rotates every v to Ay, through the angle 0. 


Question 1 
Question 2 
Question 3 


Where is the vector y = 
What is 


V 

0 


rotated to? 


Which vector u is rotated into d = 




•? 


1 


Solution I 


Solution 2 


v rotates into A\ = 


det A = 1 so A 


cos 9 

-sin 9 

r 


cos 6 

sin 9 

cos 9 

0 


sin 9 

f cos 9 

sin 9 

1 




— sin 0 cos 9 


= rotation through —9. 


Solution 3 


If Au = d then u = A 


cos 9 

sin 9 

- o 


sin 9 

— sin 0 

cos 0 

1 

1 

1 1 

cos 9 


Historical note I was amazed to learn that it was Leibniz (again!) who proposed the 
notation we use for matrices. The entry in row i and column j is The identity 
matrix has = a l2 = 1 and a 12 — o 2i = 0. This is in a linear algebra book by Charles 
Dodgson — better known to the world as Lewis Carroll, the author of Alice in 
Wonderland . I regret to say that he preferred his own notation instead of a u . 
4 T have turned the symbol toward the left, to avoid all chance of confusion with 
It drove his typesetter mad. 


PROJECTION ONTO A PLANE = LEAST SQUARES FITTING BY A LINE 


We dose with a genuine application. It starts with three-dimensional vectors a, b, d 
and leads to a 2 by 2 system. One good feature: a, b, d can be n-dimensionai with no 
change in the algebra. In practice that happens. Second good feature: There is a 
calculus problem in the background. The example is to fit points by a straight line , 

There are three ways to state the problem, and they look different: 

1. Solve xa + yb = d as well as possible (three equations, two unknowns x and y). 

2 . Project the vector d onto the plane of the vectors a and b. 

3. Find the closest straight line (“least squares' 1 ) to three given points. 

Figure 11.19 shows a three-dimensional vector d above the plane of a and b. Its 
projection onto the plane is p = xa + yb. The numbers x and y are unknown, and 
our goal is to find them. The calculation will use the dot product, which is always 
the key to right angles. 

The difference d - p is the “error” There has to be an error, because no combination 
of a and b can produce d exactly. (Otherwise d is in the plane.) The projection p is 
the closest point to d, and it is governed by one fundamental law: The error is 
perpendicular to the plane. That makes the error perpendicular to both vectors a 
and b; 


a ■ (xa + yb - d) = 0 and b * (xa + yb - d) = 0. 


(13) 
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Rewrite those as two equations for the two unknown numbers x and y\ 

(a ~ft)x + (a **b)y = a * d 
(b ~a)x + (b -b )y = b ~d_ 

These are the famous normal equations in statistics, to compute x and y and p. 


EXAMPLE 7 For a = (1, 1, 1) and b = (1, 2, 3) and d = (0, 5, 4), solve equation (14): 

3x + 6y= 9 x= — 1 

gives so p = — a + 2b = (1, 3, 5) = projection. 

6x+ 14y = 22 y- 2 

Notice the three equations that we are not solving (we can’t): xa + yb = d is 


x+ y - 0 1 1 

x+2y = 5 with the 3 by 2 matrix >4=12. 


x + 3y = 4 


3 


( 15 ) 


For d = (0, 5, 4) there is no solution; d is not in the plane of a and b, For p = (1, 3, 5) 
there is a solution, x = - 1 and y = 2. The vector p is in the plane. The error d - p 
is (-1, 2, —1). This error is perpendicular to the columns (1, 1, 1) and {1, 2, 3), so it is 
perpendicular to their plane. 


SAME EXAMPLE (written as a line-fitting problem) Fit the points (1, 0) and (2, 5) and 
(3, 4) as closely as possible (“least squares”) by a straight line. 

Two points determine a line. The example asks the line /= x + yt to go through three 
points. That gives the three equations in (15), which can’t be solved with two un- 
knowns. We have to settle for the closest line, drawn in Figure 1 1 .19b, This line is 
computed again below, by calculus. 

Notice that the closest line has heights 1, 3, 5 where the data points have heights 
0, 5, 4, Those are the numbers in p and d! The heights 1, 3, 5 fit onto a line; the heights 
0, 5, 4 do not. In the first figure, p = (1, 3, 5) is in the plane and d = (0, 5, 4) is not. 
Vectors in the plane lead to heights that lie on a line. 

Notice another coincidence. The coefficients x = — 1 and y = 2 give the projection 
- a + 2b. They also give the closest line /= - 1 + 2r. All numbers appear in both 
figures. 



Fig, 11.19 Projection onto plane is (1, 3, 5) with coefficients —1, 2. Closest line has heights 
1, 3, 5 with coefficients —1, 2. Error in both pictures is —1, 2, —1. 
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Remark Finding the closest line is a calculus problem: Minimize a sum of squares. 
The numbers x and y that minimize E give the least squares solution: 

E(x> y) — (x + y - 0) 2 + (x + 2 y - 5) 2 + (x + 3y - 4) 2 . (16) 

Those are the three errors in equation (15), squared and added. They are also the 
three errors in the straight line fit, between the line and the data points. The projection 
minimizes the error (by geometry), the normal equations (14) minimize the error (by 
algebra), and now calculus minimizes the error by setting the derivatives of £ to zero. 

The new feature is this: E depends on two variables x and y. Therefore E has two 
derivatives. They both have to be zero at the minimum. That gives two equations for 
x and y : 

x derivative of E is zero: 2(x + y) + 2(x + 2 y — 5) + 2(x + 3y — 4) =0 

y derivative of E is zero: 2(x T y) + 2(x + 2 y — 5)(2) + 2(x T 3 y - 4)(3) = 0. 

When we divide by 2, those are the normal equations 3x + 6y = 9 and 6x T 14y = 
22. The minimizing x and y from calculus are the same numbers —1 and 2. 

The x derivative treats y as a constant. The y derivative treats x as a constant. 
These are partial derivatives. This calculus approach to least squares is in Chapter 13, 
as an important application of partial derivatives. 

We now summarize the /east squares problem — to find the closest line to n data 
points. In practice n may be 1000 instead of 3. The points have horizontal coordinates 
b lt b 2t b„* The vertical coordinates are d n . These vectors b and d, 

together with a = (I, 1, ..., 1), determine a projection— the combination p — xa + yb 
that is closest to d. This problem is the same in n dimensions — the error d — p is 
perpendicular to a and b. That is still tested by dot products, p ■ a = d * a and p ■ b = 
d * b, which give the normal equations for x and y: 

(a ■ a)x + (a ■ b )y = a * d (n) x + (2 ^ )y = 2<f* 

or * (17) 

(b * a)x + (b * b)y — b ■ d ( 26* )x + (Lbf)y= 26^. 


1 1K The least squares problem projects d onto the plane of a and b. The 
projection is p = xa + yb, in n dimensions. The closest line is/ = x + yf, in two 
dimensions. The normal equations (17) give the best x and y. 


11.4 EXERCISES 


Read-through questions 

The equations 3.x + >■ = 8 and x + y = 6 combine into the vec- 
tor equation .x a + v b = c - d. The left side is 
Au i, with coefficient matrix A = d and unknown vector 
a- e . The determinant of A is t so this problem 
is not g The row picture shows two intersecting h « 
The column picture shows xa + yb = d. where a = l and 
b = i The inverse matrix is A~ 1 = k . The solution 
is u = ^ _1 d = i . 


A matrix-vector multiplication produces a vector of dot 
m from the rows, and also a combination of the n : 





'3 f| 



- 

“ 

1 

1 1_ 

5_ 




If the entries are a , b , c\ d , the determinant is D = o . A ~ 1 
is r P 1 divided by D< Cramer's Rule shows components 
of u = A “ L d as ratios of determinants: x = q ID and y — 
_>_ //)■ 
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A matrix-matrix multiplication MV yields a matrix of dot 
products, from the rows of * and the columns of t ; 


M" *R :] 

p nr 1/2 -i/2] r. 
U lJL-l/2 3/2 J ^ L 



The last line contains the u matrix, denoted by L It has 
the property that I A = A I = v for every matrix A, and 
/u — w for every vector u. The inverse matrix satisfies 
A~ l A — x Then Au = d is solved by multiplying both 
sides by v to give u = * There is no inverse matrix 

when A , 


The combination xa + yb is the projection of d when the 
error B is perpendicular to C and D If a = 
(1, 1, 1), b = (1, 2, 3), and d =(0, 8 , 4), the equations for x and 
y are E . Solving them also gives the closest F to the 
data points ( 1 , 0), c , and (3, 4), The solution is x = 0, y = 
2, which means the best line is H . The projection is 
0a + 2b = i The three error components are J 
Check perpendicularity: K — 0 and l- = 0. Applying 
calculus to this problem, x and y minimize the sum of squares 
E = m 


In 1~8 find the point (x, y) where the two lines intersect (if they 
do). Also show how the right side is a combination of the 
columns on the left side (if it is). Also find the determinant D. 


1 x+y = 7 
x — y = 3 

3 3x — y = 8 
x — 3y = G 

5 2x — 4y = 0 
x — 2y = 0 

7 ax + by = 0 
lax + 2 by ~ 2 


2 2 x + y= 11 
x + y= 6 

4 x + 2y = 3 
2x + 4y = 7 

6 lOx + y = 1 
x + y = 1 

8 ax + by = 1 
cx + dy = 1 


9 Solve Problem 3 by Cramer’s Rule, 


10 Try to solve Problem 4 by Cramer’s Rule, 

11 What are the ratios for Cramer’s Rule in Problem 5? 


12 If A = I show how Cramer’s Rule solves Au = d. 


13 Draw the row picture and column picture for Problem 1. 

14 Draw the row and column pictures for Problem 6 * 

15 Find A -1 in Problem 1. 

16 Find A ' 1 in Problem 8 if ad — be — 1. 

17 A 2 by 2 system is singular when the two lines in the row 

picture . This system is still solvable if one equation 

is a of the other equation. In that case the two lines 

are ____ and the number of solutions is 


18 Try Cramer’s Rule when there is no solution or infinitely 
many: 

3x + y = 0 3x+ y = 1 

or 

6 x + 2 y = 2 6 x + 2 y = 2 , 

19 Au = d is singular when the columns of A are . 

A solution exists if the right side d is ♦ In this solvable 

case the number of solutions is . 

20 The equations x — y = di and 9x — 9y = d 2 can be solved 
if 

21 Suppose x = J billion people live in the U.S. and y = 5 
billion live outside. If 4 per cent of those inside move out and 
2 per cent of those outside move in, find the populations d t 
inside and d 2 outside after the move. Express this as a matrix 
multiplication Au = d (and find the matrix). 

22 In Problem 21 what is special about a i + a 2 and b\ + b 2 
(the sums down the columns of A)? Explain why d ^ + d 2 equ- 
als x + y. 

23 With the same percentages moving, suppose d i — 0.58 bil- 
lion are inside and d 2 = 4.92 billion are outside at the end. Set 
up and solve two equations for the original populations x 
and y, 

24 What is the determinant of A in Problems 21-23? What 
is A “ *? Check that A " l A = J. 


25 The equations ax + y = 0, x + ay = 0 have the solution 
x = y = 0, For which two values of a are there other solutions 
(and what are the other solutions)? 

26 The equations ax + by = 0, ex + dy — 0 have the solution 

x = y = 0, There are other solutions if the two lines are 
, This happens if a , b t c t d satisfy . 

27 Find the determinant and inverse of A = [3 $]. Do the 

same for 2A, A"\ — A, and /♦ 

28 Show that the determinant of A ' 1 is 1/det A: 

T dj{ad-bc) — hi (ad — be) 

A -1 = 

I — cl(ad - be) aj{ad — be) 


29 Compute AB and BA and also BC and CB\ 
4 


A = 


2 -1 


B 


3 n r 1 1 
= c = 

11 02 


Verify the associative law : AB times C equals A times BC ♦ 

30 (a) Find the determinants of A, B t AB t and BA above, 
(b) Propose a law for the determinant of BC and test it. 

b 


31 For A = 


and B 


f . 

= wn 

j fc J 


write out AB and 


factor its determinant into [ad — bc)(eh — fg). Therefore 
det(AB) = (det A)(det B). 
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32 Usually det(4 + iJ) does not equal deM + detB, Find 
examples of inequality and equality. 

33 Find the inverses, and check A ~ = / and BB ~ 1 = J, for 


4" 

and 

B = 

"2 2" 

2 



0 1 


34 In Problem 33 compute AB and the inverse of AB. Check 
that this inverse equals fi -1 times A ~ l . 

35 The matrix product ABB~ 1 A~ 1 equals the mat- 

rix. Therefore the inverse of AB is . Important ; The 

associative law in Problem 29 allows you to multiply BB~ l 
first* 

36 The matrix multiplication C~ 1 B~ 1 A~ l ABC yields the 

matrix* Therefore the inverse of ABC is . 


39 Plot the three data points (-1, 2), (0, 6), (1, 4) in a plane. 
Draw the straight line x + yt with the same x and y as in 
Problem 38. Locate the three errors up or down from the data 
points and compare with Problem 38. 


40 Solve equation (14) to find the combination xa + yb of 
a = (1, 1 , 1) and b = (-l, 1, 2) that is closest to d = (1, 1, 3)* 
Draw the corresponding straight line for the data points 
(-1, 1),(1, 1), and (2, 3). What is the vector of three errors and 
what is it perpendicular to? 

41 Under what condition on d lt d 2 ^d 3 do the three points 
(0, ^), (1, d 2 ), (2, d$) lie on a line? 

42 Find the matrices that reverse x and y and project: 


M 


x 

y. 


r ■ 

y 

X 


and 



x 

0 


37 The equations x + 2y + 3z and 4x + 5y + cz = 0 always 
have a nonzero solution* The vector u = (x, y t z) is required 
to be to v = (1, 2, 3) and w = (4, 5, c), So choose u = 


38 Find the combination p = xa + yb of the vectors a = 
(1, 1, 1) and b = {— 1, 0, 1) that comes closest to d = (2, 6, 4). 

(a) Solve the normal equations (14) for x and y. (b) Check that 
the error d — p is perpendicular to a and b. 


43 Multiplying by P- 


.5 .5 
.5 ,5 


projects u onto the 45° line. 


(a) Find the projection Pu of u = [£]. 

(b) Why does P times P equal P? 

(c) Docs P ~ 1 exist? What vectors give Pu = 0? 


44 Suppose ii is not the zero vector but 4u = 0, Then A' 1 
can’t exist: Tt would multiply and produce u. 
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This section moves from two to three dimensions. There are three unknowns x, y s z 
and also three equations. This is at the crossover point between formulas and 
algorithms — it is real linear algebra. The formulas give a direct solution using det- 
erminants. The algorithms use elimination and the numbers x, y, z appear at the 
end. In practice that end result comes quickly. Computers solve linear equations by 
elimination. 

The situation for a nonlinear equation is similar. Quadratic equations 
ax 2 + bx + c ~ 0 are solved by a formula. Cubic equations are solved by Newton’s 
method (even though a formula exists). For equations involving x 5 or x 10 , algorithms 
take over completely. 

Since we are at the crossover point, we look both ways. This section has a lot to 
do, in mixing geometry, determinants, and 3 by 3 matrices: 

1. The row picture: three planes intersect at the solution 

2. The column picture: a vector equation combines the columns 

3. The formulas: determinants and Cramer’s Rule 

4. Matrix multiplication and A -1 

5* The algorithm: Gaussian elimination. 

Part of our goal is three-dimensional calculus* Another part is n-dimensional algebra. 
And a third possibility is that you may not take mathematics next year. If that 
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happens, I hope you will use mathematics. Linear equations are so basic and impor- 
tant, in such a variety of applications, that the effort in this section is worth making. 

An example is needed. It is convenient and realistic if the matrix contains zeros. 
Most equations in practice are fairly simple— a thousand equations each with 990 
zeros would be very reasonable. Here are three equations in three unknowns: 


x + y — 1 

x + 2 z = 0 ( 1 ) 

— 2y + 2z = — 4. 


In matrix-vector form, the unknown u has components x, y, z. The right sides 1,0, -4 
go into d. The nine coefficients, including three zeros, enter the matrix A: 


1 1 o' 


X 


r 

1 0 2 


y 

= 

0 

0-2 2 


z 


-4 


or An = d. (2) 


The goal is to understand that system geometrically, and then solve it, 

THE ROW PICTURE: INTERSECTING PLANES 

Start with the first equation x + y — L In the xy plane that produces a line. In three 
dimensions it is a plane . It has the usual form ax + by + cz = except that c happens 
to be zero. The plane is easy to visualize (Figure 1 1 .20a), because it cuts straight down 
through the line. The equation jc + y= 1 allows z to have any value, so the graph 
includes all points above and below the line. 

The second equation x + 2z = 0 gives a second plane, which goes through the 
origin. When the right side is zero , the point (0, 0, 0) satisfies the equation. This time y 
is absent from the equation, so the plane contains the whole y axis. All points (0, y, 0) 
meet the requirement x + 2z = 0. The normal vector to the plane is N = i 4- 2k. The 
plane cuts across, rather than down, in 1 1 . 20 b, 

Before the third equation we combine the first t wo. The intersection of two planes 
is a line ♦ In three-dimensional space, two equations (not one) describe a line. The 
points on the line have to satisfy x + y = 1 and also x + 2r = 0. A convenient point 
is P = (0, 1 , 0). Another point is Q — (— I, 2, 5 ). The line through P and g extends out 
in both directions. 

The solution is on that line. The third plane decides where. 



Fig. 14.20 First plane, second plane, intersection line meets third plane at solution. 
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The third equation - 2y + 2z = - 4 gives the third plane — which misses the origin 
because the right side is not 2 ero. What is important is the point where the three 
planes meet : The intersection line of the first two planes crosses the third plane. 
We used determinants (but elimination is better) to find x - — 2, y = 3, z ~ L This 
solution satisfies the three equations and lies on the three planes. 

A brief comment on 4 by 4 systems. The first equation might bex + y + z- f = 0. 
It represents a three-dimensional “hyperplane” in four-dimensional space. (In physics 
this is space-time.) The second equation gives a second hyperplane, and its intersection 
with the first one is two-dimensional. The third equation (third hyperplane) reduces 
the intersection to a line. The fourth hyperplane meets that line at a point, which is 
the solution. It satisfies the four equations and lies on the four hyperplanes. In this 
course three dimensions are enough. 

COLUMN PICTURE: COMBINATION OF COLUMN VECTORS 


There is an extremely important way to rewrite our three equations. In (1) they were 
separate, in (2) they went into a matrix. Now they become a vector equation: 



V 


r 


"o" 


r 

X 

1 

+ y 

0 

+ z 

2 

= 

0 


0 


-2 


2 


-4 

L J 


(3) 


The columns of the matrix are multiplied by x , y 7 1. That is a special way to see matrix- 
vector multiplication: Au is a combination of the columns of A. We are looking for 
the numbers x, y, z so that the combination produces the right side d. 

The column vectors a, b, c are shown in Figure 11,21a. The vector equation is 
xfl + yb 4- zc = d, The combination that solves this equation must again be x — - 2, 
y ~ 3, z= 1. That agrees with the intersection point of the three planes in the row 
picture. 



new c *. 


a, b, c in 
same plane 



• d not in that plane: 
no solution 

0 = 1c + 2b - 2a 


*9 11.21 Columns combine to give d. Columns combine to give zero (singular case). 


THE DETERMINANT AND THE INVERSE MATRIX 

For a 3 by 3 determinant, the section on cross products gave two formulas. One was 
the triple product a • (b x e). The other wrote out the six terms: 

det A = a • (b x e) = a t (b 2 c 3 - b 3 c 2 ) + u 2 (b 3 Ci - &ic 3 ) + 
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Geometrically this is the volume of a box * The columns a, b, c are the edges going out 
from the origin. In our example the determinant and volume are 2: 


01 

b r 

Cl 


i 

l 

0 


b 2 

C 2 

= 

l 

0 

2 

03 

b 3 

C 3 


0 

-2 

2 


(I)(0)(2)-(l)(-2){2) + (l)(-2)(0) 
-(l)(l)(2) + (0){l)(2)-(0)(0)(0) 


A slight dishonesty is present in that calculation, and will be admitted now* In 
Section 11.3 the vectors A, B s C were rows. In this section a, b, c are columns , It doesn’t 
matter, because the determinant is the same either way* Any matrix can be 
“transposed” — exchanging rows for columns — without altering the determinant* The 
six terms (flif> 2 c 3 is the first) may come in a different order, but they are the same six 
terms* Here four of those terms are zero, because of the zeros in the matrix. The sum 
of all six terms is D — det A = 2* 

Since D is not zero, the equations can be solved* The three planes meet at a point* 
The column vectors a, b, e produce a genuine box, and are not flattened into the same 
plane (with zero volume). The solution involves dividing by D — which is only possible 
if D — det A is not zero* 


11L When the determinant D is not zero, A has an inverse: AA 1 = A X A ~ 
L Then the equations An - d have one and only one solution u = A ~ *d* 


The 3 by 3 identity matrix I is at the end of equation (5)* Always /u = u. 

We now compute A -1 , first with letters and then with numbers* The neatest 
formula uses cross products of the columns of A — it is special for 3 by 3 matrices. 


Every entry is divided by D : The inverse matrix is A 


D 


b x c 

c x a 
a x b 


( 4 ) 


To test this formula, multiply by A. Matrix multiplication produces a matrix of dot 
products — from the rows of the first matrix and the columns of the second, A ~ l A — I: 


b x c 

c x a 
a x b 


b c 


a 1 (b x e) b * (b x c) c * (b x c) 
a • (c x a) b * (c x a) c • (c x a) 
a*(axb) b*(axb) c(axb) 



“l 

0 

<f 

= 

0 

1 

0 


0 

0 

1 


( 5 ) 


On the right side, six of the triple products are zero* They are the off-diagonals like 
b • (b x c), which contain the same vector twice. Since b x c is perpendicular to b, this 
triple product is zero. The same is true of the others, like a *(a x b) = 0* That is the 
volume of a box with two identical sides* The six off-diagonal zeros are the volumes 
of completely flattened boxes* 

On the main diagonal the triple products equal D. The order of vectors can be abc 
or bca or cab, and the volume of the box stays the same* Dividing by this number D , 
which is placed outside for that purpose, gives the l T s in the identity matrix J* 

Now we change to numbers. The goal is to find A~ l and to test it* 



”i 

1 

o" 

is A 1 = ^ 

4 

-2 

2” 

EXAMPLE 1 The inverse of A — 

i 

0 

2 

-2 

2 

-2 


0 

-2 

2 


-2 

2 

-1 
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That comes from the formula, and it absolutely has to be checked. Do not fail to 
multiply A~ l times A (or A times A" 1 ), Matrix multiplication is much easier than 
the formula for A~ l . We highlight row 3 times column I, with dot product zero: 


4 

-2 

2 “ 


“1 

1 

o" 

_ 1 
~ 2 

4-2 

4-4 

-4 + 4 ' 


"l 

0 

o~ 

-2 

2 

-2 


1 

0 

2 

-2 + 2 

-2 + 4 

4-4 

= 

0 

1 

0 

-2 

2 

-1 


0 

-2 

2 


-2 + 2 

-2 + 2 

4-2 


0 

0 

1 


Remark on A~ l Inverting a matrix requires D ^ (X We divide by D = det A . The 
cross products b x c and c x a and a x b give A ~ 1 in a neat form, but errors are 
easy. We prefer to avoid writing i 5 j, k. There are nine 2 by 2 determinants to be 
calculated, and here is A~ l in full — containing the nine “ cofactors v divided by D: 


A* = 


D 


b 2 c 3 - b 3 c 2 

C 2 a 3 ” C 3 a 2 
a 2^3 — 


b 3 ci — b x c 2 

C 3 a l - C i a 3 

a 3 b 1 - a x b 2 


b x c 2 — b 2 c x 

c l a 2 ~ c 2 a l 

&ib 2 ~ tj 2 b x 


(6) 


Important : The first row of A' 1 does not use the first column of A, except in 1/0. 
In other words, b x c does not involve a. Here are the 2 by 2 determinants that 
produce 4, - 2, 2 — which is divided by D = 2 in the top row of A~ 


1 1 0 

1 0 2 

0-2 2 


1 1 0 

1 0 2 

0-2 2 

_ 


1 1 0 

1 0 2 

0-2 2 


+ - + 

- + - 

+ - + 


( 7 ) 


The second highlighted determinant looks like +2 not -2. But the sign matrix on 
the right assigns a minus to that position in A ~ We reverse the sign of b x c 3 — b 3 c x , 
to find the cofactor b 3 c : - b x c 3 in the top row of (6). 

To repeat: For a row of A~ l , cross out the corresponding column of A . Find the three 
2 by 2 determinants, use the sign matrix , and divide by D. 



“1 

1 

r 


~i 

-1 

o" 

EXAMPLE 2 B~ 

0 

1 

i 

has D = 1 and B 1 = 

0 

1 

-1 


0 

0 

i 


0 

0 

1 


The multiplication BB _1 = / checks the arithmetic. Notice how } } in B leads to a 
zero in the top row of B~ V To find row 1, column 3 of B" 1 we ignore column 1 and 
row 3 of B. (Also: the inverse of a triangular matrix is triangular.) The minus signs 
come from the sign matrix. 


THE SOLUTION u = A d 

The purpose of A~* is to solve the equation Au = d. Multiplying by A -1 produces 
Ju = /4 -1 d. The matrix becomes the identity, /u equals u, and the solution is 
immediate: 


b X C 



1 

“ D 

d*(bxc) 

c x a 


d 

d • (c x a) 

a x b 



d ■ (a x b) 
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By writing those components x, y t z as ratios of determinants, we have Cramer s Rule: 


11 M {Cramer’s Rule) 


The solution is x = 


d b c| 
|a b c|’ 


|a d c| 
|a b cf 


_ ! a b d| 
|a b c] 


( 10 ) 


The right side d replaces, in turn, columns a and b and c. All denominators are D = 
a • (b x c), The numerator of x is the determinant d • {b x c) in (9), The second numera- 
tor agrees with the second component d *(c x a), because the cyclic order is correct. 
The third determinant with columns abd equals the triple product d *(a x b) in A~ 1 u. 
Thus (10) is the same as (9). 


EXAMPLE A: Multiply by A 1 to find the known solution x = - 2, y = 3, 2 = 1: 


“4-2 2” 


1 

_ 1 
“ 2 

i 

4-8“ 


“-2“ 

-2 2 -2 

-2 2-1 

_ 


1 

1 

4^ O 

1 

-2 + 8 

-2 + 4 

— 

3 

1 


EXAMPLE B: Multiply by B 1 to solve Bu = d when d is the column (6, 5, 4): 



“1 

-1 

0” 


"6” 


~r 



1 

r 


~r 


”6“ 

11 = B *d = 

0 

1 

-1 


5 

= 

1 

. Check Bu = 

0 

1 

1 


1 

= 

5 


0 

0 

1 


4 


4 

i_ _ 


0 

0 

1 


4 


4 


EXAMPLE C; Put d = (6, 5, 4) in each column of B. Cramer's Rule gives 11 = (1, 1,4): 


6 1 1 


1 6 l! 


1 1 6 


t 1 1 

5 1 1 

' =1 

0 5 l; 

= 1 

0 1 5 

= 4 all divided by D = 

0 1 1 

4 0 1 


0 4 1 ! 


0 0 4 


0 0 1 


This rule fills the page with determinants. Those are good ones to check by eye, 
without writing down the six terms (three -h and three — ). 

The formulas for A ~ 1 are honored chiefly in their absence. They are not used by 
the computer, even though the algebra is in some ways beautiful. In big calculations, 
the computer never finds A ~ 1 — just the solution. 

We now took at the singular case D = 0. Geometry-algebra-algorithm must all 
break down. After that is the algorithm: Gaussian elimination. 


THE SINGULAR CASE 


Changing one entry of a matrix can make the determinant zero. The triple product 
a*(bxc), which is also the volume, becomes D = 0, The box is flattened and the 
matrix is singular. That happens in our example when the lower right entry is changed 
from 2 to 4: 


S = 


1 

1 

0 


1 

0 

-2 


0 

2 

4 


has determinant 0 = 0. 
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This does more than change the inverse. It destroys the inverse. We can no longer 
divide by D , There is no S ” 1 , 

What happens to the row picture and column picture? For 2 by 2 systems, the 
singular case had two parallel lines. Now the row picture has three planes* which 
need not be parallel. Here the planes are not parallel Their normal vectors are the 
rows of S , which go in different directions. But somehow the planes fail to go through 
a common point. 

What happens is more subtle. The intersection line from two planes misses the 
third plane. The line is parallel to the plane and stays above it (Figure 1 1.22a), When 
all three planes are drawn, they form an open tunnel. The picture tells more than the 
numbers* about how three planes can fail to meet* The third figure shows an end 
view, where the planes go directly into the page. Each pair meets in a line, but those 
lines don’t meet in a point. 



fl 9 - 11,22 The row picture in the singular case: no intersection point, no solution. 


When two planes are parallel, the determinant is again zero. One row of the matrix 
is a multiple of another row. The extreme case has all three planes parallel — as in a 
matrix with nine l’s. 

The column picture must also break down. In the 2 by 2 failure (previous section), 
the columns were on the same line. Now the three columns are in the same plane. The 
combinations of those columns produce d only if it happens to lie in that particular 
plane. Most vectors d will be outside the plane, so most singular systems have no 
solution. 

When the determinant is zero. Am = d has no solution or infinitely many , 

THE ELIMINATION ALGORITHM 

Go back to the 3 by 3 example Am = d. If you were given those equations, you would 
never think of determinants. You would — quite correctly— start with the first equa- 
tion. It gives jc = 1 — y, which goes into the next equation to eliminate x: 

x+ y =1 

x + 2z = 0 — — - — - — ► 1 — y + 2z = 0 
— 2y + 2z ~ —4 -2y + 2z = -4. 

Stop there for a minute. On the right is a 2 by 2 system for y and z. The first equation 
and first unknown are eliminated—exactly what we want. But that step was not 
organized in the best way, because a “1” ended up on the left side. Constants should 
stay on the right side — the pattern should be preserved. It is better to take the same 
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step by subtracting the first equation from the second. 
x+ y =1 

x + 2 z= 0 ► - y + 2z= — 1 (11) 

— 2y + 2z = - 4 -2y + 2z = -4. 

Same equations, better organization. Now look at the corner term — y. Its coefficient 
- 1 is the second pivot. (The first pivot was + 1, the coefficient of x in the first corner.) 
We are ready for the next elimination step: 

Plan: Subtract a multiple of the “pivot equation” from the equation below it. 
Goal : To produce a zero below the pivot, so y is eliminated. 

Method: Subtract 2 times the pivot equation to cancel -2 y. 

— y + 2z = - 1 

- (12) 

— 2y + 2z = — 4 — 2z = — 2. 

The answer comes by back substitution. Equation (12) gives 2=1. Then equation (1 1) 
gives y = 3. Then the first equation gives x — - 2. This is much quicker than determi- 
nants. You may ask: Why use Cramer’s Rule? Good question. 

With numbers elimination is better. It is faster and also safer. (To check against 
error, substitute - 2, 3, 1 into the original equations.) The algorithm reaches the 
answer without the determinant and without the inverse . Calculations with letters use 
det A and A ~ l , 

Here are the steps in a definite order (top to bottom): 

Subtract a multiple of equation 1 to produce Ox in equation 2 
Subtract a multiple of equation 1 to produce Ox in equation 3 
Subtract a multiple of equation 2 (new) to produce Oy in equation 3. 

EXAMPLE (notice the zeros appearing under the pivots): 

x + y + z= 1 x + y + z=l x + y + z = 1 
2x + 5y + 3z- 7 -► 3y + z - 5 -> 3y + z = 5 
4x + 7y + 6z = 1 1 3y + 2z = 7 2 = 2. 

Elimination leads to a triangular system. The coefficients below the diagonal are zero. 

First z = 2, then y = 1, then x = — 2. Back substitution solves triangular systems (fast). 

As a final example, try the singular case Su = d when the comer entry is changed 
from 2 to 4. With D = 0, there is no inverse matrix S’ 1 . Elimination also fails, by 
reaching an impossible equation 0 = — 2: 

x+ y = 1 x + y = 1 x + y = 1 

x +2z= 0 _► — y + 2z = ^ 1 -► ^ y + 2z = - 1 

— 2y + 4z = — 4 - 2y + 4z = — 4 0 = - 2 

The three planes do not meet at a point — a fact that was not obvious at the start. 
Algebra discovers this fact from D- 0. Elimination discovers it from 0 = - 2. The 
chapter is ending at the point where my linear algebra book begins. 
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One final comment. In actual computing, you will use a code written by profession- 
als, The steps will be the same as above, A multiple of equation 1 is subtracted from 
each equation below it, to eliminate the first unknown x. With one fewer unknown 
and equation, elimination starts again. (A parallel computer executes many steps at 
once.) Extra instructions are included to reduce roundoff error. You only see the 
result! But it is more satisfying to know what the computer is doing. 

In the end, solving linear equations is the key step in solving nonlinear equations. 
The central idea of differential calculus is to linearize near a point. 


11.5 EXERCISES 


Read- through questions 

Three equations in three unknowns can be written as Au = 
d. The q u has components x t y t z and A is a b The 
row picture has a c for each equation. The first two 
planes intersect in a d and all three planes intersect in 
a a . which is * The column picture starts with 
vectors a, b, c from the columns of 9 and combines them 
to produce h . The vector equation is i = d. 

The determinant of A is the triple product J This is 
the volume of a box, whose edges from the origin are k 
If det A = I then the system is __m_. Otherwise there 
is an n matrix such that A ~ 1 A — o (the P mat- 

rix). In this case the solution to Au = d is u = q 

The row r s of A -1 are the cross products bxc, ' 

s divided by Z>, The entries of A ~ 1 are 2 by 2 t 

divided by D , The upper left entry equals u The 2 by 2 

determinants needed for a row of A~ x do not use the corre- 
sponding v of A. 


2 The planes x-hy = 0, x + y + 2 = 1, and y + z = 0 intersect 
at u = (x,>\4 

3 The point u = (x, y, z) is on the planes x—y\ y = z, 
x-z= 1. 


4 A combination of a = (1,0, 0) and b = (0,2,0) and c = 
(0, 0, 3) equals d = (5, 2, 0). 

5 Show that Problem 3 has no solution in two ways: find 
the determinant of A f and combine the equations to produce 
0 = 1 . 

6 Solve Problem 2 in two ways: by inspiration and Cramer's 
Rule. 


7 Solve Problem 4 in two ways: by inspection and by com- 
puting the determinant and inverse of the diagonal matrix 


A = 


1 0 0 
0 2 0 
0 0 3 


The solution is u = A " 1 d. Its first component x is a ratio 
of determinants, |dbc| divided by w . Cramer's Rule 
breaks down when det A = * . Then the columns a, b, c 

lie in the same v There is no solution to xa + yb + zc = 
d, if d is not on that i In a singular row picture, the 
intersection of planes 1 and 2 is a to the third plane. 


8 Solve the three equations of Problem 1 by elimination. 

9 The vectors b and c lie in a plane which is perpendicular 

to the vector . In case the vector a also lies in that 

plane, it is also perpendicular and a* = 0. The 

of the matrix with columns in a plane is . 


In practice u is computed by B , The algorithm starts 
by subtracting a multiple of row 1 to eliminate x from C . 
If the first two equations are x — y = 1 and 3x + z = 7, this 
elimination step leaves d , Similarly x is eliminated from 
the third equation, and then E is eliminated. The equ- 
ations are solved by back f When the system has no 
solution, we reach an impossible equation like s The 
example x — y = 1, 3x + z = 7 has no solution if the third equ- 
ation is H 

Rewrite 1-4 as matrix equations Au = d (do not solve). 

1 d = (0, 0, 8) is a combination of a = ( 1 , 2, 0) and b = (2, 3, 2) 
and c = (2, 5, 2). 


10 The plane a x x -\- b x y 4 - c { z = d { is perpendicular to its 

normal vector = , The plane a 2 x + b 2 y + c 2 z = 

d 2 is perpendicular to N 2 = . The planes meet in a 

line that is perpendicular to both vectors, so the line is parallel 

to their product. If this line is also parallel to the 

third plane and perpendicular to N s , the system is . 

The matrix has no , which happens when 

(Nj xN 2 )-N, = 0. 


Problems 11-24 use the matrices A , B, C. 



"l 

4 

o! 


"o 

0 

r 


1 

-1 

— 3~ 

A = 

0 

2 

6 i 


2 

1 

0 

c = 

-1 

2 

0 


0 

0 

3 


6 

4 

0 


0 

-1 

3 
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11 Vectors and Matrices 


11 Kind the determinants |4|, |J?|, |C|, Since A is triangular, 

its determinant is the product , 

12 Compute the cross products of each pair of columns in B 
(three cross products). 


13 Compute the inverses of A and B above. Check that 
A 1 /t = / and B 1 B = L 



V 


V 


14 Solve Au = 

0 

and Bu = 

0 

. With this right side d, why 


0 


0 



is u the first column of the inverse? 


15 Suppose all three columns of a matrix add to zero, as in 
C above. The dot product of each column with v = (1, l s l) is 

* All three columns lie in the same . The 

determinant of C must be . 

16 Find a nonzero solution to Cu = 0. Find all solutions to 
Cu = 0. 

17 Choose any right side d that is perpendicular to v = 
(1, 1, l)and solve Cu = d, Then find a second solution, 

18 Choose any right side d that is not perpendicular to v = 
(1, 1, 1). Show by elimination (reach an impossible equation) 
that Cu = d has no solution. 

19 Compute the matrix product AB and then its determinant. 
How is det AB related to det A and del U? 

20 Compute the matrix products BC and CB , All columns of 

CB add to , and its determinant is , 

21 Add A and C. by adding each entry of A to the corre- 
sponding entry of C. Check whether the determinant of A + C 
equals det A + det C. 

22 Compute 2A by multiplying each entry of A by 2. The 
determinant of 2 A equals _______ times the determinant of 

4, 

23 Which four entries of A give the upper left corner entry p 
of A~\ after dividing by D = det A1 Which four entries of A 
give the entry q in row 1, column 2 of A “ *? Find p and q. 

24 The 2 by 2 determinants from the first two rows of B are 
— 1 (from columns 2, 3) and —2 (from columns 1, 3) and 

__ (from columns 1,2), These numbers go into the 

third of B~ \ after dividing by and chang- 
ing the sign of , 

25 Why does every inverse matrix A~ 1 have an inverse? 

26 From the multiplication ABB~ V A' X = / it follows that 

the inverse of AB is . The separate inverses come in 

order. If you put on socks and then shoes, the 

inverse begins by taking off 


27 Find the determinants of these four permutation matrices : 


"o 1 o" 


"o 0 f 


~0 1 o" 

! 0 0 

Q = 

0 1 0 

PQ = 

0 0 1 

0 0 1 


1 0 0 


1 0 0 


and QP = , Multiply u = (x, y, i) by each permuta- 

tion to find Pu, Qu, PQu, and QPu . 

28 Find all six of the 3 by 3 permutation matrices (including 
/), with a single 1 in each row and column. Which of them 
are “even” (determinant 1) and which are “odd” (determinant 
-i)? 

29 How many 2 by 2 permutation matrices arc there, includ- 
ing /? How many A by 4? 

30 Multiply any matrix A by the permutation matrix P and 
explain how PA is related to A , In the opposite order explain 
how AP is related to A. 


31 Eliminate x from the last two equations by subtracting 
the first equation. Then eliminate y from the new third equa- 
tion by using the new second equation: 

A- + y + z = 2 x + y =1 

(a) x + 3y + 3z = 0 (b) x + z = 3 

x + 3y + 7z = 2 y + z = 5, 

After elimination solve for z,y\ x (back substitution). 

32 By elimination and back substitution solve 

x + 2_y + 2z = 0 x-y = 1 

(a) 2x + 3y + 5z = 0 (b) x -z = 4 

2 y + 2z = 8 y -z = l. 

33 Eliminate v from equation 2 by using equation 1: 

x -f- 2 y + 2z — 0 
2.x + 4y + 5z = 0 
2v + 2z = 8. 


Why can't the new second equation eliminate y from the third 
equation? Is there a solution or is the system singular? 

Note: If elimination creates a zero in the “pivot position,” 
try to exchange that pivot equation with an equation below 
it. Elimination succeeds when there is a full set of pivots, 

34 The pivots in Problem 32a are 1, —1, and 4, Circle those 
as they appear along the diagonal in elimination. Check that 
the product of the pivots equals the determinant, (This is how 
determinants are computed.) 

35 Find the pivots and determinants in Problem 31, 


36 Find the inverse of A = 


1 I 0 
0 1 1 
0 0 I 


and also of B = A 2 . 



11.5 Linear Algebra 


37 The symbol a tJ stands for the entry in row i, column j. 
Find a l2 and a 2l in Problem 36* The formula £ a t jb jk gives 
the entry in which row and column of the matrix product 
Aff* 

38 Write down a 3 by 3 singular matrix S in which no two 
rows are parallel. Find a combination of rows 1 and 2 that is 
parallel to row 3. Find a combination of columns 1 and 2 that 
is parallel to column 3. Find a nonzero solution to £u = 0* 


39 Compute these determinants. The 2 by 2 matrix is invert- 
ible if . The 3 by 3 matrix (is)(is not) invertible. 
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CHAPTER 12 


Motion Along a Curve 



12.1 The Position Vector 



This chapter is about ‘Vector functions*” The vector 2i + 4j + 8k is constant* The 
vector R(t) = tl + 1 2 j + f 3 k is moving. It is a function of the parameter t, which often 
represents time. At each time t, the position vector R(t) locates the moving body: 


position vector = R(() = x(t)i + y{t ] | + z{t) k* (1) 

Our example has x = t, y = t 2 7 z — t 3 , As t varies, these points trace out a curve in 
space , The parameter £ tells when the body passes each point on the curve* The 
constant vector 2i + 4j + 8k is the position vector R(2) at the instant £ = 2* 

What are the questions to be asked? Every student of calculus knows the first 
question: Find the derivative * If something moves, the Navy salutes it and we differen- 
tiate it* At each instant, the body moving along the curve has a speed and a direction* 
This information is contained in another vector function — the velocity vector v(£) 
which is the derivative of R(f): 


, , rfR dx , dy , dz , 
y(t) = t = t 1 + :rJ + t k 

dt dt dt dt 


( 2 ) 


Since i, j, k are fixed vectors, their derivatives are zero. In polar coordinates i and j 
are replaced by moving vectors* Then the velocity v has more terms from the product 
rule (Section 12*4)* 

Two important cases are uniform motion along a line and around a circle * We study 
those motions in detail (v = constant on line, v = tangent to circle)* This section also 
finds the speed and distance and acceleration for any motion R(t). 

Equation (2) is the computing rule /or the velocity dR/dt It is not the definition of 
dR/dt, which goes back to basics and does not depend on coordinates: 


dR AR ^ 

— = lim — = hm 
dt Ar-0 At At o 


R(t + At)-R(t) 
At 
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We repeat: R is a vector so AR is a vector so dR/dt is a vector. All three vectors are 
in Figure 12*1 (£ is not a vector!)* This figure reveals the key fact about the geometry: 
The velocity v — dR/dt is tangent to the curve , 
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The vector AR goes from one point on the curve to a nearby point. Dividing by 
Ar changes its length, not its direction. That direction lines up with the tangent to 
the curve, as the points come closer. 


EXAMPLE 1 R(r) = ri + t 2 } + t 3 k v{t) = i + 2*j + 3t*k 

This curve swings upward as f increases. When t = 0 the velocity is v = i. The tangent 
is along the x axis, since the j and k components are zero. When t — 1 the velocity is 
i + 2} + 3k, and the curve is climbing. 

For the shadow on the xy plane, drop the k component Position on the shadow 
is ti + f 2 j. Velocity along the shadow is i + 2tj. The shadow is a plane curve. 



12.1 Position vector R, change AR, 
velocity dR/dt, 


v -i + j + 2k 
'6 = 0.3, 3) 

4 R a*//’*(0. 2 , i) 



x - f, y = 2 + /, z - l +2 1 
? 2 

Rg. 12.2 Equations of a line, with and 
without the parameter t. 


EXAMPLE 2 Uniform motion in a straight line: the velocity vector v is constant 

The speed and direction don’t change. The position vector moves with dRfdt = v: 

R(t) = R 0 + » (So fixed, v fixed, t varying) (3) 

That is the equation of a line in vector form. Certainly dR/dt = v. The starting point 
R 0 = x 0 i + yj + z 0 k is given. The velocity v = v x i + i? 2 j + 0 3 k is also given. Separating 
the x, y and z components, equation ( 3 ) for a line is 

line with parameter: x = x 0 + tv u y = y 0 + ti? 2 , z-z 0 -\-tv 2 . (4) 

The speed along the line is |v| = yfv\ + v\ + U 3 . The direction of the line is the unit 
vector v/[v|. We have three equations for x, y, z , and eliminating t leaves two equations. 
The parameter t equals (x — x 0 )(v i from equation (4). It also equals (y - y 0 )/u 2 and 
[z — z 0 )jv 2 . So these ratios equal each other, and t is gone: 

line without parameter: * — — — - — — - - — — . (5) 

v x v 2 

An example is x = y/2 = zj 3. In this case (x 0 , y 0 , z 0 ) = (0, 0, 0) — the line goes through 
the origin. Another point on the line is (x, y, z) = ( 2 , 4, 6 ), Because t is gone, we cannot 
say when we reach that point and how fast we are going. The equations x/4 = y /8 = 
z/12 give the same line. Without r we can’t know the velocity v = dR/dt 

EXAMPLE 3 Find an equation for the line through P = (0, 2, 1 ) and Q = (1, 3, 3). 

Solution We have choices! R 0 can go to any point on the line. The velocity v can 
be any multiple of the vector from P to g. The decision on R 0 controls where we 
start, and v controls our speed. 

The vector from P to Q is i + j + 2k. Those numbers 1,1,2 come from subtracting 
0, 2, 1 from 1, 3, 3. We choose this vector i + j + 2k as a first v, and double it for a 



448 


12 Motion Along a Curve 


second v. We choose the vector R 0 = P as a first start and R 0 = Q as a second start. 
Here are two different expressions for the same line — they are P + t\ and Q + r(2v): 

R(r) = (2j + k) + j(i 4 j 4 2k) R*(r) = (i 4 3j + 3k) 4 1 { 2i + 2 j + 4k). 

The vector R(f) gives x = r, y = 2 + f, z = 1 4 2f. The vector R* is at a different point 
on the same line at the same time: x* = 1 + 2r, y* = 3 4 It, z* = 3 + 4t. 

If I pick t = 1 in R and f = 0 in R*, the point is (1, 3, 3). We arrive there at different 
times. You are seeing how parameters work, to tell “where” and also “when.” If t 
goes from — x to + oc, all points on one line are also on the other line. The path is 
the same, but the “twins” are going at different speeds. 

Question 1 When do these twins meet? When does R(f) = R*{t)? 

Answer They meet at t = — 1, when R = R* = — i+j — k. 

Question 2 What is an equation for the segment between P and Q (not beyond)? 
Answer In the equation for R(f), let i go from 0 to 1 {not beyond): 

x = r > j = 2 4 r z = 1 4 2r [0 ^ t ^ 1 for segment]. (6) 

At t = 0 we start from P = (0, 2, 1 ). At t = l we reach Q = ( 1 „ 3, 3), 

Question 3 What is an equation for the line without the parameter Cl 
Answer Solve equations (6) for r or use (5): xj\ = [y — 2)/l = (z — l)/2. 

Question 4 Which point on the line is closest to the origin? 

Answer The derivative of x 2 4 y 2 + z 2 — t 2 4 (2 + f) 2 4 { 1 4- 2t} 2 is 8 + This deriv- 
ative is zero at r = — 1. So the closest point is {- 1 J , - 1 ). 

Question 5 Where does the line meet the plane x + y + z = 1 1? 

Answer Equation (6) gives x 4 y 4 z = 3 + 4r = 1 F So r = 2, The meeting point is 
x = t = 2, y = t 4 2 = 4, z = 1 4 2f = 5. 

Question 6 What line goes through (3, l, 1) perpendicular to the plane x — y — z = 1? 
Answer The normal vector to the plane is N = i— j — k. That is v. The position 
vector to (3, 1, 1) is R 0 = 3i 4 j 4 k. Then R = R 0 4 fv. 

COMPARING LINES AND PLANES 

A line has one parameter or two equations. We give the starting point and velocity: 
{x, y, z) = (jc 0 , > i 0 , z 0 ) 4 r(iv t- 3 ) r That tells directly which points are on the line. 
Or we eliminate t to find the two equations in (5). 

A plane has one equation or two parameters! The equation is ax 4 by 4 cz - d. 
That tells us wdireetly which points are on the plane. (Instead of knowing \\ r, z, we 
know the equation they satisfy. Instead of directions v and w in the plane, we are 
told the perpendicular direction IS = (a,b,c).) With parameters, the line contains 
R 0 4 rv and the plane contains R 0 4 fv — sw. A plane looks worse with parameters 
(r and s), a line looks better. 

Questions 5 and 6 connected lines to planes. Here are two more. See Problems 
41- 44: 

Question 7 When is the line R 0 4 fv parallel to the plane? When is it perpendicular? 
Answer The test is v • N ^ 0. The test is v x N - 0. 

EXAMPLE4 Find the plane containing F 0 = (1,2,1) and the line of points 
(l, 0, 0) 4 r(2, 0, -1), That vector v will be in the plane. 



Solution The vector v = 2i — k goes along the line. The vector w = 2j + k goes from 
(1, 0. 0) to (1, 2, 1). Their cross product is 
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N = v x w = 


1 

2 
0 


j k 
0 -1 
2 1 


— 21 — 2 j -h 4k. 


The plane 2x — 2y + 4z = 2 has this normal N and contains the point (1, 2, 1). 


SPEED, DIRECTION, DISTANCE, ACCELERATION 

We go back to the curve traced out by R(f). The derivative v(t) = dR/dt is the velocity 
vector along that curve. The speed is the magnitude of v: 

speed - |v| = ^/(dx/dif + (dyjdt) 2 + {dzjdt) 2 . (7) 

The direction of the velocity vector is v/|v|, This is a unit vector, since v is divided by 
its length. The unit tangent vector v/|v| is denoted by T. 

The tangent vector is constant for lines. It changes direction for curves. 

EXAMPLE 5 (important) Find v and [v| and T for steady motion around a circle: 
x — r cos cot, y = r sin cut, z = 0, 

Solution The position vector is R — r cos cot i + r sin cot j. The velocity is 

v = dR/dt = — cor sin cot i + cor cos <ot j (tangent, not unit tangent) 

The speed is the radius r times the angular velocity go: 

|v| = yj(— cor sin cot) 2 +(cw cos cot) 2 = cor. 

The unit tangent vector is v divided by |v|: 

T = — sin cot i 4- cos (ot j (length 1 since sin 2 cut + cos 2 <ot — i). 

Think next about the distance traveled . Distance along a curve is always denoted 
by s (called arc length ). I don’t know why we use s — certainly not as the initial for 
speed. In fact speed is distance divided by time. The ratio s/t gives average speed; 
ds/dt is instantaneous speed. We are back to Chapter l and Section 83, the relation 
of speed to distanec: 

speed |v| = dsjdt distance s = j (dsjdt) dt — j \\{t)\ dt . 

Notice that |v| and s and t are scalars. The direction vector is T: 

_ v dR/dt dR /Q . 

T = = — ■ - = — = unit tangent vector . (8) 

|v| ds/dt ds s 

In Figure 12.3, the chord length (straight) is |AR|. The arc length (curved) is As . As 
AR and As approach zero, the ratio |AR/As| approaches |T[ = 1. 

Think finally about the acceleration vector a(t). It is the rate of change of velocity 
(not the rate of change of speed): 

dv d z R d 2 x. d 2 y. d 2 z 


( 9 ) 
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12 Motion Along a Cum 


Ivl - car 



a m 

r cos tof 
r sin of 



Steady motion around a circle. Half turn up a helix. 


For steady motion along a line, as in jc = t, y = 2 + £, z = 1 + 2r, there is no accelera- 
tion. The second derivatives arc all zero. For steady motion around a circle, there is 
acceleration. In driving a car, you accelerate with the gas pedal or the brake. You 
also accelerate by tuning the wheel. It is the velocity vector that changes, net the 
speed. 

EXAMPLE 6 Find the distance s(t) and acceleration a(t) for circular motion. 

Solution The speed in Example 5 is ds/dt = air. After integrating, the distance is s = 
art. At time t we have gone through an angle of cut. The radius is r, so the distance 
traveled agrees with cut times r. Note that the dimension of cu is 1 /time. (Angles are 
dimensionless.) At time t = Injco we have gone once around the circle — to s = 2nr 
not back to s = 0. 

The acceleration is a = d 2 H/dt 2 . Remember R = r cos cut i +r sin cut j: 

a(t) = — cu 2 r cos cut i — cu 2 r sin cut j. (10) 

That direction is opposite to R. This is a special motion, with no action on the gas 
pedal or the brake. All the acceleration is from turning. The magnitude is |a| =cu 2 r, 
with the correct dimension of distance/(timef 2 . 

EXAMPLE 7 Find v and s and a around the helix R = cos t i + sin t j + t k. 

Solution The velocity is v = — sin t i + cos t j + k. The speed is 

ds/dt = |v| = ,/sin 2 r + cos 2 t + 1 = Jl (constant). 

Then distance is s = -Jl t. At time t = n, a half turn is complete. The distance along 
the shadow is n (a half circle). The distance along the helix is n , because of its 

45 s slope. 

The unit tangent vector is velocity /speed, and the acceleration is dv/dt: 

T = (— sin f i + cos f j + k)/^/2 a = — cos t i — sin t j, 

EXAMPLE 8 Find v and s and a around the ellipse x = cos t, y = 2 sin t, z = 0. 

Solution Take derivatives: v = — sin t i + 2 cos t j and |v| = v ' ; sin 2 t -I- 4 cos 2 ! . This is 
the speed ds/dt. For the distance s, something bad happens (or something normal). 
The speed is not simplified by sin 2 r + cos 2 t = I. We cannot integrate ds/dt to find a 
formula for s. The square root defeats us. 

The acceleration — cos £ i - 2 sin t j still points to the center, This is not the Earth 
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going around the sun. The path is an ellipse but the speed is wrong. See Section 12.4 
(the pound note) for a terrible error in the position of the sun. 


12A The basic formulas for motion along a curve are 

dR dv _ ds v _ dRjdt dR 

V= !i * = ~dt V = df = jvj = ds/df = d? 


Suppose we know the acceleration a(r) and the initial velocity v 0 and position R 0 . 
Then v(r) and R(r) are also known. We integrate each component: 

a(r) — constant => v(t) = v 0 + at => R(r) = R 0 + v„r + ^at 2 

a(t) = cos t k => v(t) = v 0 + sin f k => R(r) = R 0 + v 0 t — cos t k. 

THE CURVE OF A BASEBALL 

There is a nice discussion of curve balls in the calculus book by Edwards and Penney. 
We summarize it here (optionally). The ball leaves the pitcher’s hand five feet off the 
ground: R 0 = Oi 4 - Oj + 5k. The initial velocity is v 0 = 1201 — 2j + 2k (120 ft/sec is more 
than 80 miles per hour). The acceleration is -32k from gravity, plus a new term from 
spin. If the spin is around the z axis, and the ball goes along the x axis, then this 
acceleration is in the y direction. (It comes from the cross product k x i — there is a 
pressure difference on the sides of the ball.) A good pitcher can achieve a = 16] — 32k. 
The batter integrates as fast as he can: 

v(f) = v 0 + at = 1201 + (- 2 + 16r)j + (2 — 32r)k 

R(i) = R 0 + Vo r + iaf 2 = I20f 1 + (- 2f + 8f 2 )j + (5 + 2f - 16r 2 )k. 

Notice the i 2 . The effect of spin is small at first, then suddenly bigger (as every batter 
knows). So is the effect of gravity — the ball starts to dive. At f = 1, the i component 
is 60 feet and the ball reaches the batter. The j component is I foot and the k 
component is 2 feet — the curve goes low over the outside corner. 

At ( = 5 , when the batter saw the ball halfway, the j component was zero. It looked 
as if it was coming right over the plate. 


x = 30 z = 5.5 .v = 0 z = 5 



Fig. 12.4 A curve ball approaches home plate. Halfway it is on line. 


12.1 EXERCISES 


Read-through questions 

The position vector o along the curve changes with the 
parameter /. The velocity is b . The acceleration is c 

If the position is i + tj + t 2 k, then v = d and a = a . 

In that example the speed is jv| = t This equals ds/di, 


where s measures the a . Then s = J h . The tangent 

vector is in the same direction as the I but T is a ) 
vector. In genera) T = k and in the example T = i 

Steady motion along a line has a - m . If the line is x = 
y — z, the unit tangent vector is T - n . If the speed is 
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|v| = y/3* the velocity vector is v = o , If the initial posi- 
tion is (1, 0, 0), the position vector is Rfri = p , The general 
equation of a line is x = x 0 + **;!, v= q , i= r . In 
vector notation this is R(t) = » . Eliminating t leaves the 

equations (x - x 0 )A>i = fv — vnl/u 7 = t . A line in space 
needs u equations where a plane needs v . A line has 
one parameter where a plane has w . The line from R 0 = 
(1 , 0, 0) to (2, 2 t 2) with |v| = 3 is R(f) = * . 

Steady motion around a circle (radius r, angular velocity 
w) has x = v . y = t . z = 0. The velocity is v ^ A , 
The speed is |v| = B , The acceleration is a = c . which 
has magnitude D and direction E , Combining 
upward motion R = tk with this circular motion produces 
motion around a F . Then v = & and |v| = H , 

1 Sketch the curve with parametric equations x = t, y = t 3 * 
Find the velocity vector and the speed at t = 1. 

2 Sketch the path with parametric equations x = 1 + t t y = 

1 - 1. Find the xy equation of the path and the speed along it* 

3 On the circle x = cos f t y — sin t explain by the chain rule 
and then by geometry why dyfdx = — cot t. 

4 Locate the highest point on the curve x = 6t, y = 6f — t 2 . 

This curve is a . What is the acceleration a? 

5 Find the velocity vector and the xy equation of the tangent 
line to x = e\ y = e~ l at t = 0. What is the xy equation of the 
curve? 

6 Describe the shapes of these curves: (a)x = 2*, y = 4*; (b) 
x -4 1 , y = 8 J ; (c) x = 4\y = 4t. 

Note: To find "parametric equations" is to find x(f), y(f), and 
possibly z(f). 

7 Find parametric equations for the line through P = 
( 1 , 2, 4) and Q = (5, 5, 4). Probably your speed is 5; change the 
equations so the speed is 10* Probably your R 0 is P; change 
the start to Q. 

8 Find an equation for any one plane that is perpendicular 
to the line in Problem 7. Also find equations for any one line 
that is perpendicular. 

9 On a straight line from (2, 3, 4) with velocity v = i - It, the 

position vector is Rfri = If the velocity vector is 

changed to ti — fit, then R(f} = . The path is still 


10 Find parametric equations for steady motion from P = 
(3, I, -2) at r = 0 on a line to Q = (0, 0, 0) at t = 3. What is 
the speed? Change parameters so the speed is e*. 

11 The equations x — 1 = i(y — 2) = 3(2 — 2) describe a 

* The same path is given parametrically by x = 1 + t t 

y = , z = . The same path is also given by 

x - 1 + 2t, y = , z = 

12 Find parametric equations to go around the unit circle 


with speed e r starting from x = 1, y = 0. When is the circle 
completed? 

13 The path x = 2y = 3z = 6r is a traveled with 

speed . If t is restricted by f ^ 1 the path starts at 

. If f is restricted by 0 < t < 1 the path is a . 

14 Find the closest point to the origin on the line x = l + f, 

y = 2 - t. When and where does it cross the 45° line through 
the origin? Find the equation of a line it never crosses. 

15 (a) How far apart are the two parallel lines x = y and 
x^y+1?(b) How far is the point x = t, y - 1 from the point 
x = t, y = t + 1? (c) What is the closest distance if their speeds 
are different: x = t y y — t and x — 2t, y — 2t + 1? 

16 Which vectors follow the same path as R = ti + t 2 j? The 
speed along the path may be different. 

(a)2fi + 2r 2 j (b)2fi + 4r 2 j (c) -fi + r 2 j (d)r 3 i + t 6 j 

17 Find a parametric form for the straight line y — mx 4- b. 

18 The line x = t + y = 2 4- v 2 t passes through the origin 

provided v { + v 1 — 0. This line crosses the 

45° line y = x unless v l + v 2 = 0. 

19 Find the velocity v and speed [v| and tangent vector T 
for these motions: (a) R = ti + 1 ~ *j (b) R = t cos 1 1 + t sin f j 
(c)R=(t + l)i + (2t + l)j + (2r + 2)k. 

20 If the velocity dxjdt i + dyjdt j is always perpendicular to 
the position vector xi + yj, show from their dot product that 
x 2 + y 2 is constant. The point stays on a circle. 

21 Find two paths R(r) with the same v = cos t i + sin t j. Find 
a third path with a different v but the same acceleration. 

22 If the acceleration is a constant vector, the path must be 

* If the path is a straight line, the acceleration vector 

must be . 

23 Find the minimum and maximum speed if x = t 4 - cos 

y = t - sin f. Show that |a| is constant but not a. The point is 
going around a circle while the center is moving on what line? 

24 Find x(t), y(f) so that the point goes around the circle 
(x - 1 ) 2 + {y - 3} 2 = 4 with speed 1 . 

25 A ball that is circling with x = cos 2t, y - sin 2t flies off on 
a tangent at t = n/8* Find its departure point and its position at 
a later time t (linear motion; compute its constant velocity v)* 

26 Why is |a| generally different from d 2 s/dt 2 l Give an 
example of the difference, and an example where they are 
equal* 

27 Change t so that the speed along the helix R = 

cos t i + sin t j + r k is 1 instead of Call the new 

parameter s. 

28 Find the speed ds/dt on the line x=lH- 6r, y = 2 + 3f, 
z = 2r. Integrate to find the length s from (1 , 2 t 0) to 
(13, 8, 4). Check by using 12 2 + 6 2 + 4 2 * 
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29 Find v and |v| and a for the curve x = tan i, y = sec £. What 
is this curve? At what time does it go to infinity, and along 
what line? 

30 Construct parametric equations for travel on a helix with 
speed r. 

31 Suppose the unit tangent vector T(t) is the derivative of 
R(f). What does that say about the speed? Give a noncircular 
example. 

32 For travel on the path y =/(x), with no parameter, it is 

impossible to find the but still possible to find the 

at each point of the path. 

Find x(t) and y(t) Tor paths 33-36, 

33 Around the square bounded by x — 0, x = 1 , y = 0, y — 1 , 
with speed 2. The formulas have four parts, 

34 Around the unit circle with speed eT\ Do you get all the 
way around? 

35 Around a circle of radius 4 with acceleration |a| = 1. 

36 Up and down the y axis with constant acceleration — j, 
returning to (0, 0) at t 10. 

37 True (with reason) or false (with example): 

(a) Tf |R[ “ 1 for all t then |v| = constant. 

(b) If a = 0 then R = constant 

(c) If v • v = constant then v • a = 0, 

(d) If v • R = 0 then R • R = constant. 

(e) There is no path with v = a. 

38 Find the position vector to the shadow of ri + t 2 j + r 3 k on 
the xz plane. Is the curve ever parallel to the line x-y — il 

39 On the ellipse x = a cos t, y = b sin t, the angle 0 from the 

center is not the same as r because , 


40 Two particles are racing from (1,0) to (0, 1). One follows 
x = cosf, y = sint, the other follows x = l + Uit, y = v 2 t. 
Choose Ui and v 2 so that the second particle goes slower but 
wins. 

41 Two lines in space are given by R(f) = P + tv and R(t) = 
Q + fw. Four possibilities: The lines are parallel or the same 
or intersecting or skew. Decide which is which based on the 
vectors v and w and u = Q — P (which goes between the lines): 

(a) The lines are parallel if flre parallel 

(b) The lines are the same if are parallel 

(c) The lines intersect if are not parallel but 

lie in the same plane, 

(d) The lines are skew if the triple product u ■ (v x w) is 


42 If the lines are skew (not in the same plane), find a formula 
based on u, v, w for the distance between them. The vector u 
may not be perpendicular to the two lines, so project it onto 
a vector that is. 

43 The distance from Q to the line P -l fv is the projection of 
a = Q — P perpendicular to v. How far is Q = (9, 4, 5) from 
the line x = 1 + f, y = 1 + It, z — 3 + 2t? 

44 Solve Problem 43 by calculus: substitute for x, y, z in 
(x — 9) 2 + (y — 4) 2 + (z — 5) 2 and minimize. Which (x, y, z) on 
the line is closest to (9, 4, 5}? 

45 Practice with parameters, starting from x = F(f)i y = G(i). 

(a) The mirror image across the 45 c line is x = , 

y = ■ 

(b) Write the curve x = f 3 , y = t 2 asy =/(x). 

(c) Why can’t x = t 2 , y = t 3 be written as y =/(*)? 

(d) If F is invertible then t = F _1 (x) and v = fxl 

46 From 12:00 to 1:00 a snail crawls steadily out the minute 
hand (one meter in one hour). Find its position at time t 
starting from (0, 0), 


■E 12.2 Plane Motion: Projectiles and Cycloids ■ 

The previous section started with R(t), From this position vector we computed v and 
a. Now we find R(r) itself, from more basic information. The laws of physics govern 
projectiles, and the motion of a wheel produces a cycloid (which enters problems in 
robotics). The projectiles fly without friction, so the only force is gravity. 

These motions occur in a plane. The two components of position will be x (across) 
and y (up), A projectile moves as t changes* so we look for x(t) and y(t). We are 
shooting a basketball or firing a gun or peacefully watering the lawn, and we have 
to aim in the right direction (not directly at the target). If the hose delivers water at 
10 meters/second, can you reach the car 12 meters away? 
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The usual initial position is (0, 0). Some flights start higher, at (0, h). The initial 
velocity is (r 0 cos a, v 0 sin a), where u 0 is the speed and a is the angle with the 
horizontal. The acceleration from gravity is purely vertical: d z y/dt 2 = —g. So the 
horizontal velocity stays at its initial value. The upward velocity decreases by — gt: 

dx/dt = i' 0 cos a, dyjdt = i> 0 sin a — gt. 

The horizontal distance x(t) is steadily increasing. The height }\t) increases and then 
decreases, To find the position, integrate the velocities (for a high start add h to y): 

The projectile path is x(t) = (u 0 cos a)t, y(t) = (u 0 sin a)t - jgt 2 . (1) 

This path is a parabola. But it is not written as y = ax 2 + bx + c. It could be, if we 
eliminated t. Then we would lose track of time. The parabola is y(x), with no param- 
eter, where we have x(r) and y(t). 

Basic question: Where does the projectile hit the ground ? For the parabola, we solve 
y(x) = 0. That gives the position x. For the projectile we solve y{ r) = 0. That gives the 
time it hits the ground, not the place. If that time is T, then x(T) gives the place. 
The information is there. It takes two steps instead of one, but we learn more. 

EXAMPLE 1 Water leaves the hose at 10 meters/ second (this is i' 0 ). It starts up at the 
angle a. Find the time T when y is zero again, and find where the projectile lands. 

Solution The flight ends when y = (10 sin <t)T - \gT 2 = 0. The flight time is T — 
(20 sin a)/g. At that time, the horizontal distance is 

x(T) = (I0 cos a)T = {200 cos a sin a )jg. This is the range I?. 

The projectile (or water from the hose) hits the ground at x = R, To simplify, replace 
200 cos a sin a by 100 sin 2a. Since g = 9,8 meters/sec 2 , we can y t reach the car. 

The range R =(100 sin 2a)/9.8 is at most 100/9.8. This is less than 12. 

The range is greatest when sin 2a = I (a is 45 s ). To reach 12 meters we could stand 
on a ladder (Problem 14). To hit a baseball against air resistance, the best angle is 
nearer to 35°. Figure 12.5 shows symmetric parabolas (no air resistance) and unsym- 
metric flight paths that drop more steeply. 


12B The flight time T and the horizontal range R = x{T) are reached when 
y = 0, which means (r 0 sin a)T = \gT 2 : 

T = (2r 0 sin a)jg and R = (u 0 cos a)T = (vl sin 2a)/ g. 




DISTANCE IN FEET 

Hg. 12*5 Equal range R y different times T Baseballs hit at 35 s with increasing u 0 - The dots 
are at half-seconds (from The Physics of Baseball by Robert Adair Harper and Row 1990). 
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EXAMPLE 2 What are the correct angles a for a given range R and given i> 0 ? 

Solution The range is R *= (vl sin 2a)/g. This determines the sine of 2a— but two 
angles can have the same sine . We might find 2 a — 60° or 120°. The starting angles 
a = 30° and a = 60° in Figure 12.5 give the same sin 2a and the same range R. The 
flight times contain sin a and are different 
By calculus, the maximum height occurs when dyjdt = 0. Then v Q sin x — gt, which 
means that t = (v 0 sin a )/g. This is half of the total flight time T — the time going up 
equals the time coming down. The value of y at this halfway time t = \T is 

}'ma, = (»0 sin *)(»o sin a)jg - igK sin a jgf = (d 0 sin af/2g. (2) 

EXAMPLE 3 If a ski jumper goes 90 meters down a 30° slope, after talcing off at 28 
meters/second, find equations for the flight time and the ramp angle a. 

Solution The jumper lands at the point x — 90 cos 30°, y — — 90 sin 30° (minus sign 
for obvious reasons). The basic equation (2) is x = (28 cos a)t, y = (28 sin a)r — \gt 2 . 
Those are two equations for a and t. Note that t is not T y the flight time to y — 0. 

Conclusion The position of a projectile involves three parameters u 0 > a, and t, Three 
pieces of information determine the flight (almost). The rea son for the word almost is 
the presence of sin a and cos a. Some flight requirements cannot be met (reaching a 
car at 12 meters). Other requirements can be met in two ways (when the car is close). 
The equation sin a = c is more likely to have no solution or two solutions than exactly 
one solution. 

Watch for the three pieces of information in each problem. When a football starts 
at v o = 20 meters/second and hits the ground at x — 40 meters* the third fact is 

. This is like a lawyer who is asked the fee and says $1000 for three questions. 

“Isn’t that steep?” says the client “Yes ” says the lawyer, “now what’s your last 
question?” 


CYCLOIDS 

A projectile’s path is a parabola. To compute it, eliminate t from the equations for x 
and y. Problem 5 finds y = ax 1 + bx , a parabola through the origin. The path of a 
point on a wheel seems equally simple, but eliminating t is virtually impossible. The 
cycloid is a curve that really needs and uses a parameter. 

To trace out a cycloid, roll a circle of radius a along the x axis . Watch the point 
that starts at the bottom of the circle. It comes back to the bottom at x = 2 na, after 
a complete turn of the circle. The path in between is shown in Figure 12.6. After a 
century of looking for the xy equation, a series of great scientists (Galileo, Christopher 
Wren, Huygens, Bernoulli, even Newton and THopital) found the right way to study 
a cycloid — by introducing a parameter. We will call it 0; it could also be t. 





Hg. 12.6 Path of P on a rolling circle is a cycloid. Fastest slide to Q, 
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The parameter is the angle 0 through which the circle turns. (This angle is not at 
the origin, hke 0 in polar coordinates.) The circle rolls a distance ad, radius times 
angle, along the x axis. So the center of the circle is at x = ad , y = a. To account for 
the segment CP, subtract a sin 8 from x and a cos d from y: 

The point P has x = a(d — sin d) and y = g( 1 — cos 0). (3) 

At d = 0 the position is (0, 0). At d = 2ti the position is (Ixa, 0). In between, the slope 
of the cycloid comes from the chain rule: 


dy dyjdd a sin 0 
dx dx/d8 o(l — cos df 

This is infinite at d = 0, The point on the circle starts straight upward and the cycloid 
has a cusp , Note how all calculations use the parameter 0. We go quickly: 

Question 1 Find the area under one arch of the cycloid (0 = 0 to 0 = 2 n). 

Answer The area is j y dx — Jq* a(l — cos 0)o( 1 — cos 9)dd. This equals 3 na 2 . 

Question 2 Find the length of the arch, u sing ds = J{ dxjdd) 1 + ( dyjdfff dd. 
Answer J ds = Jq" 1 — cos ff) 2 + (sin 0) 2 dd = J 2 * a^2—2 cos 0 dd. 

Now substitute 1 — cos d — 2 sin 2 \d. The square root is 2 sin jd. The length is 8 a. 


Question 3 If the cycloid is turned over (y is downward), find the time to slide to 
the bottom. The slider starts with v — 0 at y — 0. 

Answer Kinetic plus potential energy is \mv 2 — mgy = 0 (it starts from zero and 
can't change). So the speed is v = 2gy . This is ds/dt and we know ds: 


sliding time = 


dt = 


ds 


’* a*J 2 - 2 cos 9 dd 
o J2ga(\ — COS 0) 


= n^faig- 


Check dimensions: a = distance, g = distance/(time) 2 , j lyfajg - time. That is the short- 
est sliding time for any curve . The cycloid solves the “brachistochrone problem,” 
which minimizes the time down curves from O to Q (Figure 12,6). You might think 
a straight path would be quicker — it is certainly shorter A straight line has the 
equation x — nyf 2, so the sliding time is 


f dt = J dsljlgy = J 2 " J (nil ) 1 + 1 dyj^/lgy = Jn} + 4 s /ajg. (5) 

This is larger than the cycloid time tt^fafg. It is better to start out vertically and pick 
up speed early, even if the path is longer. 

Instead of publishing his solution, John Bernoulli turned this problem into an 
international challenge: Prove that the cycloid gives the fastest slide . Most mathemati- 
cians couldn’t do it The problem reached Isaac Newton (this was later in his life). 
As you would expect, Newton solved it For some reason he sent back his proof with 
no name. But when Bernoulli received the answer, he was not fooled for a moment: 
“I recognize the lion by his claws.” 

What is also amazing is a further property of the cycloid: The time to Q is the same 
if you begin anywhere along the path. Starting from rest at P instead of O, the bottom 
is reached at the same time. This time Bernoulli got carried away: “You will be 
petrified with astonishment when I say...” 


There are other beautiful curves, closely related to the cycloid. For an epicycloid, 
the circle rolls around the outside of another circle. For a hypocycloid, the rolling 
circle is inside the fixed circle. The asrreid is the special case with radii in the ratio 1 
to 4. It is the curved star in Problem 34, where x = a cos 3 0 and y = a sin 3 0. 
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The cycloid even solves the old puzzle: What point moves backward when a train 
starts forward ? The train wheels have a flange that extends below the track, and 
dxjdt < 0 at the bottom of the flange. 


12.2 EXERCISES 


Read-through questions 

A projectile starts with speed v 0 and angle a. At time t its 
velocity is dxjdt — q dvidt — b (the downward 
acceleration is g). Starting from (0, 0), the position at time f 
is x = c , v= d , The flight time back to y = 0 is T = 
Q . At that time the horizontal range is R = 1 , The 

flight path is a 9 . 

The three quantities Vn , h 1 determine the pro- 

jectile’s motion. Knowing uq and the position of the target, 
we (can) (cannot) solve for a. Knowing a and the position of 
the target, we (can) (cannot) solve for i> 0 , 

A 1 is traced out by a point on a rolling circle. If the 
radius is a and the turning angle is 0 , the center of the circle 
is at x = k y = 1 The point is at x = m . y = 

n starting from (0,0). It travels a distance o in a 
full turn of the circle. The curve has a P at the end of 
every turn. An upside-down cycloid gives the Q slide 
between two points. 

Problems 1-18 and 41 are about projectiles 

1 Find the time of flight T, the range R t and the maximum 
height Y of a projectile with u 0 = 16 ft/sec and 

(a) a = 30° (b) a = 60 c (c) a = 90". 

2 If v 0 = 32 ft/sec and the projectile returns to the ground 
at 7=1, find the angle a and the range R. 

3 A ball is thrown at 60° with v 0 = 20 meters/sec to clear a 
wall 2 meters high. How far away is the wall? 

4 If v(0) = 3i + 3j find v(f), v(l), v(2) and R{t), R{1), R(2), 

5 (a) Eliminate r from x = f, v = t — jt 1 to find the xy equa- 
tion of the path. At what x is v = 0? 

(b) Do the same for any i 0 and x 

6 Find the angle a for a ball kicked at 30 meters/second if 
it clears 6 meters traveling horizontally. 

7 How far out does a stone hit the water h feet below, start- 
ing with velocity u 0 at angle a. = 0? 

8 How far out does the same stone go, starting at angle a? 
Find an equation for the angle that maximizes the range. 

9 A ball starting from (0, 0) passes through (5, 2) after 2 
seconds. Find and x (The units are meters.) 

*10 With x and y from equation (1), show that 
1=0 > isxivo) 2 + 2gy. 


If a fire is at height H and the water velocity is v 0f how far 
can the fireman put the hose back from the fire? (The parabola 
in this problem is the “envelope" enclosing all possible paths,) 

11 Estimate the initial speed of a 100-meter golf shot hit at 
a = 45°. Is the true larger or smaller, when air friction is 
included? 

12 T = 2r 0 (sin x)jg is in seconds and R = (uq sin 2ot)/g is in 

meters if v$ and g are in . 

13 (a) What is the greatest height a ball can be thrown? Aim 
straight up with v 0 = 28 meters/sec. 


14 If a baseball goes 100 miles per hour for 60 feet, how long 
does it take (in seconds) and how far does it fall from gravity 
(in feet)? Use $gt 2 . 

15 If you double u 0s what happens to the range and maxi- 
mum height? If you change the angle by da, what happens to 
those numbers? 

16 At what point on the path is the speed of the projectile 
(a) least (b) greatest? 

17 If the hose with i? 0 = lOm/sec is at a 45° angle, x reaches 

1 2 meters when t = and y — . From a lad- 
der of height the water will reach the car (12 meters), 

* 

18 Describe the two trajectories a golf ball can take to land 
right in the hole, if it starts with a large known velocity rv 
In reality (with air resistance) which of those shots would fall 
closer? 

Problems 19-34 are about cycloids and related curves 

19 Find the unit tangent vector T to the cycloid. Also find 
the speed at 8 = 0 and 0 = n, if the wheel turns at dO/dt = 1. 

20 The slope of the cycloid is infinite at 0 = 0: 

dy _ dyidO _ sin 0 
dx dxjdO 1 — cos 0 

By whose rule? Estimate the slope at 0 = ^ and 0= — 
Where does the slope equal one? 

21 Show that the tangent to the cycloid at P in Figure 12.6a 
goes through x = aO, y = 2a. Where is this point on the rolling 
circle? 

22 For a trochoid, the point P is a distance d from the center 
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of the rolling circle. Redraw Figure 12.6b to find x = 
aO — d sin 8 and y = __ „ 

23 If a circle of radius a rolls inside a circle of radius 2 a, show 
that one point on the small circle goes across on a straight 
Line, 

24 Find d 2 yjdx 2 for the cycloid, which is concave ^ , 

25 If dOjdt = c , find the velocities dxjdt and dyjdt along the 
cycloid. Where is dxjdt greatest and where is dyjdt greatest? 

26 Experiment with graphs of x = a cos 0 + b sin 8, y = 
ccos6-\-d sin 0 using a computer. What kind of curves are 
they? Why are they closed? 

27 A stone in a bicycle tire goes along a cycloid. Find equ- 
ations for the stone’s path if it flies off at the top {a projectile), 

28 Draw curves on a computer with x = a cos 0 + b cos 3 0 
and y = c sin 0 + d sin 30, Is there a limit to the number of 
loops? 



35 Find the area inside the astroid. 

36 Explain why x = 2 a cot 0 and y = 2a sin 2 0 for the point P 
on the witch of Agnesi. Eliminate 0 to find the xy equation. 
Note: Maria Agnesi wrote the first three- semes ter calculus 
text (THopital didn’t do integral calculus). The word “witch" 
is a total mistranslation, nothing to do with her or the curve. 





29 When a penny rolls completely around another penny, the 

head makes turns. When it rolls inside a circle four 

times larger (for the astroid), the head makes turns. 

30 Display the cycloid family with computer graphics: 

(a) cycloid 

(b) epicycloid x = C cos 0 — cos C0, y — C sin 8 + sin C8 

(c) hypocydoid x = c cos 0 + cos c0, y = c sin 8 — sin cd 

(d) astroid (c = 3) 

(e) deltoid (c = 2). 

31 If one arch of the cycloid is revolved around the x axis, 
find the surface area and volume. 

32 For a hypocydoid the fixed circle has radius c + 1 and the 
circle rolling inside has radius 1. There are c + 1 cusps if c is 
an integer. How many cusps (use computer graphics if pos- 
sible) for c = 1/2? c = 3/2? c = JlP. What curve for c = 1? 

33 When a string is unwound from a circle find x(0) and y{0) 
for point P. Its path is the “involute" of the circle, 

34 For the point P on the astroid, explain why x = 
3 cos 0 + cos 30 and y = 3 sin 0 — sin 30. The angle in the 

figure is 30 because both circular arcs have length r 

Convert to x = 4 cos 3 0, y = 4 sin *0 by triple-angle formulas. 


37 For a cordioid the radius C — 1 of the fixed circle equals 

the radius 1 of the circle rolling outside (epicycloid with C = 
2). (a) The coordinates of P are x = — 1 + 2 cos 0 — cos 20, 
y — (b) The double-angle formulas yield x = 

2 cos 0(1 — cos 0), y = . (c) x 2 + y 2 = so its 

square root is r = , 

38 Explain the last two steps in equation (5) for the sliding 
time down a straight path. 

39 On an upside-down cycloid the slider takes the same time 
T to reach bottom wherever it starts. Starting at 0 = a, write 
l — cos 0 = 2 sin 2 0/2 and 1 — cos a = 2 sin 2 a/2 to show that 

T f* >/2a 2 ll - cos 0) dO rc R 
J* y^atffcos a — cos 0) V 8 

40 Suppose a heavy weight is attached to the top of the roll- 
ing circle. What is the path of the weight? 

41 The wail in Fenway Park is 37 feet high and 315 feet from 
home plate. A baseball hit 3 feet above the ground at a = 

22. 5 C will just go over if v 0 = „ , The time to reach the 

wall is . 
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12.3 Curvature and Normal Vector 


A driver produces acceleration three ways — by the gas pedal, the brake, and steering 
wheel. The first two change the speed. Turning the wheel changes the direction. All 
three change the velocity (they give acceleration). For steady motion around a circle, 
the change is from steering — the acceleration dvjdt points to the center. We now 
look at motion along other curves, to separate change in the speed ]y| from change 
in the direction T. 

The direction of motion is T = v/|v|. It depends on the path but not the speed 
(because we divide by jv|). For turning we measure two things: 

1. How fast T turns: this will be the curvature k (kappa). 

2. Which direction T turns: this will be the normal vector N. 

k and N depend, like s and T, only on the shape of the curve. Replacing t by 2t or 
t 2 leaves them unchanged. For a circle we give the answers in advance, The normal 
vector N points to the center. The curvature k is 1 /radius. 

A smaller turning circle means a larger curvature k: more bending. 

The curvature k is change in direction |dT| divided by change in position |ds|. There 
are three formulas for k — a direct one for graphs y(x) y a brutal but valuable one for 
any parametric curve (x(t), y(f)), and a neat formula that uses tbe vectors y and a. We 
begin with the definition and the neat formula. 

DEFINITION k - \cTT/ds\ FORMULA k = |v x a|/|v| 3 (1) 

The definition does not involve the parameter t — but the calculations do. The posi- 
tion vector R(r) yields v = dR/dt and a = dyjdt If t is changed to It , the velocity v is 
doubled and a is multiplied by 4. Then |v x a| and |v| 3 are multiplied by 8, and their 
ratio k is unchanged. 

Proof of formula (1) Start from v — |v|T and compute its derivative a: 

d|v| dT 

a = — T + |y| — by the product rule. 
dt dt 

Now take the cross product with v = |v|T. Remember that T x T = 0: 

? X a = |v|Tx |v|^. (2) 


We know that |T| = 1. Equation (4) will show that T is perpendicular to dT/dt. So 
|y x a| is the first length |v| times the second length |v| \dY/dt\. The factor sin 6 in the 
length of a cross product is 1 from the 90° angle. In other words 


d T 

|v x a| 

— - , ■ and k — 

d T 


dTjdt 

dt 

M 

ds 


ds/dt 


(3) 


The chain rule brings the extra \ds/dt\ = Jy| into the denominator. 

Before any examples, we show that dTjdt is perpendicular to T. The reason is that 
T is a unit vector. Differentiate both sides of T*T=: 1: 


<TT ^ ^ <TT „ 

— -T + T- — =0 
dt dt 


or 


dT 

2T‘— =0. 
dt 


(4) 
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That proof used the product rule U'*V + U*V' for the derivative of U-V 
(Problem 23, with U = V = T). Think of the vector T moving around the unit sphere. 
To keep a constant length (T + dT)*(T + dT) = 1, we need 2T*<fT = 0. Movement 
dT is perpendicular to radius vector T. 

Our first examples will be plane curves. The position vector R(r) has components 
x(t) and >f?) but no z(i). Look at the components of v and a and v x a (x' means 

dx/dt): 

R *W y(t) 0 

v x'(r) /(f) 0 |v| = N /|x'| 2 + |/| 2 

a x''(r) y"(t) 0 | x y - yV'| 

v x a 0 0 x'y" - y’x" * ((x') 2 +(/) 2 ) 3 ' 2 ! * 

Equation (5) is the brutal but valuable formula for k. Apply it to movement around 
a circle. We should find k — 1 /radius a: 

EXAMPLE 1 When x = a cos an and y = a sin ot we substitute x\ y\ x", y" into (5): 

(—aia sin a)t)(—o) 2 a sin cut) — (cua cos a>t)(—aj 2 a cos cur) cu 3 a 2 

Koa sin cur) 2 + (oa cos cur) 2 ] 3/2 [cu 2 a 2 ] 3/2 ’ 

This is cu 3 u 2 /cu 3 a 3 and cu cancels. The speed makes no difference to k= 1 /a. 

The third formula for k applies to an ordinary plane curve given by y(x). The 
parameter r is x! You see the square root in the speed |v| = dsjdx\ 


R 

X 

y(x) 

0 


V 

1 

dyjdx 

0 

M = ^1 + (dyldxf 

a 

0 

d 2 yjdx 2 

0 

\d 2 yjdx 2 \ 

y x a 

0 

0 

d 1 yjdx 1 

K -(\+(dy/dx) 2 > 


In practice this is the most popular formula for k . The most popular approximation 
is \d 2 y/dx 2 \. {The denominator is omitted.) For the bending of a beam, the nonlinear 
equation uses k and the linear equation uses d 2 yjdx 2 , We can see the difference for 
a parabola: 


EXAMPLE 2 The curvature of y - $x 2 is k = |y"|/(l + (y') 2 ) 3/2 - 1/(1 + x 2 ) 3/2 . 



Rg. 12.7 Normal N divided by curvature k for circle and parabola and unit helix. 
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The approximation is y ” = 1. This agrees with k at x = 0, where the parabola turns 
the corner* But for large x, the curvature approaches zero* Far out on the parabola, 
we go a long way for a small change in direction* 

The parabola y= “i* 2 , opening down, has the same k , Now try a space curve* 


EXAMPLE 3 Find the curvature of the unit helix R = cos t i + sin t j H- rk* 
Take the cross product of y = — sin t i + cos t j + k and a = — cos j i — sin t j: 


v x a = 


i j k 

sin t cos f I 

cos t — sin t 0 


= sin f i — cos t j + k* 


This cross product has length Also the speed is |v| = ^/sin 2 t + cos 2 f + 1 = 

Compare with a unit circle. Without the climhing term tk, the curvature would be 1. 
Because of climbing, each turn of the helix is longer and k = 

That makes one think: Is the helix twice as long as the circle? No. The length of a 
turn is only increased by |v| = v /2. The other v /2 is because the tangent T slopes 
upward. The shadow in the base turns a full 360°, but T turns less. 


THE NORMAL VECTOR N 


The discussion is bringing us to an important vector. Where k measures the rate of 
turning, the unit vector N gives the direction of turning. N is perpendicular to T, and 
in the plane that leaves practically no choioe. Turn left or right. For a space curve, 
follow dT, Remember equation (4), which makes dT perpendicular to T. 

The normal vector N is a unit vector along dT/dt. It is perpendicular to T: 


DEFINITION N = 


dT (ds 
|dT/ds| 


l_dT 
k ds 


FORMULA N = 


dT/dt 

IdT/dtf 


( 7 ) 


EXAMPLE 4 Find the normal vector N for the same helix R = cos r i + sin r j + rk. 
Solution Copy v from Example 3, divide by |v|, and compute dT/dt: 

T = v/|v| = (— sin t i + cos t j + k)/ v /2 and dT/dt = (— cos t i — sin t j)/ v /2. 

To change dT/dt into a unit vector, cancel the yjl. The normal vector is N = 
— cos f i — sin t j. It is perpendicular to T. Since the k component is zero, N is hori- 
zontal. The tangent T slopes up at 45° — it goes around the circle at that latitude. 
The normal N is tangent to this circle (N is tangent to the path of the tangent!). 
So N stays horizontal as the helix climbs. 

There is also a third direction, perpendicular to T and N. It is the binormal vector 
B = TxN, computed in Problems 25-30. The unit vectors T, N, B provide the 
natural coordinate system for the path — along the curve, in the plane of the curve, 
and out of that plane. The theory is beautiful but the computations are not often 
done — we stop here. 
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TANGENTIAL AND NORMAL COMPONENTS OF ACCELERATION 

May I return a last time to the gas pedal and brake and steering wheel? The first 
two give acceleration along T. Turning gives acceleration along N. The rate of turning 
(curvature /t) and the direction N are established. We now ask about the force 
required* Newton’s Law is F = ma, so we need the acceleration a — especially its 
component along T and its component along N, 

d 2 s 

The acceleration is a = — r- T + k 
dt 2 

For a straight path, d 2 sjdi 2 is the only acceleration — the ordinary second derivative. 
The term K(dsjdt) 2 is the acceleration in turning. Both have the dimension of length/ 
(time) 2 . 

The force to steer around a corner depends on curvature and speed — as all drivers 
know. Acceleration is the derivative of v = |v|T = (ds/dt)T: 

d 2 s ww , ds dT d 2 s ds dT ds 

a = TT ^ ~T ~ ~T~2^ + —j r T* (9) 

dt 2 dt dt dt 1 dt ds dt 

That last term is jc(ds/Jt) 2 N, since dTjds = kN by formula (7). So (8) is proved. 


ds 

7t 


N. 


( 8 ) 


EXAMPLE 5 A fixed speed dsidi = 1 gives d 2 s/dt 2 = 0. The only acceleration is kN. 

EXAMPLE 6 Find the components of a for circular speed-up R(f) = cos i 2 i + sin t 2 j. 
Without stopping to think, compute dR/dt = v and dsidt = |y| and v/|v| = T: 

v = “ 2t sin t 2 i + It cos t 2 jv| = 2t, T = — sin e 2 i 4- cos t 2 j. 

The derivative of ds/dt = |v| is d 2 sjdt 2 = 2* The derivative of v is a: 

a = — 2 sin r 2 i + 2 cos t 1 j — 4f 2 cos t 2 i — 4f 2 sin t 2 j* 

In the first terms of a we see 2T. In the last terms we must be seeing k|v| 2 N* Certainly 
|v| 2 = 4 1 2 and K—\ r because the circle has radius 1. Thus a = 2T + 4t 2 N has the 
tangential component 2 and normal component 4t 2 — acceleration along the circle 
and in to the center. 


Table of Formulas 


v = dR/dt a = d\/dt 

|v| = ds/dt T = v/|v| = |fJR/fJs| 

Curvature k = \(TT/ds\ = jv x a|/|v| 3 


Plane curves k = 


_ \d 2 yfdx 2 \ 

(M 2 + (y') 2 ) 3/2 (1 +(dy/dx) 2 f 


x, i ^ 1 dT dTfdt 

Normal vector in — — — = — — — 
k ds |rfT/rfr| 


Acceleration a = (d 2 s/dt 2 ) T + k|y| 2 N 



Fig. 12*8 Components of a as car turns corner. 
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12.3 EXERCISES 


Read-through questions 

The curvature tells how fast the curve a For a circle of 
radius a, the direction changes by 2 n in a distance b , so 
k = c . For a plane curve y — f(x) the formula is k = \y"\f 
d The curvature of y - sin x is 8 At a point where 
y = 0 (an * point) the curve is momentarily straight and 
k — __9 . For a space curve k = |v x a|/ h . 

The normal vector N is perpendicular to j It is a 
1 vector along the derivative of T, so N — jj . For 
motion around a circle N points I Up a helix N also 
points m . Moving at unit speed on any curve, the time t 

is the same as the _n s. Then |v| = o and d 2 sjdt 2 = 

P and a is in the direction of q 

Acceleration equals »* T 4 * N. At unit speed 

around a unit circle, those components are t An 

astronaut who spins once a second in a radius of one meter 
has |a| = u meters/sec 2 , which is about * g. 

Compute the curvature k in Problems 1-8. 

1 

2 v = In x (where is k largest?) 

3 x = 2 cos l, y = 2 sin t 

4 x = cos f 2 , y — sin t 1 

5 x = l 4 t 2 , y = 3f 2 (the path is a ). 

6 x - cos 3 r, y — sin 3 / 

7 r = ^ = t (so x = t cos t. y = ) 

8 x = r, y = In cos f 

9 Find T and N in Problem 4. 

10 Show that N = $in t i 4 cos t j in Problem 6. 

11 Compute T and N in Problem 8. 

12 Find the speed | v] and curvature k of a projectile: 

.x = (i’o cos dt)f . )’ = (i ; o sin a)£ - ?gt J . 

13 Find T and |v| and k for the helix R = 3 cos t i 
4 3 sin t j 4 4r k. How much longer is a turn of the helix than 
the corresponding circle? What is the upward slope of T? 

14 When k = 0 the path is a . This happens when v 

and a are . Then v x a = . 

15 Find the curvature of a cycloid x = a(t — sin r), y = 
a( 1 — cos t). 

16 If all points of a curve are moved twice as far from the 
origin (x -* 2x, y -► 2y), what happens to x? What happens 
to N? 


17 Find k and N at 0 = n for the hypocycloid x^ 
4 cos 0 4- cos 40, y = 4 sin 0 — sin 40. 

18 From v = |v|T and a in equation (8), derive k = |v x a|/|v|\ 

19 From a point on the curve, go along the vector N/pc to 
find the center of curvature. Locate this center for the point 

( 1 . 0) on the circle x = cos f, y = sin t and the ellipse x = cos r, 
y = 2sin; and the parabola y = ?{x 2 — 1), The path of the 
center of curvature is the “ evolute ” of the curve. 

20 Which of these depend only on the shape of the curve, 
and which depend also on the speed? v, T, |v|, s, k> a, N, B. 

21 A plane curve through (0, 0) and (2, 0) with constant cur- 
vature k is the circular arc , For which k is there no 

such curve? 

22 Sketch a smooth curve going through (0,0), (1, —I), and 

(2.0) . Somewhere d 2 y/dx z is at least . Somewhere 

the curvature is at least , (Proof is for instructors 

only.) 

23 For plane vectors, the ordinary product rule applied to 

U X V j 4 U 2 ^2 shows that (U • V)' =s U' • V 4 . 

24 If v is perpendicular to a, prove that the speed is constant. 
True or false: The path is a circle. 

Problems 25-30 work with the T-N-B system — along the 
curve, in the plane of the curve, perpendicular to that plane. 

25 Compute B = T x N for the helix R = cos f i 4 sin i j 4 tk 
in Examples 3-4. 

26 Using Problem 23, differentiate B * T — 0 and B • B = 1 to 
show that B r is perpendicular to T and B. So dBjds = — rN 
for some number t called the torsion. 

27 Compute the torsion t = |^B/ds| for the helix in 
Problem 25. 

28 Find B = T x N for the curve x = l, y = t, z = t 2 > 

29 A circle lies in the xy plane. Its normal N lies 

and B = and t = |iiB/Jsj - . 

30 The Serret-Frenet formulas are dYjds = kN, dS/ds = 
— kT 4 tB, dB/ds™ — rN, We know the first and third. 
Differentiate N = — T x B to find the second, 

31 The angle 0 from the x axis to the tangent line is 0 — 
tan" : [dy!dx), when dyidx is the slope of the curve. 

(a) Compute ddjdx, 

(b) Divide by dsjdx = { 1 — (dyidx) 1 ) 1 * 2 to show that \d0/ds\ 
is k in equation (5). Curvature is change in direction \d0\ 
divided by change in position \ds\. 

32 If the tangent direction is at angle 6 then T = 

cos 0 i 4 sin 0 j . In Problem 31 \d0!ds\ agreed w ith k = |<iT/ds| 
because \dJ id0\ - . 
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Id 33-37 find the T and N components of acceleration. 

33 x — 5 cos cot, y — 5 sin cot, z — 0 (circle) 

34 x = l + t y y = 1 + 2t, z = 1 + 3f (line) 

35 x = t cos t, y = t sin t, z = 0 (spiral) 


36 x = e* cos t, y = e f sin t, z = 0 (spiral) 

37 x = 1 , y = r, z = J 2 , 

38 For the spiral in 36, show that the angle between R and 
a (position and acceleration) is constant. Find the angle. 

39 Find the curvature of a polar curve r = F(0). 



12.4 Polar Coordinates and Planetary Motion 



This section has a general purpose — to do vector calculus in polar coordinates . It 
also has a specific purpose — to study central forces and the motion of planets . The 
main gravitational force on a planet is from the sun. It is a central force, because it 
comes from the sun at the center. Polar coordinates are natural, so the two purposes 
go together. 

You may feel that the planets are too old for this course. But Kepler’s laws are 
more than theorems, they are something special in the history of mankind — “the 
greatest scientific discovery of all time. - ” If we can recapture that glory we should do 
it. Part of the greatness is in the difficulty — Kepler was working sixty years before 
Newton discovered calculus. From pages of observations, and some terrific guesses, 
a theory was bom. We will try to preserve the greatness without the difficulty, and 
show how elliptic orbits come from calculus. The first conclusion is quick. 

Motion in a central force field always stays in a plane. 

F is a multiple of the vector R from the origin (central force). F also equals ma 
(Newton’s Law). Therefore R and a are in the same direction and R x a = 0. Then 
R x v has zero derivative and is constant: 


d 

by the product rule: — (R xv) = vxv + Rxa=0 + 0. (1) 

R x v is a constant vector H. So R stays in the plane perpendicular to H. 

How does a planet move in that plane? We turn to polar coordinates. At each 
point except the origin (where the sun is), u, is the unit v ector po inting outward \ It is 
the position vector R divided by its length r {which is yj x 2 "+ y 2 ): 

u, — R/r = (xl + yj)/r = cos 9 i + sin 6 j. (2) 

That is a unit vector because cos 2 0 + sin 2 0 — 1. It goes out from the center. 
Figure 12.9 shows u, and the second unit vector u e at a 90° angle: 

u* = — sin 9 i + cos 9 j. (3) 

The dot product is u, • u e - 0. The subscripts r and 9 indicate direction (not derivative). 

Question 1; How do u r and u 0 change as r changes (out a ray)? They dont. 

Question 2: How do u r and u* change as 0 changes? Take the derivative: 

dujd6 — — sin 9 i + cos 9 j = u 9 

du e /d9 = — cos 9 i — sin 0 j = — u,. 


( 4 ) 
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Hg. 12*9 ii, is outward, u* is around the center. Components of v and a in those directions. 


Since u, = R/r s one formula is simple: The position vector is R = m r . For its derivative 
v = dRjdt , use the chain rule dx^fdt = (dujd9)(d8jdt) = (d9/dt) u$: 


d 

dt ^ 


The velocity is v = — (ru r ) = — u, + r — 


dr 

It' 


d8 

dt' 


(5) 


The outward speed is drjdt The circular speed is r d9jdt The sum of squares is |y| 2 . 

Return one more time to steady motion around a circle, say r = 3 and 6 = 2t The 
velocity is v = 6u s , all circular. The acceleration is -12u r , all inward. For circles u* 
is the tangent vector T. But the unit vector u r points outward and N points inward — 
the way the curve turns. 

Now we tackle acceleration for any motion in polar coordinates. There can be 
speedup in r and speedup in 9 (also change of direction). Differentiate v in (5) by the 
product rule: 

dv d 2 r dr du T dr d6 d 2 9 d9 du e 

dt dt 2 dt dt dt dt dt 2 dt dt 

For dujdt and du&jdt, multiply equation (4) by dOjdt. Then all terms contain u r or Up. 
The formula for a is famous but not popular (except it got us to the moon): 


a 


dv 

It 



( 6 ) 


In the steady motion with r = 3 and 9 = It, only one acceleration term is nonzero: 
a = — 12u,. Formula (6) can be memorized (maybe). Problem 14 gives a new way to 
reach it, using re ie . 


EXAMPLE 1 Find R and v and a for speedup 8 = t 2 around the circle r = 1. 

Solution The position vector is R = u r . Then v and a come from (5-6): 

v = (r dOjdt) u# = 2fii0 a = — (2r) 2 u, + 2^. 

This question and answer were also in Example 6 of the previous section. The acceler- 
ation was 2T + 4f 2 N, Notice again that T = u 0 and N = — u M going round the circle. 


EXAMPLE 2 Find R and v and \v\ and a for the spiral motion r = 3 9 — 2t 
Solution The position vector is R = 3r u,. Equation (5) gives velocity and speed: 
v — 3u r + 6tu e and |v| = + (6r) 2 . 
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The motion goes out and also around , From (6) the acceleration is — 12t u r + 12u^ 
The same answers would come more slowly from R = 3f cos 2r i + 3t sin 2 1 j. 

This example uses polar coordinates, but the motion is not circular. One of Kepler's 
inspirations, after many struggles, was to get away from circles. 


KEPLER'S LAWS 


You may know that before Newton and Leibniz and calculus and polar coordinates, 
Johannes Kepler discovered three laws of planetary motion. He was the court mathe- 
matician to the Holy Roman Emperor, who mostly wanted predictions of wars, 
Kepler also determined the date of every Easter— no small problem. His triumph 
was to discover patterns in the observations made by astronomers (especially by 
Tycho Brahe), Galileo and Copernicus expected circles, but Kepler found ellipses. 

Law 1: Each planet travels in an ellipse with one focus at the sun. 

Law 2; The vector from sun to planet sweeps out area at a steady rate: dAjdt — 
constant. 

Law 3: The length of the planet's year is T — ka i,2 i where a — maximum distance 
from the center (not the sun) and k = is the same for all planets. 

With calculus the proof of these laws is a thousand times quicker. But Law 2 is the 
only easy one. The sun exerts a central force. Equation (1) gave R x v = H = constant 
for central forces. Replace R by ru r and replace v by equation (5): 


H = ru r x 


dr dO \ 2 d0 

i, a ' +r *,•*•= r * (u ' x “'’ ) - 


(7) 


This vector H is constant, so its length h = r 2 d&idt is constant. In polar coordinates, 
the area is dA = ?r 2 d0. This area dA is swept out by the planet (Figure 12,10), and 
we have proved Law 2: 


dAjdt = \r 2 dOidt — jh = constant. ( 8 ) 

Near the sun r is small. So dQjdt is big and planets go around faster. 


y = 1/C 



Now for Law 1, about ellipses. We are aiming for 1/r - C — D cos 0, which is the 
polar coordinate equation of an ellipse . It is easier to write q than 1/r, and find an 
equation for q. The equation we will reach is d 2 qjd8 2 + q = C. The desired q — 
C — D cos 0 solves that equation (check this), and gives us Kepler's ellipse. 
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The first step is to connect drjdt to dqjdd by the chain rule: 

± = l(l\ = d.*i = zl*lM = _ dq 

dt dt\q) q 2 dt q 1 d$ dt d$‘ 

Notice especially dSjdt — hjr 2 — hq 2 . What we really want are second derivatives: 

d 2 r u d(dq\ d (dq\dO ^d 2 q 

k Te{Te)T,--' hq lfi- (10) 

After this trick of introducing q , we are ready for physics. The planet obeys Newton’s 
Law F = ma, and the central force F is the sun’s gravity: 


F . GM d 2 r fd9\ 2 

— = a is = -r* — r — ) . 

m r 2 dt 2 \dt J 


(ID 


That right side is the u, component of a in (6). Change r to 1 jq and change dSjdt to 
hq 2 . The preparation in (10) allows us to rewrite d 2 r/dt 2 in equation (11), That 
equation becomes 

; d 2 q 1 
dd 2 q 

Dividing by — h 2 q 2 gives what we hoped for— the simple equation for q: 

d z qldd 2 + q = GM/h 2 = C (a constant). (12) 


- GM q 2 = -h 2 q 2 ^ 2 --Jhq 2 ) 2 . 


The solution is q — C — D cos 9. Section 9,3 gave this polar equation for an ellipse or 
parabola or hyperbola. To be sure it is an ellipse, an astronomer computes C and D 
from the sun’s mass M and the constant 0 and the earth’s position and velocity. The 
main point is that C > D. Then q is never zero and r is never infinite. Hyperbolas and 
parabolas are ruled out, and the orbit in Figure 12.10 must be an ellipse.! 


Astronomy is really impressive. You should visit the Greenwich Observatory in 
London, to see how Halley watched his comet. He amazed the world by predicting 
the day it would return. Also the discovery of Neptune was pure mathematics — 
the path of Uranus was not accounted for by the sun and known planets. LeVerrier 
computed a point in the sky and asked a Berlin astronomer to look. Sure enough 
Neptune was there. 

Recently one more problem was solved— to explain the gap in the asteroids around 
Jupiter. The reason is “chaos” — the three-body problem goes unstable and an 
asteroid won’t stay in that orbit. We have come a long way from circles. 


Department of Royal Mistakes The last pound note issued by the Royal Mint 
showed Newton looking up from his great book Principia Mathematical He is not 
smiling and we can see why. The artist put the sun at the center! Newton has just 
proved it is at the focus. True, the focus is marked 5 and the planet is P. But those 
rays at the center brought untold headaches to the Mint — the note is out of circula- 
tion. T gave an antique dealer three pounds for it (in coins). 


Kepler’s third law gives the time T to go around the ellipse — the planet’s year. 
What is special in the formula is a 3/2 — and for Kepler himself, the 15th of May 1618 
was unforgettable: “the right ratio outfought the darkness of my mind, by the great 
proof afforded by my labor of seventeen years on Brahe’s observations,” The second 


+An amateur sees the planet come around again, and votes for an ellipse. 
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law dAjdt = jh is the key, plus two facts about an ellipse — its area nab and the height 
b 2 /a above the sun: 

rr dA i , _ 2 nab 

1, The area A = I — dt = - AT must equaI Kab ’ s0 T = ~fT 

2. The distance r = 1/C at B = ir/2 must equal b 2 /a, so b = J a/C . 


The height b 2 ja is in Figure 12.10 and Problems 25-26. The constant C = GMjh 2 is 
in equation (12). Put them together to find the period: 


27tab _ 27ia a _ In 3/2 
~h~ ~ h VC" y/GM 0 


(13) 


To think of Kepler guessing a 112 is amazing. To think of Newton proving Kepler’s 
laws by calculus is also wonderful — because we can do it too. 


EXAMPLE 3 When a satellite goes around in a circle, find the time T. 

Let r be the radius and to be the angular velocity. The time for a complete circle 
(angle In) is T = 2n/a). The acceleration is GM/r 2 from gravity, and it is also rcu 2 for 
circular motion. Therefore Kepler is proved right: 

rw 2 = GMjr 2 => (t) = jGMjr* => T = 2nja) - 2 nr y 2 j^J GM. 


12.4 EXERCISES 


Read-through questions 

A central force points toward ° Then R x «/ 2 R/dt 2 = 0 
because b . Therefore R x dK/dt is a c _ (called H). 

In polar coordinates, the outward unit vector is o r = 

cos 0 i 4- <t . Rotated by 90° this becomes u„ = ? . The 

position vector R is the distance r times < The velocity 
v = dR/dt is a u, + h u 0 . For steady motion around 
the circle r — 5 with 0 = 4(, v is l and |v| is I and a 
is l< . 


For motion under a circular force, r 2 times l is con- 
stant. Dividing by 2 gives Kepler’s second law dAjdi = m . 

The first law says that the orbit is an n with the sun at 
o . The polar equation for a conic section is P = 
C - D cos 9. Using F = ma we found + a = C. So the 

path is a conic section; it must be an ellipse because ; . 

The properties of an ellipse lead to the period T — » , 

which is Kepler’s third law. 
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1 Find the unit vectors a,, and u# at the point (0, 2| The u r 

and iig components of v = i + j at that point are . 

2 Find u r and iig at (3, 3)* If v ^ i + j then v— u r » 

Equation (5) gives drjdt — and dBjdt = . 

3 At the point (1, 2), velocities in the direction will 

give drjdt — 0. Velocities in the direction will give 

d0jdt = 0, 

4 Traveling on the cardioid r= 1— cost? with d$jdt = 2 t 
what is v? How long to go around the cardioid (no integration 
involved)? 

5 If r = e* and 0 = 3 t, find v and a when f = l* 

6 If r = 1 and $ = sin f, describe the path and find v and a 
from equations (5-6)* Where is the velocity zero? 

7 (important) R = 4 cos 5f i + 4 sin 5t j = 4u r travels on a 
circle of radius 4 with 9 = St and speed 20* Find the compo- 
nents of v and a in three systems: i and j, T and N, u r and iv 

8 When is the circle r = 4 completed, if the speed is 8f? Find 
v and a at the return to the starting point (4, 0)* 

9 The iig component of acceleration is = 0 for a 

central force, which is in the direction of . Then 

r 2 d9jdt is constant (new proof) because its derivative is r times 


10 If r 2 d9jdt = 2 for travel up the line x = 1, draw a triangle 
to show that r = sec $ and integrate to find the time to reach 
( 1 , 1 ). 

11 A satellite is r = 10,000 km from the center of the Earth, 
traveling perpendicular to the radius vector at 4 km/sec. Find 
dOldt and ft * 


12 From |u r | = 1, it follows that dujdr and di^jdB are 

to u r (Section 12.3)* In fact dujdr is and 

dujdd is . 

13 Momentum is mv and its derivative is ma = force. Angular 

momentum is mH = mR x v and its derivative is = 

torque. Angular momentum is constant under a central force 
because the is zero. 


14 To find (and remember) v and a in polar coordinates, start 
with the complex number re* and take its derivatives: 


d 2 R 
dt 2 


n ig dR dr iA t dB 

R = re* — = — e & + ir—e * 

dt dt dt 


+ + + + 


Key idea: The coefficients of e* and ie* are the u, and u^ 
components of R, v, a: 


„ dr dO 

R = Hi r + 0i4 v = -u r + r—u e 


(a) Fill in the five terms from the derivative of dRjdt 

(b) Convert e* to u r and ie* to to find a 


(c) Compare R, v, a with formulas (5-6) 

(d) (for instructors only) Why does this method work? 

Note how e* = cos $ + i sin 6 corresponds to u r = cos 9 i 
+ sin0j. This is one place where elec trical engineers are 
allowed to write j instead of i for ^/—i. 

15 If the period is T find from (13) a formula for the distance 
a. 

16 To stay above New York what should be the period of a 
satellite? What should be its distance a from the center of the 
Earth? 

17 From T and a find a formula for the mass M* 

18 If the moon has a period of 28 days at an average distance 

of a = 380,000 km, estimate the mass of the . 

19 The Earth takes 365? days to go around the sun at a 
distance 93 million miles # 150 million kilometers. Find 
the mass of the sun, 

20 True or false: 

(a) The paths of all comets are ellipses. 

(b) A planet in a circular orbit has constant speed* 

(c) Orbits in central force fields are conic sections. 

21 2* 10 7 in what units, based on the Earth's mass 
M = 6 * 10 2+ kg and the constant G = 6*67 * 10“ 11 Nm 2 /kg 2 ? 
A force of one kg • meter/sec 2 is a Newton N* 

22 If a satellite circles the Earth at 9000 km from the center, 
estimate its period T in seconds* 

23 The Viking 2 orbiter around Mars had a period of about 
10,000 seconds. If the mass of Mars is M = 6.4 • 10 23 kg, what 
was the value of a? 

24 Convert 1/r =* C- Dcos 0, or l=Cr-Dx, into the xy 
equation of an ellipse. 

25 The distances a and c on the ellipse give the constants 
in r = \ j{C — D cos 0)* Substitute 0 = 0 and 0 = n as in 
Figure 12*10 to find D = cj{a 2 -c 2 ) and C = a/{a 2 — c 2 ) = 
ajb 2 . 

26 Show that x = — c, y-b 2 ja lies on the ellipse 
x 2 \a 2 + y 2 jb 2 = 1* Thus y is the height 1/C above the sun in 
Figure 12.10. The distance from the sun to the center has c z = 
a 2 - b 2 . 

27 The point x = a cos 2 ntjT t y = b sin IntjT travels around 
an ellipse centered at (0, 0) and returns at time T. By symmetry 
it sweeps out area at the same rate at both ends of the major 
axis. Why does this break Kepler’s second law? 

28 If a central force is F = — mo(r)u r , explain why 
d 2 rjdi 2 - r{dBjdt) 2 = — ajr). What is air) for gravity? 
Equation (12) for q = 1 jr leads to + q = r 2 o(r). 

29 When F = 0 the body should travel in a straight * 
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The equation q m + q = 0 allows q = cos 6 , in which case the 

path 1 jr = cos 0 is . Extra credit: Mark off equal 

distances on a line, connect them to the sun, and explain why 
the triangles have equal area. So dAjdt is still constant. 

30 The strong nuclear force increases with distance, a{r) = r 
It binds quarks so tightly that up to now no top quarks have 
been seen (reliably). Problem 28 gives q 66 + q — 1,<j\ 

(a) Multiply by q 6 and integrate to find {ql - \q 2 = 
+ C, 

*(b) Integrate again (with tables) after setting u = q 2 , u$ = 
2qq 0 . 

31 The path of a quark in 30(b) can be written as 

r 2 {A T B cos 20) — L Show that this is the same as the ellipse 
{A T JJ)x 2 + (A — = 1 with the origin at the center. The 

nucleus is not at a focus, and the pound note is correct for 
Newton watching quarks, (Quantum mechanics not 
accounted for,) 

32 When will Halley’s comet appear again? It disappeared in 


1986 and its mean distance to the sun (average of a + c and 
a — c) is a = ! .6 • 10^ kilometers. 

33 You are walking at 2 feet/second toward the center of a 
merry-go-round that turns once every ten seconds. Starting 
from r = 20, 0 = 0 fiud r(f), 0(f), v(f), a(f) and the length of your 
path to the center. 

34 From Kepler’s laws r = 1/(C — D cos 0) and r 2 d0!dt = h , 
show that 

1. dridt = -Dh sin 0 2. dbidt 2 = 0 - C 

3. d 2 rjdt 2 — r[d0;dt) 2 = —Ch 2 jr 2 . 

When Newton reached 3, he knew that Kepler's laws required 
a central force of Ch 2 /r 2 . This is his inverse square law r Then 
he went backwards, in our equations (8-12), to show that this 
force yields Kepler’s laws. 

35 How long is our year? The Earth’s orbit has a = 
149,37* 10* kilometers. 
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CHAPTER 13 


Partial Derivatives 


This chapter is at the center of multidimensional calculus. Other chapters and other 
topics may be optional; this chapter and these topics are required. We are back to 
the basic idea of calculus — the derivative. There is a function /, the variables move a 
little bit, and / moves. The question is how much / moves and how fast. Chapters 
1-4 answered this question for/{x), a function of one variable. Now we have/(x, y) 
or f[x, y, z) — with two or three or more variables that move independently. As x and 
y change, / changes. The fundamental problem of differential calculus is to connect 
Ax and Ay to A /. 

Calculus solves that problem in the limit. It connects dx and dy to df. In using this 
language I am building on the work already done. You know that dffdx is the limit 
of AfjAx. Calculus computes the rate of change — which is the slope of the tangent 
line. The goal is to extend those ideas to 

fix, y)=x 2 - y 2 or f[x, y) = Jx 2 + y 2 or f[x, y, z) = 2x + 3y + 4z. 

These functions have graphs, they have derivatives, and they must have tangents. 

The heart of this chapter is summarized in six lines. The subject is differential 
calculus — small changes in a short time. Still to come is integral calculus — adding 
up those small changes. We give the words and symbols for f[x, y), matched with the 
words and symbols for J[x). Please use this summary as a guide, to know where 
calculus is going. 


Curve y =f[x) vs. Surface z =f[x, y) 

-f becomes two partial derivatives and — 
dx dx dy 


€1 

dx 2 


—4 becomes four second derivatives 


37 _37 _37 37 

dx 2 ' dxdy’ dydx’ dy 2 

df df df 

Afx Ax becomes the linear approximation Af x — Ax + — Ay 


dx 


dy 


dj" 

tangent line becomes the tangent plane z ~ z 0 ** ^(x - x 0 ) + ^(y - y 0 ) 

dy dy dx , .... , dz dz dx , dzdy 

~r = -r-r becomes the chain rule — = — 

dt dx dt 

df 


dt dx dt dy dt 


dx 


= 0 becomes two maximum-minimum equations — - = 0 and — = 0. 

dx dy 
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13 Partial Derivatives 


13.1 Surfaces and Level Curves 

The graph of y = f(x) is a curve in the xy plane. There are two variables — x is 
independent and free, y is dependent on x. Above x on the base line is the point (x, y) 
on the curve. The curve can be displayed on a two-dimensional printed page. 

The graph of z = /(x, y) is a surface in xyz space. There are three variables — x and 
y are independent, z is dependent. Above (x, y) in the base plane is the point (x, y, z) 
on the surface (Figure 13.1). Since the printed page remains two-dimensional, we 
shade or color or project the surface. The eyes are extremely good at converting two- 
dimensional images into three-dimensional understanding — they get a lot of practice. 
The mathematical part of our brain also has something new to work on — two partial 
derivatives. 

This section uses examples and figures to illustrate surfaces and their level curves. 
The next section is also short. Then the work begins. 

EXAMPLE 1 Describe the surface and the level curves for z =J\x y y) = ^/x 2 + y 2 . 

The surface is a cone. Reason: yjx 2 + y 2 is the distance in the base plane from (0, 0) 
to (x, y). When we go out a distance 5 in the base plane, we go up the same distance 
5 to the surface. The cone climbs with slope 1. The distance out to (x, y) equals the 
distance up to z (this is a 45° cone). 

The level curves are circles. At height 5, the cone contains a circle of p oints — all 
at the same “level” on the surface. The plane z = 5 meets the surface z = y/x 2 + y 2 at 
those points (Figure 13.1b). The circle below them (in the base plane) is the level 
curve. 

DEFINITION A level curve or contour line of z = /(x, y) contains all points (x, y) that 
share the same value /(x, y) = c. Above those points, the surface is at the height z = c. 

There are different level curves for different c. To see the curve for c = 2, cut 
through the surface with the horizontal plane z = 2. The plane meets the surface 
above the points where /(x, y) = 2. The level curve in the base plane has the equation 
J\x y y) = 2. Above it are all the points at “level 2” or “level c” on the surface. 

Every curve /(x, y) = c is labeled by its constant c. This produces a contour map 
(t he base plane is full of curves). For the cone, the level curves are given by 
yjx 2 + y 2 = c, and the contour map consists of circles of radius c. 

Question What are the level curves of z = /(x, y) = x 2 + y 2 ? 

Answer Still circles. But the surface is not a cone (it bends up like a parabola). The 
circle of radius 3 is the level curve x 2 4- y 2 = 9. On the surface above, the height is 9. 




Fig. 13.1 The surface for z =J\x y y) = y/x 2 -I- y 2 is a cone. The level curves are circles. 


13.1 Surfaces and Level Curves 
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EXAMPLE 2 For the linear function /(x, y) = 2x 4- y, the surface is a plane. Its level 
curves are straight lines. The surface z = 2x + y meets the plane z = c in the line 
2x + y = c. That line is above the base plane when c is positive, and below when c is 
negative. The contour lines are in the base plane. Figure 13.2b labels these parallel 
lines according to their height in the surface. 

Question If the level curves are all straight lines, must they be parallel? 

Answer No. The surface z = y/x has level curves y/x = c. Those lines y = cx swing 
around the origin, as the surface climbs like a spiral playground slide. 



x 




Fig. 13.2 A plane has parallel level lines. The spiral slide z = y/x has lines yjx — c. 

EXAMPLE 3 The weather map shows contour lines of the temperature function. Each 
level curve connects points at a constant temperature. One line runs from Seattle to 
Omaha to Cincinnati to Washington. In winter it is painful even to think about the 
line through L.A. and Texas and Florida. USA Today separates the contours by 
color, which is better. We had never seen a map of universities. 



Fig. 13.3 The temperature at many U.S. and Canadian universities. Mt. Monadnock in New Hampshire is said to be the most 
climbed mountain (except Fuji?) at 125,000/year. Contour lines every 6 meters. 
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Question From a contour map, how do you find the highest point? 

Answer The level curves form loops around the maximum point. As c increases the 
loops become tighter. Similarly the curves squeeze to the lowest point as c decreases. 

EXAMPLE 4 A contour map of a mountain may be the best example of all. Normally 
the level curves are separated hy 100 feet in height. On a steep trail those curves are 
bunched together — the trail climbs quickly. In a flat region the contour lines are far 
apart. Water runs perpendicular to the level curves. On my map of New Hampshire 
that is true of creeks but looks doubtful for rivers. 

Question Which direction in the base plane is uphill on the surface? 

Answer The steepest direction is perpendicular to the level curves. This is important. 
Proof to come. 

EXAMPLE 5 In economics x 2 y is a utility function and x z y = c is an indifference curve. 

The utility function x 1 y gives the value of x hours awake and y hours asleep. Two 
hours awake and fifteen minutes asleep have the value / = (2 2 }(|). This is the same as 
one hour of each: /=(1 2 )(1). Those lie on the same level curve in Figure 13.4a. We 
are indifferent, and willing to exchange any two points on a level curve. 

The indifference curve is "convex.'' We prefer the average of any two points. The 
line between two points is up on higher level curves. 

Figure 1 3.4b shows an extreme case. The level curves are straight lines 4 x + y = c. 
Four quarters are freely substituted for one dollar. The value is /= 4 x + y dollars. 

Figure 13.4c shows the other extreme. Extra left shoes or extra right shoes are 
useless. The value (or utility) is the smaller of x and y. That counts pairs of shoes. 


asleep 


y quarters 


right shoes 





Hg. 13.4 Utility functions x 2 y * 4x + y, min(x, y). Convex, straight substitution, complements. 


13.1 EXERCISES 


Read-through questions 

The graph of z =f[x, y) is a Q in b -dimensional 
space. The c curve J\x, y) = 7 lies down in the base plane. 
Above this level curve are all points at height d in the 
surface. The » z = 7 cuts through the surface at those 
points. The level curves ffx v) = < are drawn in the xy 

plane and labeled by fl . The family of labeled curves is 
a h map. 


For z =J\x, y) — x z — y 2 , the equation for a level curve is 
_!_. This curve is a J . For z = x — y the curves are 
* . Level curves never cross because _!_. They crowd 

together when the surface is . The curves tighten to a 
point when n , The steepest direction on a mountain is 
o to the p . 
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1 Draw the surface z =f{x, y) for these four functions: 

/ 2 = 2-V?T7 

A = 2~fa 2 +y 2 ) / 4 = 1 + e _ * l_yl 

2 The level curves of all four functions are . They 

enclose the maximum at . Draw the four curves 

f[x, y)= 1 and rani them by increasing radius. 

3 Set y = 0 and compute the x derivative of each function 
at x — 2. Which mountain is flattest and which is steepest at 
that point? 

4 Set y ~ 1 and compute the x derivative of each function 
at x = 1. 

For / 5 to /i o draw the level curves /— 0, I, 2. Abo /= — 4, 

5 / s = x - y 6 f 6 = (x + y) 2 

7 f-,~xe~ y 8/„ = sin(x-y) 

9/,-y-x 2 10/ IO = y/x 2 

11 Suppose the level curves are parallel straight lines. Does 
the surface have to be a plane? 

12 Construct a function whose level curve /=0 is in two 
separate pieces. 

13 Construct a function for which /= 0 is a circle and /= 1 
is not. 

14 Find a function for which /= 0 has infinitely many pieces. 

15 Draw the contour map for f—xy with level curves f— 
— 2, —1, 0, 1, 2. Describe the surface. 

16 Find a function f[x, y) whose level curve /= 0 consists of 
a circle and all points inside it. 

Draw two level curves in 17-20 . Are they ellipses parabolas 
or hyperbolas? Write J — 2x = c as — c + 2x 

before squaring both sides. 

17 / =v /4x 2 + y 2 18 /= v /4x 2 + y 2 — 2x 

19 / =v /5x 2 + y 2 - 2x 20/= v /3PTy 2 — 2x 

21 The level curves of f—(y — 2)j{x — 1) are __ 

through the point (1, 2) except that this point is not . 


22 Sketch a map of the US with lines of constant temperature 
(isotherms) based on today’s paper. 

23 (a) The contour lines of z = x 2 + y 2 — 2x — 2y are circles 

around the point , where z is a minimum. 

(b)The contour lines of /= are the circles 

x 2 + y 2 = c + 1 on which /= c. 

24 Draw a contour map of any state or country (lines of 
constant height above sea level). Florida may be too flat. 

25 The graph of w =? F(x, y t z) is a -dimensional sur- 
face in xyzw space. Its level sets F(x, y, z) = c are - 

dimensional surfaces in xyz space. For w = x — 2y + z those 

level sets are For w — x 1 + y 2 + z 2 those level sets 

are , 

26 The surface x 2 + y 2 — z 2 = — 1 is in Figure 13.8. There is 

empty space when z 2 is smaller than I because . 

27 The level sets of F = x 2 + y 2 + qz 2 look like footballs 

when q is , like basketballs when q is . 

and like frisbees when q is . 

28 Let T(x> y) be the driving lime from your home at (0, 0) 
to nearby towns at (x, y). Draw the level curves. 

29 (a) The level curves of f[x t y) *= sm(x — y) are , 

(b) The level curves of g(x 7 y) — sin(x 2 — y 2 ) are . 

(c) The level curves of h(x, y) - sin(x — y 2 ) are . 

30 Prove that if xiyi = 1 and x 2 y 2 = 1 then their average 
x =Hxi +* 2 ), y = i(yi +> 2 ) has xy Z L The function f=xy 
has convex level curves (hyperbolas). 

31 The hours in a day are limited by x + y = 24. Write x 2 y 
as x 2 (24 — x) and maximize to find the optimal number of 
hours to stay awake. 

32 Near x = 16 draw the level curve x 2 y - 2048 and the line 
x + y = 24. Show that the curve is convex and the line is 
tangent. 

33 The surface z = 4x + y is a . The surface z = 

min(x, y) is formed from two . We are willing to 

exchange 6 left and 2 right shoes for 2 left and 4 right shoes 
but better is the average . 

34 Draw a contour map of the top of your shoe. 



13.2 Partial Derivatives 



The central idea of differential calculus is the derivative. A change in x produces a 
change in /. The ratio A//Ax approaches the derivative, or slope, or rate of change. 
What to do if /depends on both x and y? 

The new idea is to vary x and y one at a tiirie. First, only x moves. If the function 
is x + xy, then A/ is Ax + yAx. The ratio A//Ax is 1 + y. The “x derivative” of x + xy 
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is 1 I- y. For all functions the method is the same: Keep y constant, change x, take the 
limit of A/7 Ax: 


DEFINITION 


y) = lim ^ = 

dx ax-dAx 


lim f( x + Ax 'y)-tt x ' 

ix^o Ax 


a) 


On the left is a new symbol dfjdx . It signals that only x is allowed to vary — Bffdx is 
a partial derivative. The different form d of the same letter (still say “d”) is a reminder 
that x is not the only variable. Another variable y is present but not moving. 


EXAMPLE 1 ftx>y) = x 2 y 2 + xy + y 


^(x,y)=2xy 2 + y + 0. 


Do not treat y as zerol Treat it as a constant, like 6. Its x derivative is zero. 
If /(x) = sin 6x then dfjdx = 6 cos 6x. I/ftx, y) = sin xy then dfjdx — y cos xy. 

Spoken aloud, dfjdx is still tl df d x.” It is a function of x and y. When more is 
needed, call it “the partial of/with respect to x.” The symbol/' is no longer available, 
since it gives no special indication about x. Its replacement f x is pronounced ‘/x” or 
“/sub x,” which is shorter than dfjdx and means the same thing. 

We may also want to indicate the point (x 0 , y 0 ) where the derivative is computed: 


df 

^(*o, yo) or fjx 0 ,y 0 ) OT 


df 


dx 


l{*o* yo) 


or just 



EXAMPLE 2 /(x, y) = sin 2x cos y f x — 2 cos 2x cos y (cos y is constant for djdx) 


The particular point (x 0 ,y 0 ) is (0,0). The height of the surface is /(0, 0)=0, 
The slope in the x direction is f x = 2. At a different point x 0 = te, y 0 = n we find 
fjit, n)= -2. 


Now keep x constant and vary y. The ratio A// Ay approaches Sfjdy : 


A/ 

/(x, y) = lim — = hm 
Ay -»0 Ay by-' 


f{x, y + Ay)-f\x, y) 
Ay 


( 2 ) 


This is the slope in the y direction. Please realize that a surface can go up in the x 
direction and down in the y direction. The plane/Tx, y) - 3x - 4 y has f x = 3 (up) and 
f y = - 4 (down). We will soon ask what happens in the 45° direction. 


EXAMPLE 3 = + f 


df _ x _ x 
Jx 2 + y 2 ~ f 


of _ y _ y 

dy v* 2 +7 


The jc derivative of ^/x 2 + y 1 is really one-variable calculus, because y is constant. 
The exponent drops from \ to — and there is 2x from the chain rule. This distance 
function has the curious derivative dfjdx — xjf. 

The graph is a cone. Above the point (0,2) the height is s /0 2 + 2 2 = 2. The 
partial derivatives are f x = 0/2 and f y = 2/2. At that point, Figure 13.5 climbs in the 
y direction. It is level in the x direction. An actual step Ax will increase 0 2 + 2 2 to 
(Ax) 2 + 2 2 . But this change is of order (Ax) 2 and the x derivative is zero. 


Figure 13.5 is rather important. It shows how dfjdx and Sfjdy are the ordinary 
derivatives of fix, y 0 ) and J\x 0 , y). It is natural to call these partial functions. The first 
has y fixed at y 0 while x varies. The second has x fixed at x 0 while y varies. Their 
graphs are cross sections down the surface — cut out by the vertical planes y = y 0 and 
x = x 0 . Remember that the level curve is cut out by the horizontal plane z = c. 
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The limits of Af/Ax and AfjAy are computed as always. With partial functions 
we are back to a single variable. The partial derivative is the ordinary derivative of a 
partial function (constant y or constant jc). For the cone, dfjdy exists at all points 
except (0, 0). The figure shows how the cross section down the middle of the cone 
produces the absolute value function: /(0, y) = jyj. it has one-sided derivatives but not 
a two-sided derivative. 

Similarly df/dx will not exist at the sharp point of the cone. We develop the idea 
of a continuous function f(x, y) as needed (the definition is in the exercises). Each 
partial derivative involves one direction, but limits and continuity involve all direc- 
tions. The distance function is continuous at (0, 0), where it is not differentiable. 

EXAM PLE 4 f(x, y) = y 2 -x 2 df/dx - - 2x dfjdy = 2 y 

Move in the x direction from (1,3). Then y 2 - x 2 has the partial function 9 - x 2 . 
With y fixed at 3, a parabola opens downward, In the y direction (along x = 1) the 
partial function y 2 - 1 opens upward. The surface in Figure 13.6 is called a hyperbolic 
paraboloid, because the level curves y 2 - x 2 = c are hyperbolas. Most people call it a 
saddle, and the special point at the origin is a saddle point. 

The origin is special for y 2 — x 2 because both derivatives are zero. The bottom of 
the y parabola at (0, 0) is the top of the x parabola. The surface is momentarily flat in 
all directions, it is the top of a hill and the bottom of a mountain range at the same 
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time. A saddle point is neither a maximum nor a minimum, although both derivatives 
are zero. 

Note Do not think that f\x 9 y) must contain y 2 and x z to have a saddle point. The 
function 2 xy does just as well. The level curves 2 xy = c are still hyperbolas. The 
partial functions 2 xy 0 and 2 x 0 y now give straight lines— which is remarkable. Along 
the 45° line x = y, the function is 2x 2 and climbing. Along the -45° line x = - y, 
the function is — 2x 2 and falling. The graph of 2 xy is Figure 13.6 rotated hy 45°. 

EXAMPLES 5“6 /(x, y, z) = x 2 + y 2 + z 2 P(T 9 V) ~ nRT/V 

Example 5 shows more variables. Example 6 shows that the variables may not be 
named x and y. Also, the function may not be named /! Pressure and temperature 
and volume are P and T and V. The letters change but nothing else: 

dP/dT- nR/V dP/dV= - nRT/V 2 (note the derivative of 1/K). 

There is no dPjdR because R is a constant from chemistry — not a variable. 

Physics produces six variables for a moving body — the coordinates x, y, z and the 
momenta p x9 p y9 p z . Economics and the social sciences do better than that. If there 
are 26 products there are 26 variables — sometimes 52, to show prices as well as 
amounts. The profit can be a complicated function of these variables. The partial 
derivatives are the marginal profits , as one of the 52 variables is changed. A spreadsheet 
shows the 52 values and the effect of a change. An infinitesimal spreadsheet shows 
the derivative. 


SECOND DERIVATIVE 

Genius is not essential, to move to second derivatives. The only difficulty is that two 
first derivatives f x and f y lead to four second derivatives and f xy and f yx and f yy . 
(Two subscripts: f xx is the x derivative of the x derivative. Other notations are 
d 2 ffdx 2 and d 2 f/dxdy and 3 2 f/dydx and d 2 fjdy 2 ) Fortunately f xy equals f yx9 as we 
see first by example. 

EXAMPLE 7 /= x/y has f x = 1/y, which has /„ = 0 and f xy = - 1/y 2 . 

The function x/y is linear in x (which explains f xx = 0). Its y derivative is/, = - x/y 2 . 
This has the x derivative / yx = “1/y 2 - The mixed derivatives f xy and f yx are equal . 

In the pure y direction, the second derivative is f yy = 2 x/y 3 . One-variable calculus 
is sufficient for all these derivatives, because only one variable is moving. 

EXAMPLE 8 /= 4x 2 + 3xy + y 2 has f x — 8x + 3y and f y — 3x + 2y. 

Both “cross derivatives" f xy and f yx equal 3. The second derivative in the x direction 
is d 2 fjdx 2 — 8 or / xx = 8. Thus “fxx" is “d second fdx squared.” Similarly 
d 2 f/dy 2 — 2. The only change is from d to 5. 

If f( x > y ) h* 1 * continuous second derivatives then f xy =/ yx . Problem 43 sketches a proof 
based on the Mean Value Theorem. For third derivatives almost any example shows 
that f xxy — f xyx — f yxx is different from/,,* ~fyxy ~fxyy* 

Question How do you plot a space curve x(t) 9 y(t), z(t) in a plane ? One way is to look 
parallel to the direction (1, 1, 1). On your XY screen, plot A"=(y-x)/ v /2 and 
Y = (2z — x - yVv^- The line x = y = z goes to the point (0, 0)! 
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How do you graph a surface z =/(*, y)? Use the same X and Y. Fix x and let y 
vary, for curves one way in the surface. Then fix y and vary x, for the other partial 
function. For a parametric surface like x — (2 + v sin £u) cos u, y = (2 + v sin \ u) sin u, 
2 = v cos in, vary u and then v. Dick Williamson showed how this draws a one-sided 
“Mobius strip.” 
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43 The proof of f xy =f yx studies /(x, y) in a small rectangle. 
The top-bottom difference is g(x) = /(x, B) — f(x t A). The 
difference at the corners 1, 2, 3, 4 is: 


distance from (x Bi yj to (a, b) is ____ and it approaches 

* 4 For any e > 0 there is an N such that the distance 

< e for all n > . 


= g(b) — g(fl) (definition of g) 

= (b - (Mean Value Theorem) 

= (b - a)\_Uc, fl) -f x { c, A)] (compute g r ) 

= (b~a)(B-A)fJc,C) (MVT again) 

(a) The right-left difference is h(y) =f(b t y) — /(a, y). The 
same Q is — M.A), Change the steps to reach Q = 
[B-A){b-a)f^c*, C*). 

(b) The two forms of Q make f xy at (c, C) equal to f yx at 
(c* t C*}, Shrink the rectangle toward ( a , A). What assump- 
tion yields f xy =f yx at that typical point? 



B 
C 
C * 

A 


44 Find df/dx and dfjdy where they exist, based on equations 
(1) and (2). 

(a) /= [.xyl (b)/= x 2 + y 2 if x * 0, /= 0 if x = 0 


46 Find (x 2 ,y 2 ) and (* 4 * and "the limit (a, b) if it exists. 
Start from (x 0l y 0 ) = (1, 0). 

(a) (x nf y n ) = (lf[n + \lni(n+\)) 

(b) (x IT3 y h ) = (x n _ 11 y B _ : ) 

(^) ^n) = {.Vfj - 1 ■> - 1 ) 

(d) (x n , y„) = (x fl - 1 -f- y n — i , x n _ i y B -i) 

47 {Limit of f{x, y)) 1 Informal definition: the numbers 
f(x„, y„) approach L when the points (x„, y n ) approach (a, b). 
2 Epsilon-delta definition: For each e > 0 there is a 5 >0 such 

that |/(xj y) - L| is less than when the distance from 

(x, y) to [a, b) is . The value of / at (a,b) is not 

involved. 

48 Write down the limit L as (x, y) -* (a t b)* At which points 
(a, b) does /(x, y) have no limit? 

(*)f{x,y) = J?T? (b)_/tx, y) — xjy 

(c) /(x, y) = l/(x + y) (d}^x, y) = xy/(x 2 + y 2 ) 

In (d) find the limit at (0, 0) along the fine y = mx* The limit 
changes with m , so L does not exist at (0, 0). Same for x/y. 

49 Definition of continuity : f(x, y) is continuous at (a t b) if 
f(a, b) is defined and f(x f y) approaches the limit _____ as 
(x, y) approaches {a, b ). Construct a function that is not con- 
tinuous at (1, 2). 

50 Show that x 2 y/(x* + y 2 ) -* 0 along every straight line 

y = mx to the origin. But traveling down the parabola y = x 2 , 
the ratio equals . 


Questions 45-52 are about limits in two dimensions. 

45 Complete these four correct definitions of limit: 1 The 
points {x ff , yj approach the point (a, b) if x* converges to a 

and . 2 For any circle around { a , b), the points (x ni y n ) 

eventually go _ the circle and stay . 3 The 


51 Can you define /(Q, 0} so that /(x, y) is continuous at (0, 0)? 

00/= W + 1* - f i (b)/= 0 + xY (c)/= X 1 + ', 

52 Which functions approach zero as (x, y) -* (0, 0) and why? 


(a) 


xy* 


x 2 + y 2 


(b) 


x 2 y 2 
x 4 + / 


(c) 


x m y” 
x m + /' 



13.3 Tangent Planes and Linear Approximations 



Over a short range, a smooth curve y=f(x) is almost straight. The curve changes 
direction, but the tangent line y — y 0 =f\x 0 )(x - x 0 ) keeps the same slope forever* 
The tangent line immediately gives the linear approximation to y =f(x): 

y*y<> + /'(*o)(x -*())■ 

What happens with two variables'’ The function is z = f(x, y), and its graph is a 
surface * We are at a point on that surface, and we are near-sighted* We don’t see far 
away. The surface may curve out of sight at the horizon, or it may be a bowl or a 
saddle. To our myopic vision, the surface looks flat. We believe we are on a plane 
(not necessarily horizontal), and we want the equation of this tangent plane. 
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Notation The basepoint has coordinates x 0 and y 0 . The height on the surface is 
Other letters are possible: the point can be (a, h) with height vv. The 
subscript 0 indicates the value of x or y or z or df/dx or df/dy at the point , 

With one variable the tangent line has slope df/dx , With two variables there 
are two derivatives df/dx and df/dy, At the particular point, they are (5f/8x) 0 and 
(dfjdy) 0 . Those are the slopes of the tangent plane. Its equation is the key to this 
chapter: 


13A The tangent plane at (x 0 ,y Ql z 0 ) has the same slopes as the surface z — 
/(x, y). The equation of the tangent plane {a linear equation) is 

2_2 »"(l )„ (jc " x ” )+ (D„ (, ' _ ^ (i > 

The normal vector N to that plane has components [df/dx) Q , (df/dy ) 0 , “1, 


EXAMPLE 1 Find the tangent plane to z = 14 - x 2 - y 2 at (x 0> y 0> z 0 ) = (1, 2, 9). 

Solution The derivatives are df/dx = —2x and df/dy — -2 y, When x= 1 and y = 2 
those are (df/dx) 0 = — 2 and (df/dy) 0 ~ -4. The equation of the tangent plane is 

z - 9 = — 2(x ~1) — 4{y — 2) or z + 2x + 4y=19. (2) 

This z(x, y) has derivatives -2 and “4, just like the surface. So the plane is tangent. 

The normal vector N has components - 2, - 4, - 1. The equation of the normal 
line is (x, y, z) — {1, 2, 9) 4- 1{- 2. —4, ^1), Starting from (!, 2, 9) the line goes out along 
N — perpendicular to the plane and the surface. 




Frg* 13,7 The tangent plane contains the _v and y tangent lines, perpendicular to N. 


Figure 13.7 shows more detail about the tangent plane. The dotted lines are the x 
and y tangent lines. They lie in the plane. All tangent lines lie in the tangent plane! 
These particular lines are tangent to the "partial functions” — where y is fixed at y 0 = 
2 or x is fixed at x 0 = 1. The plane is balancing on the surface and touching at the 
tangent point. 

More is true. In the surface, every curve through the point is tangent to the plane. 
Geometrically, the curve goes up to the point and ^kisses” the plane,! The tangent 
T to the curve and the normal N to the surface are perpendicular: T* N - 0. 


f A safer word is “osculate.” At saddle points the plane is kissed from both sides. 
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EXAMPLE 2 Find the tangent plane to the sphere z 2 = 14 - x 2 - y 2 at (1, 2, 3). 

Solution Instead of z = 14 - x 2 - y 2 we have z = 14 - x 2 - y 2 . At x 0 = 1, y 0 = 2 

the height is now z 0 = 3. The surface is a sphere with radius .^14. The only trouble 
from the square root is its derivatives: 


dz 

dx 



2x) 

v 14 - x 2 - y 2 


and 


Sz = if- 2y) 
dy ,/l4- x 2 - y 2 


(3) 


At (1, 2) those slopes are - i and - The equation of the tangent plane is linear: 
z — 3 = — i(x — 1) — j(y - 2). I cannot resist improving the equation, by multiplying 
through by 3 and moving all terms to the left side: 


tangent plane to sphere'. l(x — 1) + 2( y — 2) + 3(z - 3) = 0. (4) 

If mathematics is the “science of patterns,” equation (4) is a prime candidate for study. 
The numbers 1,2,3 appear twice. The coordinates are (x 0 , y 0 >2o) = (l» 2, 3). The 
normal vector is li + 2j + 3k. The tangent equation is lx + 2y + 3z = 14. None of this 
can be an accident, but the square root of 14 - x 2 - y 2 made a simple pattern look 
complicated. 

This square root is not necessary. Calculus offers a direct way to find dz/dx — 
implicit differentiation. Just differentiate every term as it stands: 

x 2 + y 2 + z 2 =14 leads to 2x + 2z dz/dx = 0 and 2y + 2zdz/dy = 0. (5) 

Canceling the 2’s, the derivatives on a sphere are - x/z and - y/z. Those are the same 
as in (3). The equation for the tangent plane has an extremely symmetric form: 


z-z 0 = - — (x-x 0 )- — (y~y 0 ) or x o (x-x o ) + >'o(j ; ->'o) + 2o(z-Zo) = 0 . (6) 

z 0 Z o 

Reading off N = x 0 i + yj + z 0 k from the last equation, calculus proves something 
we already knew: The normal vector to a sphere points outward along the radius. 



Fig. 13.8 Tangent plane and normal N for a sphere. Hyperboloids of 1 and 2 sheets. 


THE TANGENT PLANE TO F(x, y,z)=c 

The sphere suggests a question that is important for other surfaces. Suppose the 
equation is F(x, y, z) = c instead of z = /(x, y). Can the partial derivatives and tangent 
plane be found directly from F? 

The answer is yes. It is not necessary to solve first for z. The derivatives of F, 
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computed at (x 0 ,yo,Zo)> give a second formula for the tangent plane and normal 
vector. 


13B The tangent plane to the surface F(x, y, z) = 

c has the linear equation 

(%) ( x ~ x o) + (Pj ^ -*>)+(f 

-)(z- 2o ) = 0. (7) 

\OXjo \cyjo \ (1 

: /0 

-TM. , . M fdF\ . ( dF\ . fdF 

The normal vector isN= — i+ — i+ — 

\8xJo \8yJo \8z 

). k ' 


Notice how this includes the original case z = /(x, y). The function F becomes 
J\x,y)-z. Its partial derivatives are df/dx and df/dy and -1. (The —1 is from the 
derivative of - z.) Then equation (7) is the same as our original tangent equation (1). 

EXAMPLE 3 The surface F = x 2 + y 2 — z 2 = c is a hyperboloid. Find its tangent plane. 

Solution The partial derivatives are F x = 2x, F y = 2 y, F, = - 2z. Equation (7) is 

tangent plane. 2x 0 (x - x 0 ) + 2y 0 (y - y 0 ) " 2z 0 (z - z 0 ) = 0. (8) 

We can cancel the 2’s. The normal vector is N = x 0 i + y 0 j - z 0 k. For c > 0 this 
hyperboloid has one sheet (Figure 13.8). For c = 0 it is a cone and for c < 0 it breaks 
into two sheets (Problem 13.1.26). 


DIFFERENTIALS 


Come back to the linear equation z — z 0 = (dz/8x) 0 (x — x 0 ) + ( dz/dy) 0 (y - y 0 ) for the 
tangent plane. That may be the most important formula in this chapter. Move along 
the tangent plane instead of the curved surface. Movements in the plane are dx and 
dy and dz — while Ax and A y and Az are movements in the surface. The d's are 
governed by the tangent equation — the A’s are governed by z = /(x, y). In Chapter 2 
the d' s were differentials along the tangent line: 

dy = (dy/dx)dx (straight line) and Ay % (dy/dx) Ax (on the curve). (9) 

Now y is independent like x. The dependent variable is z. The idea is the same. The 
distances x - x 0 and y - y 0 and z - z 0 (on the tangent plane) are dx and dy and dz. 
The equation of the plane is 

dz = (dz/dx) 0 dx + {dz/dy) 0 dy or df =f x dx +f y dy. (10) 

This is the total differential. All letters dz and df and dw can be used, but dz and df 
are not used. Differentials suggest small movements in x and y; then dz is the resulting 
movement in z. On the tangent plane, equation (10) holds exactly. 

A “centering transform’' has put x 0 ,y 0i z 0 at the center of coordinates. Then the 
“zoom transform’’ stretches the surface into its tangent plane. 


EXAMPLE 4 The area of a triangle is A = \ab sin 9. Find the total differential dA. 


Solution The base has length b and the sloping side has length a. The angle between 
them is 6. You may prefer A = \bh, where h is the perpendicular height a sin 9. Either 
way we need the partial derivatives. If A = \ab sin 0, then 


dA 

da 


= -b sin 9 
2 


dA 

~db 


= -a sin 9 
2 


dA 

d9 


= -ab cos 9. 
2 


(ID 
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These lead immediately to the total differential dA {like a product rule): 

dA = + (~ Vfc + ( sin 9 da + ]-a sin 9 db + ]rab cos 9 d9. 

\daj \dbj \o9 ) 2 2 2 

EXAMPLE 5 The volume of a cylinder is V = nr 2 h. Decide whether V is more sensitive 
to a change from r = 1.0 to r = 1 J or from h — 1*0 to h — 1. 1. 

Solution The partial derivatives are dVjdr-2nrh and dVidh = nr 2 . They measure 
the sensitivity to change . Physically, they are the side area and base area of the 
cylinder. The volume differential dV comes from a shell around the side plus a layer 
on top: 

dV- shell 4- layer = 2nrh dr + nr z dh. (12) 

Starting from r = /i = 1, that differential is dV = 2ndr + ndh. With dr-dh-A, the 
shell volume is ,2 n and the layer volume is only An. So V is sensitive to dr. 

For a short cylinder like a penny, the layer has greater volume. V is more sensitive 
to dh. In our case V = nr 2 h increases from tt( 1) 3 to rc(l.l) 3 , Compare A V to dV\ 

AV = it(l.l) 3 - n( l) 3 = 331 ti and dV= + 7t(.l) - .300*. 

The difference is AV- dV= .03 1 tt. The shell and layer missed a small volume in 
Figure 13.9, just above the shell and around the layer. The mistake is of order 
(dr) 1 + (dh) 2 . For V = nr 2 K the differential dV= 2 nrh dr + nr 2 dh is a linear approxima- 
tion to the true change AK We now explain that properly. 

LINEAR APPROXIMATION 

Tangents lead immediately to linear approximations. That is true of tangent planes as 
it was of tangent lines. The plane stays dose to the surface, as the line stayed close 
to the curve. Linear functions are simpler than /(x) or f(x y y) or F(x, y, z). Alt we 
need are first derivatives at the point. Then the approximation is good near the point. 

This key idea of calculus is already present in differentials. On the plane, ^/'equals 
f x dx+fydy. On the curved surface that is a linear approximation to A/: 


13C The linear approximation to /(x, y) near the point (x 0> y 0 ) is 

f(x> y) */(*o> >’o) + “ *o) + (^) 0 ^ ” y °)‘ ( 1 3) 


In other words Af ^f x Ax + f y Ay y as proved in Problem 24. The right side of (13) 
is a linear function / L {x, y). At {x 0 , y 0 ), the functions / and f L have the same slopes. 
Then/(x, y) curves away from f L with an error of “second order:"' 

I /(X, y) -fdx, y)\ M{(X - x 0 ) 2 + (y - .Vo) 2 ]- (14) 

This assumes that f xx ,f xy , and f yy are continuous and bounded by M along the line 
from (x 0 ,y 0 ) to (■** y ). Example 3 of Section 13*5 shows that | ^ 2 M along that line. 

A factor j comes from equation 3.8.12, for the error/-/ t with one variable. 

For the volume of a cylinder, r and h went from 1.0 to 1.1. The second derivatives 
of V — nr 2 h are V n = 2nh and V rk = 2nr and V hh = 0, They are below M = 2.2n. Then 
(14) gives the error bound 1 2 + .l 2 ) = .044rc, not far above the actual error .03 In. 
The main point is that the error in linear approximation comes from the quadratic 
terms — those are the first terms to be ignored by f L . 
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layer dh 
area nr 1 


shell dr 


Fig. 13.9 Shell plus layer gives dV= JQOti. 

Including top ring gives AV = 33\n. 



EXAMPLE 6 Find a linear approximation to the distance function r = y jx 2 + y 2 . 

Solution The partial derivatives are xjr and yjr. Then Ar ^(x/r)Ax + (y/r)Ay* 

For (x, y, r) near (1, 2, v"5): Jx 1 + y 2 a JVTT 2 + (x- 1 )jjs + 2(y - 2)/ v /5. 

If y is fixed at 2, this is a one-variable approximation to ^ Jx 2 + 2 2 . If x is fixed at 1, 
it is a linear approximation in y. Moving both variables, you might think dr would 
involve dx and dy in a square root. It doesn’t* Distance involves x and y in a square 
root, but change of distance is linear in Ax and Ay — to a first approximation. 

There is a rough point at x = 0, y = 0. Any movement from (0,0) gives Ar = 
*J{ Ax) 2 + (Ay)^. The square root has returned* The reason is that the partial deriva- 
tives xjr and yjr are not continuous at (0,0). The cone has a sharp point with no 
tangent plane. Linear approximation breaks down. 

The next example shows how to approximate A z from Ax and Ay, when the 
equation is F(x, y, z) = c. We use the implicit derivatives in (7) instead of the explicit 
derivatives in (1)* The idea is the same: Look at the tangent equation as a way to 
find Az, instead of an equation for z. Here is Example 6 with new letters. 

EXAMPLE 7 From F= - x 2 - y 2 + z 2 = 0 find a linear approximation to Az. 

Solution (implicit derivatives) Use the derivatives of F: — 2xAx — 2yAy + 2zAz ^ 0. 
Then solve for Az, which gives Az % (x/z)Ax + (y/z)Ay — the same as Example 6* 

EXAMPLE 8 How does the equilibrium price change when the supply curve changes? 

The equilibrium price is at the intersection of the supply and demand curves 
(supply = demand). As the price p rises, the demand q drops (the slope is “ *2): 

demand line DD: p = - .2q + 40* (15) 

The supply (also q) goes up with the price. The slope 5 is positive (here s = .4): 

supply line SS: p = sq + t = Aq+ 10. 

Those lines are in Figure 13.10. They meet at the equilibrium price P — £30. The 
quantity Q = 50 is available at P (on SS) and demanded at P(on DD )* So it is sold* 
Where do partial derivatives come in? The reality is that those lines DD and SS 
are not fixed for all time. Technology changes, and competition changes, and the 
value of money changes. Therefore the lines move. Therefore the crossing point ( Q , P) 
also moves. Please recognize that derivatives are hiding in those sentences* 
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Main point: The equilibrium price P is a function of s and t . Reducing s by better 
technology lowers the supply line to p = 2 q + 10. The demand line has not changed. 
The customer is as eager or stingy as ever. But the price P and quantity Q are 
different. The new equilibrium is at Q = 60 and P = $28, where the new line XX 
crosses DD. 

If the technology is expensive, the supplier will raise t when reducing s. Line YY 
is p = .3 q + 20. That gives a higher equilibrium P — $32 at a lower quantity Q — 40 — 
the demand was too weak for the technology. 

Calculus question Find dPjds and dPjdt. The difficulty is that P is not given as 
a function of s and f. So take implicit derivatives of the supply = demand equations: 

supply = demand: P = - 2 Q + 40 = sQ + t (16) 

s derivative: P s = - 2Q S = sQ s + Q {note £ s = 0) 

r derivative: P t - - 2Q t - $Q t + 1 (note t t = 1) 

Now substitute s = A f f = 10, P — 30, Q = 50. That is the starting point, around which 
we are finding a linear approximation. The last two equations give P s = 50/3 and 
P t = 1/3 {Problem 25), The linear approximation is 

P = 30 + 50(s - .4)/3 + (£ - 10)/3. (17) 

Comment This example turned out to be subtle (so is economics). I hesitated before 
including it. The equations are linear and their derivatives are easy, but something 
in the problem is hard — there is no explicit formula for P. The function P(s, f) is not 
known. Instead of a point on a surface, we are following the intersection of two lines. 
The solution changes as the equation changes. The derivative of the solution comes from 
the derivative of the equation. 

Summary The foundation of this section is equation (1) for the tangent plane. Every- 
thing builds on that — total differential, linear approximation, sensitivity to small 
change. Later sections go on to the chain rule and “directional derivatives” and 
“gradients,” The central idea of differential calculus is A / & f x Ax + / y Ay. 

NEWTON S METHOD FOR TWO EQUATIONS 

Linear approximation is used to solve equations. To find out where a function is zero, 
look first to see where its approximation is zero. To find out where a graph crosses 
the xy plane, look to see where its tangent plane crosses. 

Remember Newton's method for /{x) = 0. The current guess is x n . Around that 
point, yfx) is close to f(x n ) + (x — x a )f*(x tt )> This is zero at the next guess x n + t — 
x n ~ That is where the tangent line crosses the x axis. 

With two variables the idea is the same — but two unknowns x and y require two 
equations. We solve g(x, y) = 0 and /?(x, y) — 0. Both functions have linear approxi- 
mations that start from the current point (x„, yj — where derivatives are computed: 

y) ~ ?(*„, v„) + (c-gi8x)(x - x„) + (8gl8y)(y - y„) 

(18) 

h(x, y) as h{x n , y„) + (dhidx)(x - xj + {dhjcy){y - >'„)■ 

The natural idea is to set these approximations to zero ♦ That gives linear equations 
for x — x„ and y — y n . Those are the steps Ax and Ay that take us to the next guess 
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in Newton’s method: 


13D Newton’s method to solve g[x, y) = 0 and h(x t y) = 0 has linear equations 
for the steps Ax and Ay that go from (x„, y„) to (x n + 1 , y„ + 1 ): 


(to) A, + ( ««d (|)4 x+(|)a, (19) 


EXAMPLE 9 g — x 3 — y — Oand h — y 3 — x = Ohave 3 solutions (1, 1), (0. 0), (— i, —1), 

I will start at different points (x 0 ,y Q ). The next guess is x 1 = x 0 + Ax, y x = y 0 + Ay. 
It is of extreme interest to know which solution Newton’s method will choose — -if it 
converges at ail. I made three small experiments. 

1. Suppose (x 0 , y 0 ) = (2, 1). At that point g — 2 3 - 1 = 7 and h = I 3 - 2 — — L The 
derivatives are g x = 3x 2 = 12, g y = — 1, h x =■ — 1, h y = 3 y 2 — 3. The steps Ax and Ay 
come from solving (19): 

12Ax — Ay - - 7 Ax = - 4/7 x x ~ x 0 + Ax - 10/7 

“Ax + 3Ay=+l Ay=+l/7 y x — y 0 + Ay = 8/7. 

This new point (10/7, 8/7) is closer to the solution at (1, 1). The next point is (1.1, 
1.05) and convergence is clear. Soon convergence is fast. 

2. Start at (x 0 > y 0 ) = ({, 0). There we find g = 1/8 and h — - 1/2: 

(3/4)Ax - Ay = - 1/8 Ax = — 1/2 x x = x 0 + Ax - 0 

- Ax + OAy = + 1/2 Ay — + 1/4 y x = y 0 + Ay = — 1/4. 

Newton has jumped from (?> 0) on the x axis to (0, — J) on the y axis. The next step 
goes to (1/32, 0), back on the x axis. We are in the “basin of attraction” of (0, 0). 

3. Now start further out the axis at (1, 0), where g = 1 and h= — 1: 

3Ax — Ay= — I Ax= — 1 x l = x o + Ax = 0 

=> => 

- Ax + OAy = + 1 Ay = - 2 y t - y 0 + Ay = - 2 , 

Newton moves from (1,0) to (0, -2) to (16, 0). Convergence breaks down— the 
method blows up. This danger is ever-present, when we start far from a solution. 

Please recognize that even a small computer will uncover amazing patterns. It can 
start from hundreds of points (x 0 ,y 0 ), and follow Newton's method. Each solution 
has a basin of attraction , containing all (x 0 , y 0 ) leading to that solution. There is also 
a basin leading to infinity. The basins in Figure 13.11 are completely mixed together — 
a color figure shows them as fractals. The most extreme behavior is on the borderline 
between basins, when Newton can’t decide which way to go. Frequently we see chaos. 

Chaos is irregular movement that follows a definite rule. Newton’s method deter- 
mines an iteration from each point (x H ,yJ to the next. In scientific problems it 
normally converges to the solution we want. (We start close enough.) But the com- 
puter makes it posible to study iterations from faraway points. This has created a 
new part of mathematics — so new that any experiments you do are likely to be 
original. 
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Section 3 J found chaos when trying to solve x 2 + 1 = 0. But don’t think Newton’s 
method is a failure. On the contrary, it is the best method to solve nonlinear equations. 
The error is squared as the algorithm converges, because linear approximations have 
errors of order (Ax) 2 + (Ay) 2 * Each step doubles the number of correct digits, near 
the solution. The example shows why it is important to be near. 



Fig. 13.11 The basins of attraction to (1 T 1), (0, 0), (—1, —1), and infinity. 


13,3 EXERCISES 


Read-through questions 

The tangent line to y—f(x) is v — y n — ° The tangent 

plane to w = /(x, y) is w — w 0 = b . The normal vector is 
N = c For w = x 3 + y 3 the tangent equation at (t, 1,2) 
is d The normal vector is N = » . For a sphere, the 

direction ofN is f 

The surface given implicitly by F(x, y, z) = c has tangent 
equation {dFIdx^x — x n f 4- 0 . For xyz — 6 at (1,2,3) 

the tangent plane is h . On that plane the difTerent Lais 
satisfy [ dx 4* 1 dy 4- k dz — 0. The differential 

of z —J{x f y) is dz = 1 This holds exactly on the tangent 

plane, while Az ^ rn holds approximately on the n , 
The height z = 3x + 7y is more sensitive to a change in o 
than in x, because the partial derivative p is larger than 
g . 

The linear approximation to /(x, y) is /Tx n * y n ) + r 
This is the same as A/ ^ * Ax 4- * Ay. The error is 

of order u For / = sin xy the linear approximation 
around (0, 0) is f, — v . We are moving along the w 
instead of the * . When the equation is given as 

F(x, y, z) = c, the linear approximation is v Ax + 
* Ay + a Az ~ 0. 

Newton’s method solves g(x, y) — 0 and /i(x, y) = 0 by a 
B approximation. Starting from x„, y„ the equations are 
replaced by c and t> . The steps Ax and Ay go to the 


next point Each solution has a basin of Those 

basins are likely to be & . 

In 1-8 find the tangent plane and the normal vector at P, 

1 z = yjc 2 +/,P={0, 1, 1) 

2 x + y + z = 17, .P = (3, 4, 10) 

3 z = xjy, P = (6, 3, 2) 

4 z = e x * 2y , P = (0,0, 1) 

5 x 2 + y 2 + z 2 = 6,P = (U2, 1) 

6 x 2 + y 2 + 2z 2 = 7, P = (1, 2, 1) 

7 z = x y , J* = <1, 1. 1) 

8 V = nr 2 h, P = (2,2, 8s). 

9 Show that the tangent plane to z 2 — x 2 — y 2 — 0 goes 
through the origin and makes a 45° angle with the 2 axis* 

10 The planes z = x + 4y and z = 2x + 3y meet at (1, 1, 5). 
The whole line of intersection is (x, y, z)=(l, 1, 5) + W, 
Find v = N { x N 2 . 

11 If z — 3x — 2y find dz from dx and dy. If z = x 3 /y 2 find dz 
from dx and dy at x 0 — 1, y 0 = 1. If x moves to LG2 and y 
moves to 1.03, find the approximate dz and exact Az for both 

functions. The first surface is the to the second 

surface. 
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12 The surfaces z = x 2 -h 4y and z = 2x 4- 3y 2 meet at (1, 1, 5). 
Find the normals Ni and N 2 and also v = Nj x N 2 . The line 
in this direction v is tangent to what curve? 

13 The normal IN to the surface F(x, y, z) = 0 has components 
F x , F yy F z+ The normal line has x — x 0 + F x t t y = y 0 + F^f, 

z = . For the surface xyz — 24 = 0, find the tangent 

plane and normal line at (4, 2, 3). 

14 For the surface x z y 2 — z = 0, the normal line at (f, 2,4) 

has x = , y = , z = , 

15 For the sphere x 2 + y 2 -\-z 2 = 9, find the equation of the 
tangent plane through (2, 1, 2), Also find the equation of the 
normal line and show that it goes through (0, 0, 0). 

16 If the normal line at every point on F(x, y, z) = 0 goes 
through (0, 0, 0), show that F x = ex, F y = cy , F z = cz. The sur- 
face must be a sphere. 

17 For w = xy near (x 0 , y 0 ), the linear approximation is dw = 

, This looks like the _____ rule for derivatives. 

The difference between Aw = xy — x 0 y 0 and this approxima- 
tion is . 

18 If/= xyz (3 independent variables) what is dp. 

19 You invest P = S4000 at R = 8% to make / = $320 per 
year. If the numbers change by dP and dR what is d/? If the 
rate drops by dR = .002 (to 7.8%) what change dP keeps dl = 
0? Find the exact interest / after those changes in R and P. 

20 Resistances R l and R 2 have parallel resistance ft, where 
MR = 1/Ri + l/ft 2 > Is R more sensitive to A Jli or Aft 2 if R^ = 
1 and R 2 = 2? 

21 (a) If your batting average is A = (25 hits)/(100 at bats) = 
.250, compute the increase (to 26/101} with a hit and the 
decrease (to 25/101} with an out. 

(b) If A = x/y then dA = dx + dy. A hit 

(dx = dy= 1) gives dA = (1 — A)jy\ An out (dy = 1 ) gives 

dA = — Ajy* So at A -- .250 a hit has times the 

effect of an out. 

22 (a) 2 hits and 3 outs (dx = 2, dy = 5) will raise your average 

(dA > 0) provided A is less than . 

(b) A player batting A = .500 with y = 400 at bats needs 
dx = hits to raise his average to .505. 

23 If x and y change by Ax and Ay, find the approximate 
change A9 in the angle 9 == tan " *(yjx). 

24 The Fundamental Lemma behind equation (13) writes 
Af=aAx + bAy. The Lemma says that a -*-/ JC (x 0 , y 0 ) and 
b -*f y (x 0 , y 0 ) when Ax 0 and Ay 0, The proof takes Ax 
first and then Ay: 

(1) /(*o + 4jc.> , o)-A*o.>'o) = A*/] [ (c-3'o) wtl ere c is 

between and (by which theorem?} 

(2) /(x 0 + Ax, y 0 + Ay) -f(x Q + Ax, y 0 ) = A>/ r (x 0 + Ax, C) 

where C is between and . 


(3 ) ct=f x (c y provided/* is . 

(4) b=/ v (x 0 + Ax, C)-//x 0 ,y 0 ) provided f y is . 

25 If the supplier reduces j, Figure 13.10 shows that P 

decreases and Q , 

(a) Find P x = 50/3 and P t = 1/3 in the economics equation 
(17) by solving the equations above it for Q t and Q {t 

(b) What is the linear approximation to Q around $ = .4, 
r = 10, P = 30, Q = 50? 

26 Solve the equations P = — .2Q + 40 and P - sQ + f for P 
and Q. Then find SP/ds and dP/dt explicitly. At the same 
s, t, P, Q check 50/3 and 1/3, 

27 If the supply = demand equation (16) changes to P - 
sQ 4 - 1 = - Q + 50, find P s and P f at s = 1, t = 10. 

28 To find out how the roots of x 2 + bx + c ■ = 0 vary with b, 
take partial derivatives of the equation with respect to 

. Compare dxfoh with cxjdc to show that a root at 
x = 2 is more sensitive to b. 

29 Find the tangent planes to z = xy and z = x 2 — y 2 at x = 
2, y = 1. Find the Newton point where those planes meet the 
xy plane (set z = 0 in the tangent equations). 

30 (a) To solve g(x, y) = 0 and /?(x, y) = 0 is to find the meeting 
point of three surfaces: z=g(x y y) and z = /i(x,y) and 


(b) Newton finds the meeting point of three planes: the 
tangent plane to the graph of#, , and 

Problems 31-36 go further with Newton’s method for 
x* — y and /i = y 3 — x. This is Example 9 with solutions ( 1 , 1 ), 

(0, 0),(-l, -1). 

31 Start from = 1, y 0 = 1 and find Ax and Ay, Where are 
Xi and y n and what line is Newton's method moving on? 

32 Start from (£, and find the next point. This is in the 
basin of attraction of which solution? 

33 Starting from (a, —a) find Ay which is also — Ax, Newton 
goes toward (0,0). But can you find the sharp point in 
Figure 13.11 where the lemon meets the spade? 

34 Starting from (n, 0) show that Newton’s method goes to 
(0, — 2a*) and find the next point (x 2 ,y 2 ). Which numbers a 
lead to convergence? Which special number a leads to a cycle, 
in which (x 2 , y 2 ) is the same as the starting point (n, 0)? 

35 Show that x 2 = y\ y 3 = x has exactly three solutions. 

36 Locate a point from which Newton’s method diverges. 

37 Apply Newton's method to a linear problem: g = 
x + 2y — 5 = 0, h = 3x — 3 = 0. From any starting point show 
that (x ! , >'i ) is the exact solution (convergence in one .ste/?). 
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38 The complex equation (x 4 iy) 3 = 1 contains two real equ- 
ations, x 3 — 3xy 2 — 1 from the real part and 3x 2 y — y 3 = 0 
from the imaginary part. Search by computer for the basins 
of attraction of the three solutions (1, 0), (-1/2, ,^3/2), and 
(—1/2, —y/3}2 ) — which give the cube roots of L 

39 In Newton's method the new guess comes from (x n , y„) by 
an iteration: x„ +l = G(x*, y K } and y n+ 1 = H{x nt y„). What are 
G and H for g = x 2 — y — 0, h — x — y = 0? First find Ax and 
Ay; then x„ 4 Ax gives G and y n 4 Ay gives H. 

40 In Problem 39 find the basins of attraction of the solution 
(0, 0) and (1, 1), 


41 The matrix in Newton's method is the Jacobian: 


J = 


dgldx dgfdy 
dhjcx dhidy 


and 


Ax 


|_AyJ 


-g 


Find J and Ax and Ay for g = e x — 1, h = e v 4 x. 


a 

n. 


42 Find the Jacobian matrix at (1, 1) when g = x 2 y 1 and 

h = xy, This matrix is and Newton's method fails. 

The graphs of g and h have tangent planes. 

43 Solve g = x 1 — y 1 -i- 1 = 0 and h = 2xy = 0 by Newton's 

method from three starting points: {0, 2) and(— 1, l)and (2, 0). 
Take ten steps by computer or one by hand. The solution 
(0, 1) attracts when y 0 > 0. If y 0 = 0 you should find the chaos 
iteration x„ +1 = i(x fl - 1 )■ 



13.4 Directional Derivatives and Gradients 



As x changes, we know how /(x, y) changes. The partial derivative cfjdx treats y as 
constant. Similarly dfjdy keeps x constant, and gives the slope in the y direction. But 
east-west and north-south are not the only directions to move. We could go along a 
45° line, where Ax = Ay . In principle, before we draw axes, no direction is preferred. 
The graph is a surface with slopes in all directions. 

On that surface, calculus looks for the rate of change (or the slope). There is a 
directional derivative , whatever the direction. In the 45 Q case wc arc inclined to divide 
A/ by Ax, but we would be wrong. 

Let me state the problem. We are given f(x t y) around a point P = (x 0j y 0 ). We are 
also given a direction u (a unit vector). There must be a natural definition of D u f— 
the derivative off in the direction u. To compute this slope at P, we need a formula. 
Preferably the formula is based on cfjdx and cfjdy t which we already know. 

Note that the 45° direction has u = i/^/2 + j/^/2. The square root of 2 is going to 
enter the derivative. This shows that dividing A/ by Ax is wrong. We should divide 
by the step length As. 


EXAMPLE 1 Stay on the surface z = xy. When (x, y ) moves a distance As in the 45° 
direction from (1, 1), what is Az/As? 

Solution The step is As times the unit vector u. Starting from x = y = l the step 
ends at x = y = 1 4 Asj^/2. (The components of uAs are Asjy/l.) Then z = xy is 

i — (1 4 Asjy/l) 2 = 1 4 yJlA s 4 j(As) 2 , which means Az = % /2As + i(As) 2 + 

The ratio A zjAs approaches v /2 as As ->0. That is the slope in the 45° direction. 


DEFINITION The derivative of f in the direction u at the point P is D^fiP): 


n r A/ r f(P + uAs)-f{P) 

DJ(P) = hm — = lim . 

Ai-o As As — Cl As 


( 1 ) 


The step from P = (x 0 , y 0 ) has length As. It takes us to (x 0 4 u 1 As. y 0 4 u 2 As). We 
compute the change Af and divide by As, But formula (2) below saves time. 
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The x direction is u = (1, 0). Then ti As is (As, 0) and we recover dfjdx : 

Af_A*o + y 0 )-/(*o, yo) , lnnrnn . n Sf 

s approaches 

Similarly D„f=8f/3y, when u = (0, 1) is in the y direction. What is D a f when u = 
(0, - 1)? That is the negative y direction, so D„f = - df/dy. 

CALCULATING THE DIRECTIONAL DERIVATIVE 

D„f is the slope of the surface z = f(x, y) in the direction u. How do you compute it? 
From dfjdx and df/dy, in two special directions, there is a quick way to find D u f in 
all directions. Remember that u is a unit vector. 


13E The directional derivative DJ in the direction u = (u lt u 2 ) equals 

(2) 


The reasoning goes back to the linear approximation of A/: 

df A df 4 df A df 

Af *& 4,<+ 5 A, ’ = s“' 4s+ ^“ iA! ' 

Divide by As and let As approach zero. Formula (2) is the limit of A//A$, as the 
approximation becomes exact, A more careful argument guarantees this limit pro- 
vided f x andjy are continuous at the basepoint (x 0 ,y 0 ). 

Main point: Slopes in all directions are known from slopes in two directions , 

EXAMPLE 1 (repeated) /= xy and P= (1, 1) and u = (1/^/2, 1/^2). Find DJ(P). 
The derivatives f x = y and f y = x equal 1 at P. The 45° derivative is 

DJ{P) =f x u , +f y u 2 = 1(1 fd% + H1/V2) = y/l as before. 

EXAMPLE 2 The linear function /= 3x + y + 1 has slope £>„/= 3u t + u 2 . 

The x direction is u = (l,0), and D u /=3. That is dfjdx. In the y direction Z) u / = 1. 
Two other directions are special — along the level lines of f(x, y) and perpendicular: 

Level direction : D u f is zero because / is constant 

Steepest direction D u f is as large as possible (with uf + u| = 1), 

To find those directions, look at D u f = 3ui + u 2 - The level direction has 3^ + u 2 = 0. 
Then u is proportional to (1, - 3). Changing x by 1 and y by - 3 produces no change 
in/= 3x + y + 1. 

In the steepest direction u is proportional to (3, 1). Note the partial derivatives 
f x = 3 and f y = 1. The dot product of (3, 1) and (1, - 3) is zero — steepest direction 
is perpendicular to level direction. To make (3, 1) a unit vector, divide by v /t0. 

Steepest climb : DJ= 3(3 j^/lO) + l(l/ v /l0) = lO/^/iO = /t0 

Steepest descent: Reverse to u = (- 3/^/To, - 1 /^/TO) and D u f=—y/ 10. 

The contour lines around a mountain follow D„/ = 0. The creeks are perpendicular. 
On a plane like /=3x + y+l, those directions stay the same at all points 
(Figure 13.12). On a mountain the steepest direction changes as the slopes change. 
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Fig. 13.12 Steepest direction is along the gradient. Level direction is perpendicular 


THE GRADIENT VECTOR 


Look again at f x u t f/ v w 2 , which is the directional derivative D u f This is the dot 
product of two vectors . One vector is ti = (u tl u 2 )< which sets the direction. The other 
vector is (f x ,f y ), which comes from the function. This second vector is the gradient. 

rif 

DEFINITION The gradient of J\x, >') is the vector whose components are ~ and — : 


ef. df. 

grad 


f af \ 

add -f- k in three dimensions |. 

V dz J 


The space-saving symbol V is read as “grad.” In Chapter 15 it becomes “del.” 

For the linear function 3xF y + 1, the gradient is the constant vector (3, 1). It is 
the way to climb the plane. For the nonlinear function x 2 F xy, the gradient is the 
non-constant vector (2x F y, x). Notice that grad /shares the two derivatives in N = 
(/*»/*! ~1)< But gradient is not the normal vector. N is in three dimensions, 
pointing away from the surface z =/(x t y ). The gradient vector is in the xy plane! The 
gradient tells which way on the surface is up, but it does that from down in the base. 

The level curve is also in the xy plane, perpendicular to the gradient. The contour 
map is a projection on the base plane of what the hiker sees on the mountain. The 
vector grad / tells the direction of climb, and its length |grad/| gives the steepness. 


13F The directional derivative is D u f = (grad /) *u. The level direction is per- 
pendicular to grad / since D v f = 0. The slope DJ is lar gest wh en u is parallel to 
grad / That maximum slope is the length |grad/| = F / 2 : 

for u = gra ^{. the slope is (grad/) • u = = |grad/|. 

Igrad /| |grad/| 


The example /= 3xF y F 1 had grad/= (3. 1). Its steepest slope was in the direc- 
tion u — (3, Ij/ylO, The maximum slope was v /l0. That is lgrad/1 = v "'"9 + 1 . 

Important point: The maximum of (grad /)*u is the length |grad / 1. In nonlinear 
examples, the gradient and steepest direction and slope will vary. But look at one 
particular point in Figure 13.13, Near that point, and near any point, the linear 
picture takes over. 

On the graph of / the special vectors are the level direction L — (/., —f x> 0) and 
the uphill direction U - (//,/ F/ 2 ) and the normal N = (/ x ,/ y , - 1). Problem 18 
checks that those are perpendicular. 
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EXAMPLE 3 The gradient of J\x 7 y) = (14 — x 1 - y 2 )/ 3 is V/= (- 2x/3, - 2y/3). 

On the surface, the normal vector is N = (— 2xj3, — 2y/ 3, “1). At the point (1, 2, 3), 
this perpendicular is N = (-2/3, -4/3, -1). At the point (1,2) down in the base, 
the gradient is (— 2/3, -4/3), The length of grad /is the slope y/20/3. 

Probably a hiker does not go straight up* A “grade” of ^/20/3 is fairly steep (almost 
150%/ To estimate the slope in other directions, measure the distance along the path 
between two contour lines* If Af= 1 in a distance As = 3 the slope is about 1/3. This 
calculation is not exact until the limit of Af/As , which is D u f. 




Fig, 13.13 N perpendicular to surface and grad / perpendicular to level line (in the base). 


EXAMPLE 4 The gradient of /(x, y, z) = xy 4 yz 4 xz has three components. 

The pattern extends from /(x, y) to /(x, y, z). The gradient is now the three-dimensional 
vector For this function grad /is (y + z, x 4 z, x 4 y). To draw the graph 

of w=J[x y y, z) would require a four-dimensional picture, with axes in the xyzw 
directions* 

Notice the dimensions. The graph is a 3-dimensional “surface” in 4-dimensional 
space. The gradient is down below in the 3-dimensional base* The level sets of / come 
from xy yz zx = c — they are 2-dimensional* The gradient is perpendicular to that 
level set (still down in 3 dimensions). The gradient is not N! The normal vector is 
—1), perpendicular to the surface up in 4-dimensional space. 

EXAMPLE 5 Find grad z when z(x, y) is given implicitly: y, z) = x 2 4 y 2 — z 2 = 0. 

In thi s case we find z = 4 ^Jx 2 ~+ y / The derivatives are ± x/\/x 2 ~+ y 2 and 
± y!^fx 2 ~+ y 2 , which go into grad z. But the point is this: To find that gradient faster, 
differentiate F(x, y, z) as it stands. Then divide by F z : 

F x dx 4 F y dy 4 F z dz = 0 or dz = (— F x dx - F y dy)/F z . (3) 

The gradient is (—F x /F z , — FJF Z ), Those derivatives are evaluated at (x 0 , y 0 ). The 
computation does not need the explicit function z =/(x, y): 

F = x 2 4 y 2 - z 2 => F x = 2x, F y = 2 y, F z = — 2z => grad z = (xjz t yjz). 

To go uphill on the cone, move in the direction (x/z, y/z). That gradient direction 
goes radially outward* The steepness of the cone is the length of the gradient vector: 


[grad z| = ^/{xjz) 2 4 {yjz) 2 = t because z 1 = x 2 4 y 1 on the cone. 
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DERIVATIVES ALONG CURVED PATHS 


On a straight path the derivative of /is D u f= (grad /) - it. What is the derivative on 
a curved path? The path direction u is the tangent vector T. So replace u by T, which 
gives the “direction 1 ' of the curve. 

The path is given by the position vector R(t) = x(r)i -r y(r)j. The velocity is \ = 
(dxjdt)i + (dyjdt) j. The tangent vector is T = y/|y]. Notice the choice — to move at any 
speed (with v) or to go at unit speed (with T). There is the same choice for the 
derivative of/(x, y) along this curve: 

df dfdx dfdy 

rate of change — = (grad /) * v = — — — — (4) 

dt ox dt cy dt 


slope 


£ = (grad/)*T = 


q/'dx qfdy 
ox ds 8yds 


( 5 ) 


The first involves time. If wc move faster, dfjdt increases. The second involves distance. 
If we move a distance ds, at any speed, the function changes by df. So the slope in 
that direction is dfjds. Chapter 1 introduced velocity as dfjdt and slope as dy/dx and 
mixed them up. Finally we see the difference. 

Uniform motion on a straight line has R “ R 0 4- vf. The velocity v is constant. The 
direction T = u = v/|y| is also constant. The directional derivative is (grad /)-u, but 
the rate of change is {grad /) *v. 


Equations (4) and (5) look like chain rules. They are chain rules. The next section 
extends dfjdt = (dfjdx){dxjdt) to more variables, proving (4) and (5). Here we focus 
on the meaning: dfjds is the derivative of f in the direction u = T along the curve . 


EXAMPLE 7 Find dfjdt and dfjds for / = r, The curve is * — t 2 , y = t in Figure 13.14a. 

Solution The velocity along the curve is v = 2*i + j. At the typical point t - 1 it is 
v = 2i + j. The unit tangent is T = v/^/5. The gradient is a unit vector i/^/2 + j/^/2 
pointing outward, when f{x, y) is the distance r from the center. The dot product 
with v is dfjdt = 3/^/2, The dot product with T is dfjds = 3/^/To. 

When we slow down to speed 1 (with T), the changes in f(x t y) slow down too. 


EXAMPLE 6 Find dfjds for f=xy along the circular path x = cos t, y = sin t. 

First take a direct approach. On the circle, xy equals (cos f)(sin f). Its derivative comes 
from the product rule: dfjdt = cos 2 * - sin 2 r. Normally this is different from dfjds , 
because the time * need not equal the arc length s. There is a speed factor dsjdt to 
divide by — but here the speed is 1 . {A circle of length $ = 2n is completed at t = 2tt.) 
Thus the slope dfjds along the roller-coaster in Figure 13T4 is cos 2 * - sin 2 *. 





Fig. 13,14 The distance f=r changes along the curve. The slope of the roller-coaster is 
(grad/) * T The distance D from v'o ) has grad D = unit vector. 
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The second approach uses the vectors grad / and T. The gradient of f=xy is 
( y , x) = (sin t % cos t). The unit tangent vector to the path is T = (-sin t, cos r). Their 
dot product is the same dfjds: 

slope along path = (grad /) *T = — sin 2 t + cos 2 t. 

GRADIENTS WITHOUT COORDINATES 

This section ends with a little “philosophy." What is the coordinate-free definition of 
the gradient? Up to now, grad /= { f x J y ) depended totally on the choice of x and y 
axes. But the steepness of a surface is independent of the axes. Those are added later, 
to help us compute. 

The steepness dfjds involves only / and the direction, nothing else. The gradient 
should be a “tensor" — its meaning does not depend on the coordinate system. The 
gradient has different formulas in different systems (xy or rd or ...), but the direction 
and length of grad / are determined by dfjds — without any axes: 

The direction of grad /is the one in which dfjds is largest. 

The length |grad/| is that largest slope. 

The key equation is (change inf) * (gradient of f) -(change in position). That is another 
way to write A f » fjAx +fyAy. It is the multivariable form — we used two variables — 
of the basic linear approximation Ay » (dy/dx)Ax, 

EXAMPLE 9 D(x> y) = distance from (x, y) to (x 0 ,y 0 )- Without derivatives prove 
|grad D\ = 1. The graph of D(x, y) is a cone with slope 1 and sharp point (x 0 , y 0 ). 

Rrst question In which direction does the distance D(x, y) increase fastest? 

Answer Going directly away from (x 0 , y 0 ). Therefore this is the direction of grad D. 

Second question How quickly does D increase in that steepest direction? 

Answer A step of length As increases D by As. Therefore |grad D\ — As/As “ 1. 

Conclusion grad D is a unit vector . The derivatives of D in Problem 48 are 
(x-x 0 )/D and (y — y 0 )/D. The sum of their squares is 1, because (x-x 0 ) 2 + 
(y~ yo) 2 equals D 1 . 


13.4 EXERCISES 


Read-through questions 

D B /g ives the rate of change of o in the direction b . 
It can be computed from the two derivatives c in the 
special directions ,, d In terms of u ly u 2 the formula is 
D r f= * . This is a t product of u with the vector 

fl which is called the h For the linear function / = 
ax + by, the gradient is arad f = l and the directional 
derivative is D 9 f = I •• k . 

The gradient V/— (f x ,f y ) is not a vector in I dimen- 
sions, it is a vector in the m . It is perpendicular to the 
n lines. It points in the direction of o climb. Its 
magnitude |grad /[ is p . For /^ jc 2 + y 2 the gradient 
points q and the slope in that steepest direction is r 


The gradient of J\x t y , z) is » . This is different from the 

gradient on the surface y, z) =* 0, which is — (F x /F z )i + 
t Traveling with velocity v on a curved path, the rate 
of change of/ is dfldt = u . When the tangent direction 
is T, the slope of/ is dfjds = v In a straight direction u,’ 
dfjds is the same as w . 


Compute grad / then D 9 f 

1 J[x,y) = x 2 -y 2 

2 J[x, y) = 3x + 4y + 7 

3 J\x y y) — e* cos y 
4 fix, y) = y 10 


(grad /) * u, then D 9 f at P . 
u = (73/2, 1/2) P = (1, 0) 
u = (3/5, 4/5) P - (0, n/2) 
u = (0, 1) P = (0, «/2) 

u = (0, -1) P = ( 1,-1) 
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$J{x 7 y) = distance to (0,3) u = (l, 0) P = (l 1) 

Find grad / = {f xt f yt f t ) for the functions 6-3 from physics. 

6 l//x 2 + y 2 + z 2 (point source at the origin) 

7 In(x 2 + y 2 ) (line source along z axis) 

8 l/yj{x - 1 ) 2 + y 2 + z 2 - l//(x + 1 ) 2 + y 1 + z 1 (dipole) 

9 For 3x 2 + 2 y 2 find the steepest direction and the level 
direction at (1, 2). Compute D a f in those directions. 

10 Example 2 claimed that /= 3x + y + 1 has st eepest slope 
yiO. Maximize D u f=3ui + u 2 = 3iii H-^/l — uf . 

11 True or false, when/(x T y ) is any smooth function: 

(a) There is a direction u at P in which D 9 f = 0, 

(b) There is a direction u in which D u f = grad / 

(c) There is a direction u in which D m f = 1. 

(d) The gradient of f(x)g(x) equals g grad/+/grad g. 

12 What is the gradient of/(x)? (One component only.) What 
are the two possible directions u and the derivatives D a fl 
What is the normal vector N to the curve y=fx )? (Two 
components.) 

In 13-16 find the direction u in which /increases fastest at P = 
(1, 2). How fast? 

13 /(x, y) = ax + by 14 j[x, y) = smaller of 2x and y 
15 fix, y) = e*~ y 16 fix, y) = y/5-x 2 - y 2 (careful) 

17 (Looking ahead) At a point where J{x, y) is a maximum, 
what is grad /? Describe the level curve containing the maxi- 
mum point (x, y)* 

18 (a) Check by dot products that the normal and uphill and 
level directions on the graph are perpendicular: N = 

-nu = (f x j y j 2 +f 2 y \i J = (U 0). 

(b) N is to the tangent plane, U and L are 

to the tangent plane. 

(c) The gradient is the xy projection of and also 

of , The projection of L points along the 

19 Compute the N, U, L vectors for /= 1 — x + y and draw 
them at a point on the flat surface. 


20 Compute N, U, L for x 2 4- y 2 — z 2 = 0 and draw them at 
a typical point on the cone. 

With gravity in the negative z direction, in what direction — U 
will water flow down the roofs 21-24? 

21 i = 2x (flat roof) 22 z = 4x — 3y (flat roof) 

23 i — yj 1 - x 2 — y 2 (sphere) 24 z = - Jx 1 + y 2 (cone) 

25 Choose two functions f[x , y) that depend only on x + 2y. 

Their gradients at (1, 1) are in the direction . Their 

level curves are . 

26 The level curve of/= y/x through (1, 1) is . The 

direction of the gradient must be . Check grad /. 

27 Grad/is perpendicular to 2i + j with length 1, and grad g 
is parallel to 2i + j with length 5. Find grad/ grad g,f and g ♦ 

28 True or false: 

(a) If we know grad /, we know / 

(b) The line x — y= — z is perpendicular to the plane z = 
x + y. 

(c) The gradient of z = x + y lies along that line. 

29 Write down the level direction u for 0 - tan " Vy/x) at the 
point (3, 4). Then compute grad 0 and check D m 0 = 0. 

30 On a circle around the origin, distance is As = rAO. Then 
dOjd$ = \jr. Verify by computing grad 0 and T and 
(grad 0) *T. 

31 At the point (2, 1,6) on the mountain z = 9 — x — y 2 t 

which way is up? On the roof z = x + 2y + 2, which way is 
down? The roof is to the mountain. 

32 Around the point (1, —2) the temperature T=e -Jfl-yl has 

AT*; Ax -h Ay. In what direction u does 

it get hot fastest? 

33 Figure A shows level curves of z = /(x, y). 

(a) Estimate the direction and length of grad / at P, Q, R. 

(b) Locate two points where grad / is parallel to i 4- y 

(c) Where is |grad f \ largest? Where is it smallest? 

(d) What is your estimate of z ma!l on this figure? 

(e) On the straight line from P to describe z and esti- 
mate its maximum. 
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34 A quadratic function ax 2 4 by 1 4 cx 4 dy has the gradi- 
ents shown in Figure B. Estimate a^b^c.d and sketch two 
level curves. 

35 The level curves of/[x,y) are circles around (1, 1). The 
curve f=c has radius 2c. What is /? What is grad / at (0, 0}? 

36 Suppose grad /is tangent to the hyperbolas xy = constant 
in Figure C, Draw three level curves of/tx, y). Is Igrad f\ larger 
at P or Q? Is ]grad f\ constant along the hyperbolas? Choose 
a function that could be/; x 1 4 y 2 > x 2 — y 2 , xy, x 2 y 2 . 

37 Repeat Problem 36, if grad /is perpendicular to the hyper- 
bolas in Figure C. 

38 If/= 0, 1, 2 at the points (0, 1), (1, 0), (2, 1), estimate grad/ 
by assuming /= Ax 4 By 4 C. 

39 What functions have the following gradients? 

(a) (2x 4 y, x) (b) -e*~ y ) (c) (y, -x) (careful) 

40 Draw level curves ofyfx, y) if grad / = (y y x). 

In 41-46 find the velocity v and the tangent vector T Then 
compute the rate of change df/dt =grad/*v and the slope 
df/ds =grad/'T + 

41 /= x 2 4 y 2 x = t y = t 2 


42 f=x 

x = cos 2t 

y = sin 2t 

43 f=x 2 -y 2 

x = x 0 4 2t 

y=>o + 3f 

II 

7 

X = t 2 4 1 

II 

45 /=ln xyz 

x = e f 

ri 

II 

46 f=2x 2 + 3y 2 + z 2 

x = t 

y=t 2 


47 (a) Find df/ds and dfjdt for the roller-coaster /= xy along 
the path x = cos 2f, y = sin 2 1. (b) Change to /= x 2 4 y 2 and 
explain why the slope is zero. 

48 The distance D from (x, y) to (1, 2) has D 2 = 
(x — l ) 2 4 (y — 2) 2 . Show that dD/dx = (x — l)/D and dD/dy — 

(y - 2 )/D and |grad D| = 1. The graph of D(x> y) is a 

with its vertex at * 

49 If /— 1 and grad / = (2, 3) at the point (4, 5), find the tan- 
gent plane at (4, 5). If/ is a linear function, find^x, y). 

50 Define the derivative of/(x,y} in the direction u — (u lt u 2 ) 
at the point P = (x 0 , y 0 ) t What is A/ (approximately)? What 
is D v f (exactly)? 

51 The slope of / along a level curve is df/ds = = 0, 

This says that grad /is perpendicular to the vector 

in the level direction. 
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Calculus goes back and forth between solving problems and getting ready for harder 
problems. The first is “application,” the second looks like “theory” If we minimize / 
to save time or money or energy, that is an application. If we don’t take derivatives 
to find the minimum — maybe because/is a function of other functions, and we don’t 
have a chain rule — 'then it is time for more theory. The chain rule is a fundamental 
working tool, because f[g{x}) appears all the time in applications. So do/(g(x, y)) and 
f(x (r), y(i)) and worse. We have to know their derivatives. Otherwise calculus can’t 
continue with the applications. 

You may instinctively say: Don’t bother with the theory, just teach me the formulas. 
That is not possible. You now regard the derivative of sin 2x as a trivial problem, 
unworthy of an answer. That was not always so. Before the chain rule, the slopes of 
sin 2x and sin x 2 and sin 2 x 2 were hard to compute from A//Ax. We are now at the 
same point for f[x, y). We know the meaning of df/dx , but if/ = r tan 8 and x = r cos 9 
and y = r sin 0, we need a way to compute dffdx . A little theory is unavoidable, if the 
problem-solving part of calculus is to keep going. 

To repeat: The chain rule applies to a function of a function. In one variable that 
was/lgC*))- With two variables there are more possibilities: 

1. f{z) with z = g(x, y) Find df/dx and df/dy 

2. J\x, y) with x = x(t), y = y(t) Find df/dt 

3- fix, y ) with x = x(r, u) t y = y(£, u) Find df/dt and dffdu 
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All derivatives are assumed continuous. More exactly, the input derivatives like 
dg/dx and dxjdt and 8xj8u are continuous. Then the output derivatives like dfjdx 
and dfjdt and dfjdu will be continuous from the chain rule. We avoid points like 
r - 0 in polar coordinates — where drjd x = xjr has a division by zero. 

A Typical Problem Start with a function of jc and y, for example jc times y. Thus 
J[ x, y) = xy. Change x to r cos 0 and y to t sin G. The function becomes (r cos 0) times 
(rsin 0). We want its derivatives with respect to r and 9. First we have to decide on 
its name . 

To be correct, we should not reuse the letter/. The new function can be F : 

/lx , y) = xy f(r cos 0, r sin G) = {r cos 0)(r sin 0) = F(r, 0). 

Why not call it /(r, 0)? Because strictly speaking that is r times 0! If we follow the 
rules, then /lx, y) is xy and/lr, 0) should be r0. The new function F does the right 
thing— it multiplies (r cos 0)(r sin 0). But in many cases, the rules get bent and the 
letter F is changed back to / 

This crime has already occurred. The end of the last page ought to say dFjdt. 
Instead the printer put dfjdt The purpose of the chain rule is to find derivatives in 
the new variables t and u (or t and 0). In our example we want the derivative of F 
with respect to r . Here is the chain rule: 

~T = T~ + = (y)(cos 6) + (*)(sin 9) = 2 r sin 6 cos 9. 

or dx or oyer 

1 substituted r sin 0 and r cos G for y and x. You immediately check the answer: 
F(j\ 0) = t 2 cos 0 sin 0 does lead to 8Fj8r ~ It cos 0 sin 0. The derivative is correct. 
The only incorrect thing — but we do it anyway — is to write /instead of F. 

_ , 5/_ A ¥ . df dx dfdy 

Question What is Answer It is + 7 --. 

38 dx 38 dy 38 

THE DERIVATIVES OF f(g( x, y)) 

Here g depends on x and y, and / depends on g. Suppose x moves by dx, while y 
stays constant. Then g moves by dg — (8gjdx)dx. When g changes,/ also changes: 
df= (df/dg)dg. Now substitute for dg to make the chain: df= (dfjdg)(dgjdx)dx. This 
is the first rule: 


13G Chain rule for Mx,y)): ^ 


4fdg # 

dg dx d 3y 


dgdy 


0 ) 


EXAMPLE 1 Every /lx + cy ) satisfies the 1-way wave equation dfjdy — c dfjdx. 

The inside function is g = x + cy. The outside function can be anything, g 2 or sin g 
or c*. The composite function is (x + cy) 2 or sin(x + cy) or e x+cy . In each separate 
case we could check that dfjdy = c dfjdx . The chain rule produces this equation in 
all cases at once, from dgjdx - 1 and dgj3y = c: 

dx dgdx dg dy dgdy dg dy dx' 

This is important: dfjdy = c dfjdx is our first example of a partial differential equation. 
The unknown /(x, y) has two variables. Two partial derivatives enter the equation. 
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Up to now we have worked with dyjdt and ordinary Sfferential equations. The 
independent variable was time or space (and only one dimension in space). For partial 
differential equations the variables are time and space (possibly several dimensions 
in space). The great equations of mathematical physics — heat equation, wave equa- 
tion, Laplace's equation — 'are partial differential equations. 

Notice how the chain rule applies to/= sin xy. Its x derivative is y cos xy. A patient 
reader would check that /is sin# and g is xy and f x is f g g x . Prohably you are not 
so patient — you know the derivative of sin xy. Therefore we pass quickly to the next 
chain rule. Its outside function depends on two inside functions, and each of those 
depends on t. We want dfldt. 

IHE DERIVATIVE OF /(/)) 

Before the formula, here is the idea. Suppose t changes by Ar. That affects x and y; 
they change by Ax and Ay, There is a domino effect on f; it changes by A/. Tracing 
backwards, 

df A i df * dx dy . 

A/ as — Ax + — Ay and Ax ^ — At and Ay ^ — A f . 
dx dy * dt dt 

Substitute the last two into the first, connecting A f to At. Then let Af 0: 


13H Chain rule for f[x{t\ y(t)): 


dt dx dt dy dt 1 


(3) 


This is close to the one-variable rule dzjdx — {dzjdy){dyjdx). There we could “cancer’ 
dy. (We actually canceled Ay in (Az/Ay)(Ay/Ax), and then approached the limit.) 
Now At affects A/ in two ways, through x and through y. The chain rule has two 
terms . If we cancel in {dfjdx){dxjdt) we only get one of the terms! 

We mention again that the true name for^xff), y(t)) is F(t) not /(f). For/(x, y, z) 
the rule has three terms: f x x t +f y y t + f 2 z t is/ (or better dFjdt). 

EXAMPLE 2 How quickly does the temperature change when you drive to Florida? 

Suppose the Midwest is at 30°F and Florida is at 80°F. Going 1000 miles south 
increases the temperature /(x, y) by 50°, or .05 degrees per mile. Driving straight south 
at 70 miles per hour, the rate of increase is (,05)(70) = 3,5 degrees per hour. Note how 
(degrees/mile) times (miles/hour) equals (degrees/hour). That is the ordinary chain rule 
(i ifjdx){dxjdt ) = (dfldt) — there is no y variable going south. 

If the road goes southeast, the temperature is /= 30+ ,05x + .01y. Now x(f) is 
distance south and y(t) is distance east. What is dfldt if the speed is still 70? 

Solution + 05™+. 01^*3 degrees/hour. 

dt dx dt dy dt Ji Ji 6 ' 

In reality there is another term. The temperature also depends directly on f, because 
of night and day. The factor cos(2jif/24) has a period of 24 hours, and it brings an 
extra term into the chain rule: 

_ „ \ L l - , . df df dx , df dy , of 

For f{x t y, t) the chant rule is f = / — + / -^ + T- (4) 

dt dx dt dy dt dt 

This is the total derivative dfjdt, from all causes. Changes in x, y, t all affect / The 

partial derivative dfldt is only one part of dfldt. (Note that dtldt - 1,) If night and 
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day add 12 cos(2ftt/24) to f, the extra term is dfjdt = — it sin(27rt/24). At nightfall that 
is -w degrees per hour. You have to drive faster than 70mph to get warm* 

SECOND DERIVATIVES 

What is d 1 fjdt 2e l We need the derivative of (4), which is painful. It is like acceleration 
in Chapter 12, with many terms* So start with movement in a straight line. 

Suppose x = x Q + t cos 8 and y = y 0 + t sin 8. We are moving at the fixed angle 0, 
with speed L The derivatives are x t = cos 8 and y t — sin 8 and cos 2 0 + sin 2 0 = 1* Then 
dfjdt is immediate from the chain rule: 

ft =f*x, +f y y, =f x COS 8 +f y sin 8, (5) 

For the second derivative apply this rule to Then f„ is 

(ft)x c °s 0 + {ft) y sin 0 = ( f xx cos 0 +f yx sin 0) cos 0 + (f xy cos 9 + f yy sin 0) sin 0. 

Collect terms: f It =f xx cos 2 0 + 2 f xy cos 0 sin 0+f yy sin 2 0* {6) 

In polar coordinates change t to r* When we move in the r direction, 9 is fixed* 
Equation (6) gives f rt from f xx ,f xy ,f yy . Second derivatives on curved paths (with new 
terms from the curving) are saved for the exercises. 

EXAMPLE 3 lff XXi f X y,f yy are all continuous and bounded by M, find a bound on f u . 
This is the second derivative along any line* 

Solution Equation (6) gives \f u \ < M cos 2 0 + M sin 20 + M sin 2 0 < 2Af* This upper 
bound 2M was needed in equation 13*3.14, for the error in linear approximation* 

THE DERIVATIVES OF /(*</, u), KA u» 

Suppose there are two inside functions x and y , each depending on t and u* When t 
moves, x and y both move: dx = x t dt and dy = y t dt. Then dx and dy force a change 
in /: df = f x dx + fydy * The chain rule for dfjdt is no surprise: 


131 Chain rule for^f, u), y(t, u)}: f = + ( 7 ) 


This rule has djdt instead of djdt , because of the extra variable u , The symbols remind 
us that u is constant* Similarly t is constant while u moves, and there is a second 
chain rule for dfjdu: f a =f x x u + f y y ti . 


EXAMPLE 4 In polar coordinates find /* and / eo . Start from/!*, y) -j\r cos 0, r sin 0). 


The chain rule uses the 0 derivatives of x and y: 


88 dx 88 + dy 88 




( 8 ) 


The second 8 derivative is harder, because (8) has four terms that depend on 0. Apply 
the chain rule to the first term dfjdx * It is a function of x and y t and x and y are 
functions of 8: 
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The 9 derivative of dfjdy is similar. So apply the product rule to (8): 

fee = [/**(- r sin 8) +L y (r cos 0)] (- r sin 0) +f x (- r cos 6) 

+ UyA~ r sin 9) +f yy {r cos f>)](r cos 9) +f y (~ r sin 9). (9) 

This formula is not attractive. In mathematics, a messy formula is almost always a 
signal of asking the wrong question. In fact the combination j xx + f is much more 
special than the separate derivatives. We might hope the same for f„ + f 00 , but dimen- 
sionally that is impossible — since r is a length and 9 is an angle. The dimensions of 
f xx and f are matched by f rr and fjr and fw/r 2 . We could even hope that 

f XX +fyy = frr + - fr+ ~z fee- 0°) 

r r 

This equation is true. Add (5) + {6)-h{9) with t changed to r, Laplace's equation 
fxx + Jyy = 0 is now expressed in polar coordinates : f rr + fjr + f^jr 2 — 0. 

A PARADOX 

Before leaving polar coordinates there is one more question. It goes back to drjdx, 
which was practically the first example of partial derivatives: 

IT = Jx 2 + y 2 ~ xljx 2 + y 2 = xir. (11) 

cx (lx 

My problem is this. We know that * is r cos 0. So xjr on the right side is cos 6. On 
the other hand r is x/co$ 9. So drjdx is also 1/cos 9. How can drjdx lead to cos 0 one 
way and 1/cos 9 the other wayl 

I will admit that this cost me a sleepless night. There must be an explanation — 
we cannot end with cos 9 = 1/cos 0. This paradox brings a new respect for partial 
derivatives. May I tell you what I finally noticed? You could cover up the next 
paragraph and think about the puzzle first. 

The key to partial derivatives is to ask: Which variable is held constant ? In 
equation (1 1), y is constant. But when r = x/cos 9 ga vecW5x = 1/cos 9 , 9 was constant. 
In both cases we change x and look at the effect on r. The movement is on a horizontal 
line (constant y) or on a radial line (constant 9). Figure 13. 15 shows the difference. 

Remark This example shows that drjdx is different from 1 j(cxjdr). The neat formula 
(irl£x)(cxldr} — 1 is not generally true. May I tell you what takes its place? We have 
to include (drjdy)[dyjdr). With two variables xy and two variables r0, we need 2 by 
2 matrices! Section 14.4 gives the details: 


drjdx c 

r/ay 

dxivr vxjvti 


1 

0 

c9jcx c 

'0/cy_ 

cyidr dyjdO 


0 

1 




Fig. 13.15 dr = dx cos 0 when y is constant, dr = dxj cos 0 when 0 is constant. 
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NON-INDEPENDENT VARIABLES 

This paradox points to a serious problem. In computing partial derivatives of/fc, y, z\ 
we assumed that x , y, z were independent. Up to now, x could move while y and z 
were fixed. In physics and chemistry and economics that may not be possible. If there 
is a relation between x, y, z, then x can't move by itself. 

EXAMPLE 5 The gas law PV — nRT relates pressure to volume and temperature. 
P, V f T are not independent What is the meaning of dVjdP ? Does it equal 1 /(dPjdV)? 

Those questions have no answers, until we say what is held constant. In the paradox, 
dr/dx had one meaning for fixed y and another meaning for fixed 6 . To indicate what 
is held constant, use an extra subscript (not denoting a derivative): 

(drjdx) y = cos 6 (drjdx) 9 = 1/cos 0. (12) 

(dffdP) y has constant volume and (dfjdP) T has constant temperature. The usual 
dfjdP has both V and T constant. But then the gas law won’t let us change P. 

EXAMPLE 6 Let/= 3x + 2y + z. Compute dfjdx on the plane z - 4x + y. 

Solution 1 Think of x and y as independent. Replace z by 4x + y: 

f= 3x + 2y + (4x + y) so (dfjdx) y = 7. 

Solution 2 Keep jc and y independent. Deal with z by the chain rule: 

(dfjdx) y = df/dx + (dfjdz)(dzjdx) = 3 + (1){4) - 7. 

Solution 3 (different) Make x and z independent. Then y = z - 4x: 

(dfjdx): - df/dx + (dfjdy)(dyjdx) - 3 + (2)(— 4) = - 5. 

Without a subscript, dfjdx means: Take the x derivative the usual way. The answer 
is dfjdx = 3, when y and z don’t move. But on the plane z = 4x + y, one of them must 
move! 3 is only part of the total answer, which is (dfjdx) y = 7 or (dfjdx): = “5. 

Here is the geometrical meaning. We are on the plane z — 4x + y. The derivative 
{dfjdx) y moves x but not y. To stay on the plane, dz is 4dx> The change in /= 
3x + 2y + z is df~ 3 dx + 0 + dz = Idx . 

EXAMPLE 7 On the world line x 2 + y 2 + z 2 — t 2 find [dfjdy) XtS for/= xyzt* 

The subscripts x, z mean that x and z are fixed. The chain rule skips dfjdx and 
dffdz: 

fflfdy)*.* = + (Sfjdt)(dtidy) = xzt + (xyz){y/t). Why y/r? 

EXAMPLE 6 From the law PV= T, compute the product (dPjdV) T (dVldT) P (dTldP) v . 
Any intelligent person cancels 5F’s, dTs, and dP’s to get 1. The right answer is — 1: 

(3P/dV) T = ~TjV 2 (3V/dT)p = 1/P (d TjdP ) v = V. 

The product is — T/PV. This is —1 not + 1! The chain rule is tricky (Problem 42). 

EXAMPLE 9 Implicit differentiation was used in Chapter 4. The chain rule explains it: 

I/F(x, y) = 0 then F x + F y y x = 0 jo dyjdx = - F x /F y . (13) 
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13.5 EXERCISES 


Read-through questions 

The chain rule applies to a function of a a . The x deriva- 
tive of f{g(x y >}) is dfjcx — b The y derivative is dfjdy = 
c The example /=(x + y) n has e = d Because 
dgfex = cgjdy we know that a = t This 9 
differential equation is satisfied by any function of x + y. 

Along a path, the derivative of /(x(f), y(f)) is dfidt = h . 
The derivative of _/(*(( )■ yff), z(r)) is i If /= xy then the 
chain rule gives dfidt = I That is the same as the k 
rule! When x = w,f and y - u 2 t the path is i The chain 
rule for f(x , y) gives dfjdt = m . That is the n deriva- 
tive DJ, 

The chain rule for /(x(f, u), y(r, u)) is of let = o We 
don't write dfidt because p If x = r cos 0 and y = r sin 0, 
the variables f, u change to g In this case dfjdr = 
r and dfidO = 5 That connects the derivatives in 

* and u coordinates. The difference between 
dr/dx = xjr and drjdx = 1/cos 0 is because v is constant 
in the first and w is constant in the second. 

With a relation like xyz = U the three variables are * 
independent. The derivatives (cfjdx) y and [dfjdx) s and (dfidx) 
mean v and * and a For /= x 2 + y 2 + z 2 with 

xyz — 1, we compute (cficx). from the chain rule B In 
that rule dzfcx = c from the relation xyz = 1 . 

Find f x and f y in Problems 1-4. What equation connects them? 

I fix, y) = sin(x + cy) 2 f(x, y) = (a.v + fty) 10 

3 f[x, >') = e 1 - l! 4 f(x, v) = InU + 7y) 

5 Find both terms in the r derivative of (g(.v(f). y(r)) 3 . 

6 If/(x, v) — xy and x = u(t) and v = r;;i. what is dfidt! Prob- 
ably all other rules for derivatives follow from the chain rule. 

7 The step function f(x) is zero for x < 0 and one for x > 0, 
Graph /(x) and j?(x) =J\x + 2) and h(x) =/(x + 4). If f{x + 2 1) 
represents a wall of water (a tidal wave), which way is it 
moving and how fast? 

8 The wave equation hf t = c 2 f xx , (a) Show r that (x + ct) n is 
a solution, (b) Find C different from c so that (x + Ci) n is also 
a solution. 

9 If /= sin(x — f), draw two lines in the xt plane along w hich 
/= 0. Between those lines sketch a sine wave. Skiing on top 
of the sine wave, what is your speed dxidil 

10 If you float at x = 0 in Problem 9, do you go up first or 
dow r n first? At time t = 4 what is your height and upward 
velocity? 

11 Laplaces equation is f xx +f yy — 0. Show from the chain 
rule that any function f[x + fy) satisfies this equation if i 1 = 

— L Check that / = (x + iy) 1 and its real part and 

its imaginary part , all satisfy Laplace's equation. 


12 Equation (10) gave the polar form f rr + fjr + f^jr 2 = 0 of 
Laplace's equation, (a) Check that f=r 2 e 2l$ and its real part 
r 1 cos 20 and its imaginary part t 2 sin 20 all satisfy Laplace’s 
equation, (b) Show from the chain rule that any functionary 0 } 
satisfies this equation if i 1 = — 1. 

In Problems 13-18 find dfidt from the chain rule (3). 

13 /= x 2 + — y — t 2 

14 f= Jx 1 + y 2 , x = t,y = t 2 

15 /= xy, x = 1 - jt, y = 1 + ^ft 

16 /= x/y, x = e 1 , y = 2e t 

17 /= ln(x + y), x = e\ y-e l 

18 f=x\x = uy = t 

19 Tf a cone grows in height by dhjdt — 1 and in radius by 
drjdt = 2, starting from zero, how fast is its volume growing 
at f = 3? 

20 If a rocket has speed dx/dt = 6 down range and dyjdt = 
It upward, how fast is it moving away from the launch point 
at (0, 0)? How r fast is the angle 0 changing, if tan 0 = y/x? 

21 If a train approaches a crossing at 60mph and a car 
approaches (at right angles) at 45 mph, how fast are they 
coming together? (a) Assume they are both 90 miles from the 
crossing, (b) Assume they are going to hit. 

22 In Example 2 does the temperature increase faster if you 
drive due south at 70 mph or southeast at 80 mph? 

23 On the line x = it^L y = u 2 t , r = what combination of 
f x ,j\,f z gives dfidt! This is the directional derivative in 3D, 

24 On the same line x = y = u z t, z = u 3 r, find a formula 
for d 2 fidt 2 . Apply it to f — xyz . 

25 For f(x , y, f) = x + y + t find dfjdt and dfjdt when x = 2t 
and y = 3r. Explain the difference. 

26 If z = (x -hy) 2 then x = J~z-y. Does (czjdx)(dxjdz) = 1? 

27 Suppose x c = t and y, = 2f, not constant as in (5-6), For 
/(x,y) find f and/ ( . The answer \n\o]xt$f x J y J xx J xy J yy . 

28 Suppose = f and y t — t 1 . For/=(x + y) 1 find); and then 
f tt from the chain rule. 

29 Derive dfjdr = {ofjdx) cos 0 + (dfjdy) sin 0 from the chain 
rule. Why do we take dxjcr as cos 0 and not 1/cos 01 

30 Compute f xx for /(x, y) = (ax + by + c) 10 . If x = t and y = 
f compute f tl . True or false; (dfjdx)(dxjdt) = dfjdt. 

31 Show that d 2 rjcx 2 — y 2 jr l in two ways: 

(1) Find the x derivative of drjdx = xj^Jx 2 + y 2 

(2) Find the x derivative of drjdx = xjr by the chain rule. 
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32 Reversing x and y in Problem 31 gives r yy = x 2 /r z . But 
show that r xy — — x y/r 3 . 

33 If sin z = x + y find ( dzjdx) y in two ways: 

(1) Write z = sin " '( x + y) and compute its derivative. 

(2) Take x derivatives of sin z = * + y. Verify that these 
answers, explicit and implicit, are equal. 

34 B y direct computation find f x and f xx and f xy for 
f=y/x 2 + y t . 

35 Find a formula for d 2 f/drd0 in terms of the x and y deriva- 
tives of y). 

36 Suppose i =f[x, y) is solved for x to give x = g(y 7 z). Is it 
true that dzjdx = l/(dx/3z)? Test on examples. 

37 Suppose z = e xy and therefore x — (In z)jy. Is it true or not 
that (dzjdx) = 1 j(dxjdz)? 

38 If x = x(t, «, v) and y = y(t, u ) and z = z (t> *>), find the t 
derivative off(x , y, z). 

39 The t derivative oif(x(t 7 u\ y(r, u)) is in equation (7). What 
is/„? 

40 (a) For f—x 2 y-y lj rz 2 compute dfjdx (no subscript, 
x, y , z all independent)* 

(b) When there is a further relation z = x 2 H- y 2 , use it to 
remove z and compute (df/dx) y . 

(c) Compute (dffdx) y using the chain rule (dfjdx) + 
(dffdz)(dzfdx). 

(d) Why doesn’t that chain rule contain (df/dy)(dy/dxY! 


41 For /= ax + by on the plane z = 3x + 5y, find {df/dx) 2 and 
(dfjdx) y and (cfjdz) x 

42 The gas law in physics is PV^nRT or a more general 
relation F(P, V, T) = 0. Show that the three derivatives in 
Example 8 still multiply to give —1. First find (dPjdV) T from 
dFjdV-y (dF/dP)(dP/dV) T = 0, 

43 If Problem 42 changes to four variables related by 
F(x, y, 2, r) = 0, what is the corresponding product of four 
derivatives? 

44 Suppose x = t + u and y = tu. Find the t and u derivatives 
ofj\x, y). Check when /(x, y) = x 2 — 2y* 

45 (a) For / = r 1 sin 2 8 find f x and f yt 
(b) For /= x 2 + y 2 find f r and /*. 

46 On the curve sin x 4- sin y = 0, find dyjdx and d 2 y/dx 2 by 
implicit differentiation. 

47 (horrible) Suppose f xx + f yy = 0. If x = u + v and y-u — v 
and^x, y) = g(u , r), find g u and g v . Show that g tiU + g b9 = 0. 

48 A function has constant returns to scale if /(c x, cy) — 
cfix, y). When x and y are doubled so are /= v /x 2 + y 2 
and / =3 y/xy. In economics, input/output is constant* In 
mathematics f is homogeneous of degree one. 

Prove that x dfjdx + y dfjdy =J\x y y), by computing the c 
derivative at c = 1. Test this equation on the two examples 
and find a third example. 

49 True or false : The directional derivative of f[r, 6) in the 
direction of is dfjdO. 


H 13.6 Maxima, Minima, and Saddle Points M 

The outstanding equation of differential calculus is also the simplest: dfjdx = 0. The 
slope is zero and the tangent line is horizontal* Most likely we are at the top or 
bottom of the graph — a maximum or a minimum* This is the point that the engineer 
or manager or scientist or investor is looking for — maximum stress or production 
or velocity or profit. With more variables mj\x 7 y) and^x, y , z), the problem becomes 
more realistic* The question still is: How to locate the maximum and minimum ? 

The answer is in the partial derivatives , When the graph is level, they are zero. 
Deriving the equations f x = 0 and^ = 0 is pure mathematics and pure pleasure. 
Applying them is the serious part. We watch out for saddle points, and also for a 
minimum at a boundary point — this section takes extra time, Remember the steps 
for f(x) in one-variable calculus: 

L The leading candidates are stationary points (where dfjdx = 0). 

2. The other candidates are rough points (no derivative) and endpoints (a or b ). 

3. Maximum vs, minimum is decided by the sign of the second derivative. 

In two dimensions, a stationary point requires dfjdx — 0 and dfjdy — 0. The tangent 
line becomes a tangent plane* The endpoints a and b are replaced by a boundary 
curve . In practice boundaries contain about 40% of the minima and 80% of the work. 
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Finally there are three second derivatives f xx ,f xy , and f yy . They tell how the graph 
bends away from the tangent plane — up at a minimum , down at a maximum, both 
ways at a saddle point. This will be determined by comparing (/**)(/„.) with (f xy ) 2 . 

STATIONARY POINT -> HORIZONTAL TANGENT - ZERO DERIVATIVES 

Suppose /has a minimum at the point (x 0 , y 0 ). This may be an absolute minimum or 
only a local minimum. In both cases /(x 0 , y 0 ) ^/(x, y) near the point. For an absolute 
minimum, this inequality holds wherever / is defined, For a local minimum, the 
inequality can fail far away from (x 0 ,y 0 ). The bottom of your foot is an absolute 
minimum, the end of your finger is a local minimum. 

We assume for now that (x 0 , y 0 ) is an interior point of the domain of /. At a 
boundary point, we cannot expect a horizontal tangent and zero derivatives. 

Main conclusion: At a minimum or maximum (absolute or local) a nonzero deriva- 
tive is impossible. The tangent plane would lilt. In some direction/ would decrease. 
Note that the minimum point is (x 0 ,y 0 ), and the minimum value is/(x 0 ,yo). 


13J If derivatives exist at an interior minimum or maximum, they are zero: 

dfjdx = 0 and df/dy - 0 (together this is grad /= 0). (1) 

For a function /tx', y, z) of three variables, add the third equation dfjdz = 0. 


The reasoning goes back to the one- variable case. That is because we look along the 
lines x = x 0 and y= y 0 . The minimum of f(x, y) is at the point where the lines meet. 
So this is also the minimum along each line separately. 

Moving in the x direction along y = y 0 , we find df/dx = 0. Moving in they direction, 
df/dy = 0 at the same point. The slope in every direction is zero. In other words 
grad /= 0. 

Graphically, (x 0 , y 0 ) is the low point of the surface z =/(x, y). Both cross sections 
in Figure 13.16 touch bottom. The phrase “if derivatives exist” rules out the vertex 
of a cone, which is a rough point. The absolute value / — |x| has a minimum without 
dfjdx — 0, and so does the distance /= r. The rough point is (0, 0). 




Fig. 13.16 df/8x = 0 and df/dy = 0 at the minimum. Quadratic / has linear derivatives. 


EXAMPLE 1 Minimize the quadratic /(x, y) = x 2 + xy + y 2 - x - y + 1. 
To locate the minimum (or maximum), set /t = 0 and f y = 0: 

f x = 2x + y— 1 = 0 and f y = x + 2y — 1 = 0. 


( 2 ) 
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Notice what's important: There are two equations for two unknowns x and y. Since / 
is quadratic, the equations are linear. Their solution is x 0 = y, y 0 = j (the stationary 
point). This is actually a minimum, but to prove that you need to read further. 

The constant 1 affects the minimum value / = § — but not the minimum point* The 
graph shifts up by 1 , The linear terms - x- y affect f x and/ y . They move the minimum 
away from (0, 0), The quadratic part x 2 + xy -f- y 2 makes the surface curve upwards* 
Without that curving part, a plane has its minimum and maximum at boundary 
points* 

EXAMPLE 2 (Steiner's problem I Find the point that is nearest to three given points. 

This example is worth your attention* We are locating an airport close to three cities. 
Or we are choosing a house close to three jobs. The problem is to get as near as 
possible to the corners of a triangle* The best point dep ends on the meaning of near." 

The distance to the first corner ( jc i , >■ i ) is = ^/{x - x x ) 2 + (y - ) 2 , The dis- 

tances to the other corners (x 2 > y 2 ) a nd ( x 3 > >’ 3 ) are a nd d 3 . Depending on whether 
cost equals (distance) or (distance) 1 or (distance) 17 , our problem will be: 

Minimize d ] - d> + d^ or d] + d \ + d\ or even d l \ + d% + d 

The second problem is the easiest, when d\ and d\ and d\ are quadratics: 

/(x, >’) = (x - Xj ) 2 + (>’ - y ,) 2 -h (x- x 2 ) 2 4- (y - y 2 ) 2 + (x- X 3) 2 4- (y -y 3 ) 2 

dfjcx = 2[x - x t + x - x, + x - x 3 ] = 0 Sfjdy = 2 [y - y : 4- y - y 2 + y - y 3 ] = 0. 

Solving eficx = 0 gives x - \ ! x , + x 2 4- x , ). Then cfjvy = 0 gives y - \( v , + y 2 4- y 3 ). 
The best point is the centroid of the triangle (Figure 13.17a). It is the nearest point 
to the corners when the cost is (distance) 2 * Note how squaring makes the derivatives 
linear. Least squares dominates an enormous part of applied mathematics. 


(.y v y : j 



The real "Steiner problem ” is to minimize /(x, y) = d x + d 2 + d$ . Wc arc laying down 
roads from the corners, with cost proportional to length. The equations f x = 0 and 
f y = 0 look complicated because of square roots. But the nearest point in 
Figure 13.17b has a remarkable property, which you will appreciate. 

Calculus takes derivatives of d\ = (x — x x ) 2 + (y — j^) 2 . The x derivative leaves 
2</ 1 (2d 1 /3x) = 2{x - x,). Divide both sides by 2 


cd t 
ex 


x~x : 

d. 


and 


cd x 


cy 


d i 


so grad d l = 


v - Xj y — y i 


(3) 


This gradient is a unit vector. The sum of (x — x x ) 2 id\ and (y — Si) 2 ;d\ is d 2 .'d 2 = 1. 
This was already in Section 13.4: Distance increases with slope 1 away from the 
center. The gradient of d x (call it u : ) is a unit vector from the center point (xj, j J* 
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Similarly ttie gradients of d 2 and d 3 are unit vectors u 2 and u 3 . They point directly 
away from the other comers of the triangle. The total cost is f{x t y) = d x + d 2 + d 3s 
so we add the gradients. The equations f x = 0 and f y = 0 combine into the vector 
equation 

grad f= u L + u 2 + u 3 = 0 at the minimum. 

The three unit vectors add to zero ! Moving away from one corner brings us closer to 
another. The nearest point to the three corners is where those movements cancel. 
This is the meaning of “grad /= 0 at the minimum.” 

It is unusual for three unit vectors to add to 2 ero — this can only happen in one 
way. The three directions must form angles of 120°. The best point has this property, 
which is repeated in Figure 13.18a. The unit vectors cancel each other. At the “Steiner 
point,” the roads to the corners make 120° angles. This optimal point solves the 
problem, except for one more possibility. 





Hg. 13.16 Gradients u, +ii 7 + u 3 = 0 for 120° angles. Corner wins at wide angle. Four 
corners. In this case two branchpoints are better — still 120°. 


The other possibility is a minimum at a rough point . The graph of the distance 
function d x (x f y ) is a cone. It has a sharp point at the center (x x , y x ). All three corners 
of the triangle are rough points for d : + d 2 + d 3 , so all of them are possible minimizers. 

Suppose the angle at a corner exceeds 120 u . Then there is no Steiner point. Inside 
the triangle, the angle would become even wider. The best point must be a rough 
point — one of the corners. The winner is the corner with the wide angle. In the figure 
that means d x = 0. Then the sum d 2 + d 3 comes from the two shortest edges. 

Summary The solution is at a 12(F point or a wide-angle corner. That is the theory. 
The real problem is to compute the Steiner point — which I hope you will do. 

Remark 1 Steiner's problem for four points is surprising. We don't minimize 
d{ + d 2 + d 3 + d 4 — there is a better problem. Connect the four points with roads, 
minimizing their total length, and allow the roads to branch. A typical solution is in 
Figure 13. 1 8c. The angles at the branch points are 120 c . There are at most two branch 
points (two less than the number of corners). 

Remark 2 For other powers p, the cost is (d L )^ + (d 2 ) p + [d$) p > The x derivative is 

cfjdx = p(d,) p ~ 2 (x - X,) + pW 2 ) p ' 2 (,v - x 2 ) + p(d i ) p ~ 2 (x - x 3 ). (4) 

The key equations are still df/dx = 0 and dfjdy = 0. Solving them requires a computer 
and an algorithm. To share the work fairly, I will supply the algorithm (Newton’s 
method) if you supply the computer. Seriously, this is a terrific example. It is typical 
of real problems — we know of/dx and Sf/dy but not the point where they are zero. 
You can calculate that nearest point, which changes as p changes. You can also 
discover new mathematics, about how that point moves. I will collect all replies I 
receive to Problems 38 and 39. 
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MINIMUM OP MAXIMUM ON THE BOUNDARY 

Steiner's problem had no boundaries. The roads could go anywhere. But most appli- 
cations have restrictions on x and y, like x ^ 0 or y ^ 0 or x 2 + y 2 ^ 1. The minimum 
with these restrictions is probably higher than the absolute minimum. There are three 
possibilities: 

(1) stationary point J x = 0 f j y = 0 (2) rough point (3) boundary point 

That third possibility requires us to maximize or minimize /{x, y) along the boundary. 

EXAMPLE 3 Minimize f(x, y) = x 2 + xy + y 1 — x — y + 1 in the half-plane x ^ 0, 

The minimum in Example 1 was Tt occurred at x 0 = y 0 = This point is still 
allowed. It satisfies the restriction x ^0. So the minimum is not moved. 

EXAMPLE 4 Minimize the same /(x, y) restricted to the tower half-plane y ^ 0. 

Now the absolute minimum at (j, }) is not allowed. There are no rough points. We 
look for a minimum on the boundary line y = 0 in Figure 13.19a. Set y~0, so/ 
depends only on x. Then choose the best x: 

f[x, 0) = x 2 + 0 - x - 0 + 1 and f x = 2x - 1 = 0. 

The minimum is at x = j and y — 0, where/ = 5. This is up from \. 



Fig. 13 ,19 The boundaries y — 0 and x 3 + y 2 = 1 contain the minimum points. 

EXAMPLE 5 Minimize the same f(x, y) on or outside the circle x 2 -f - y 2 = F 

One possibility is f x = 0 and / — 0. But this is at {3. 3), inside the circle. The other 
possibility is a minimum at a boundary poi nt, on t he circle. 

To follow this boundary we can set y = - x 2 . The function/gets complicated, 

and df/dx is worse. There is a way to avoid square roots: Set x = cos t and y = sin r. 
Then f— x 2 + xy + y 2 - x — y + 1 is a function of the angle t: 

/(f) = 1 f cos f sin f - cos f — sin t + 1 

dfjdt = cos 2 f - sin 2 f + sin f — cos t = (cos t — sin f)(cos t + sin t - 1), 

Now dfidt = 0 locates a minimum or maximum along the boundary. The first factor 

(cos t - sin r) is zero when x = y. The second factor is zero when eos r + sin t = 1, or 

x + y = 1. Those points on the circle are the candidates. Problem 24 sorts them out, 
and Section 13,7 finds the minimum in a new way using 'Lagrange multipliers. ' 
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Minimization on a boundary is a serious problem — it gets difficult quickly — and 
multipliers are ultimately the best solution. 

MAXIMUM VS. MINIMUM VS. SADDLE POINT 


How to separate the maximum from the minimum? When possible, try all candidates 
and decide. Compute / at every stationary point and other critical point (maybe also 
out at infinity), and compare. Calculus offers another approach, based on second 
derivatives. 

With one variable the second derivative test was simple: f xx > 0 at a minimum, 
f xx = 0 at an inflection point, f xx < 0 at a maximum. This is a local test, which may 
not give a global answer. But it decides whether the slope is increasing (bottom of 
the graph) or decreasing (top of the graph). We now find a similar test for /(x, y ). 

The new test involves all three second derivatives. It applies where / x = 0 and 
f y = 0. The tangent plane is horizontal. We ask whether the graph of f goes above or 
below that plane. The tests f xx > 0 and f yy > 0 guarantee a minimum in the x and y 
directions, but there are other directions. 

EXAMPLE 6 /(x, y) = x 2 + lOxy 4* y 2 has f xx = 2, f xy = 10, f yy = 2 (minimum or not ,y .) 

All second derivatives are positive — but wait and see. The stationary point is (0, 0), 
where df/dx and df/dy are both zero. Our function is the sum of x 2 4- y 2 , which goes 
upward, and lOxy which has a saddle. The second derivatives must decide whether 
x 2 + y 2 or lOxy is stronger. 

Along the x axis, where y = 0 and /= x 2 , our point is at the bottom. The minimum 
in the x direction is at (0, 0). Similarly for the y direction. But (0, 0) is not a minimum 
point for the whole function, because of lOxy. 

Try x= 1, y = — 1. Then /= 1 — 10+1, which is negative. The graph goes below 
the xy plane in that direction. The stationary point at x = y = 0 is a saddle point. 



EXAMPLE 7 /(x, y) = x 2 + xy + y 2 has f xx = 2, f xy = 1, f yy = 2 ( minimum or not ?) 

The second derivatives 2, 1, 2 are again positive. The graph curves up in the x and y 
directions. But there is a big difference from Example 6: f xy is reduced from 10 to 1. 
It is the size of f xy (not its signl) that makes the difference. The extra terms - x - y 4- 4 
in Example 1 moved the stationary point to (3, 3). The second derivatives are still 
2, 1,2, and they pass the test for a minimum: 


13K At (0, 0) the quadratic function /(x, y) = ax 2 4- 2 bxy 4- cy 2 has a 
a > 0 a < 0 

minimum if maximum if saddle point if ac < b 2 . 

ac > b 2 ac> b 2 
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For a direct proof, split/(x, y) into two parts by "‘completing the square:'’ 


ax 2 + 2 hxy + cy 2 



ac-h 2 


That algebra can be checked (notice the 2h). It is the conclusion that's important: 

if a > 0 and ac > h 2 < both parts are positive: minimum at (0, 0) 
if a < 0 and ac > b 2 > both parts are negative: maximum at (0, 0) 
if ac < b 2 , the parts have opposite signs: saddle paint at (0, 0)* 

Since the test involves the square of b , its sign has no importance. Example 6 had 
h = 5 and a saddle point. Example 7 had b = j and a minimum. Reversing to 
— .v 2 — xy — y 2 yields a maximum. So does — x 2 + xy — y 2 . 

Now comes the final step, from *;x 2 + 2bxy + cy 2 to a general function /(x, y), For 
all functions, quadratics or not, it is the second order terms that we test. 


EXAMPLE 8 /(. x, y) = e* — x “ cos y has a stationary point at x = 0, y = 0. 

The first derivatives are e x — I and sin both zero. The second derivatives are f xx = 
e x = 1 and f yy = cos y= 1 and/ ty = 0. We only use the derivatives at the stationary 
point . The first derivatives are zero, so the second order terms come to the front in 
the series for e x — x — cos y: 

{1 + X + 4x 2 + »•) - X - (1 - it' 2 + •■■) = ix 2 + \y z + higher order terms. (7) 

There is a minimum at the origin. The quadratic part |x 2 + \y 2 goes upward. The x J 
and y 4 terms are too small to protest. Eventually those terms get large, but near a 
stationary point it is the quadratic that counts. We didn't need the whole series, 
because from f xx —f yy — 1 and f xy = 0 we knew it would start with |x 2 + iy 2 . 


13L The test in 13K applies to the second derivatives a=f xx , b — f xy , c = f yy 
of any/(x, y) at any stationary point. At all points the test decides whether the 
graph is concave up, concave down, or "indefinite." 


EXAMPLE 9 /(x, y) — e xy has f x = ye xy and / v — xe xy . The stationary point is (0, Of, 

The second derivatives at that point are a=f xx = 0, b=f xy = 1, and c=f yy = 0. The 
test b 2 > ac makes this a saddle point. Look at the infinite series: 

e xy =\ +.vy + i\- 2 y 2 + "'- 

No linear term because./* =f y — 0: The origin is a stationary point , No x 2 or y 2 term 
(only xy): The stationary point is a saddle point. 

At x = 2, y — “2 we find f xx f yy > {f xy ) 2 ~ The graph is concave up at that point- 
hut it’s not a minimum since the first derivatives are not zero. 

The series begins with the constant term — not important. Then come the linear 
terms — extremely important. Those terms are decided by first derivatives, and they 
give the tangent plane. It is only at stationary points — when the linear part disappears 
and the tangent plane is horizontal — that second derivatives take over. Around any 
basepoint, these constant-linear-quadratic terms are the start of the Taylor series . 
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THE TAYLOR SERIES 


We now put together the whole infinite series. It is a "Taylor series” — which means 
it is a power series that matches all derivatives of f (at the basepoint). For one 
variable, the powers were x" when the basepoint was 0. For two variables, the 
powers are x n times y m when the basepoint is (0,0). Chapter 10 multiplied the nth 
derivative d n f/dx n by x^/n! Now every mixed derivative (d/dxY'(d/dy) m f(x, y) is computed 
at the basepoint (subscript 0 )- 

We multiply those numbers by x n y m /n\m\ to match each derivative of /(x, y): 


13M When the basepoint is (0, 0), the Taylor series is a double sum II a nm x n y m . 
The term a^f 1 has the same mixed derivative at (0, 0) as /(x, y). The series is 



The derivatives of this series agree with the derivatives of /(x, y) at the basepoint. 


The first three terms are the linear approximation to /(x, y). They give the tangent 
plane at the basepoint. The x 2 term has n = 2 and m = 0, so n\m\ = 2. The xy term 
has n = m= 1, and nlml = 1. The quadratic part \(ax 2 + 2 bxy + cy 2 ) is in control when 
the linear part is zero. 

EXAMPLE 10 All derivatives of e x+y equal one at the origin. The Taylor series is 

V “ \~ X n V m 

^, = l+ , +) , +T+ , y+ ,_ + ... =II _ 

This happens to have ac = b 2 % the special case that was omitted in 13M and 13N. 
It is the two-dimensional version of an inflection point. The second derivatives fail to 
decide the concavity. When f xx f yy = (/ xy ) 2 , the decision is passed up to the higher 
derivatives. But in ordinary practice, the Taylor series is stopped after the quadratics. 

If the basepoint moves to (x 0 , y 0 ), the powers become (x - x 0 )"(y - y 0 ) m — and all 
derivatives are computed at this new basepoint. 

Final question : How would you compute a minimum numerically ? One good way is 
to solve f x = 0 and f y = 0. These are the functions g and h of Newton’s method 
(Section 13.3). At the current point (x„, y„), the derivatives of g=f x and h = f y give 
linear equations for the steps Ax and Ay. Then the next point x n + { = x n + Ax, y n+ { = 
y n + Ay comes from those steps. The input is (x„, yj, the output is the new point, 
and the linear equations are 

(g x ) Ax + (g y )Ay = - g(x„, y n ) (/**)Ax + (f xy )Ay = -/ X (x„, yj 

or (5) 

(h x ) Ax + (h y )Ay = - h(x„ y n ) (f xy ) Ax + (f yy )Ay = -/ y (x„, y„). 

When the second derivatives of /are available, use Newton’s method. 

When the problem is too complicated to go beyond first derivatives, here is an 
alternative — steepest descent. The goal is to move down the graph of /(x, y), like a 
boulder rolling down a mountain. The steepest direction at any point is given by the 
gradient , with a minus sign to go down instead of up. So move in the direction Ax = 
— 5 df/dx and Ay = - s df/dy. 
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The question is: How far to move? Like a boulder, a steep start may not aim 
directly toward the minimum. The stepsize .$ is monitored, to end the step when the 
function /starts upward again (Problem 54). At the end of each step, compute first 
derivatives and start again in the new steepest direction. 


13.6 EXERCISES 


Read-through questions 

A minimum occurs at a Q point (where j x = j y = 0) or a 
b point (no derivative) or a c point. Since /= 
x 2 — xy + 2 y has f x = d and /. = e the stationary 
point is x = t y = g This is not a minimum, 
because /decreases when h 

The minimum of d 1 — (x — x t ) 2 4- (y — _V| ) 2 occurs at the 
rough point l The graph of d is a ) and grad d 
is a k vector that points < The graph of/= \xy\ 

touches bottom along the lines m . Those are “rough 
lines 11 because the derivative n The maximum of d and 
/ must occur on the Q of the allow r ed region because it 
doesn't occur P 

When the boundary curve is x = x(r), y = the derivative 
of /(x, y) along the boundary is q (chain rule). If/— 
x 1 + 2y 2 and the boundary is x = cos r, y = sin r, then dfjdt — 
r It is zero at the poiots s The maximum is at 
t and the minimum is at u Inside the circle/has 

an absolute minimum at v 

To separate maximum from minimum from w com- 
pute the * derivatives at a v point. The tests for a 
minimum are * The tests for a maximum are a [n 
case ac < B or f xx f yy < C we have a D At all 
points these tests decide bctw r ccn concave up and E and 
“indefinite. 11 For/= 8x 2 — 6.vy 4- y 2 , the origin is a f The 
signs of/ at (1,0) and (L 3) are O 

The Taylor series for/(x, y) begins with the six terms H 
The coefficient of x n y m is I To find a stationary point 
numerically, use J or K 


Find all stationary points {f x = f y = 0) in 1-16. Separate mini- 
mum from maximum from saddle point. Test 13K applies to 

^ = Jxxt h —fxyr C = fyy" 


1 

x 2 + 

2xy + 3_r 

2 

xy - x + y 

3 

x 2 + 

4vy + 3r - 6 \ — 

12y 4 

a- 2 - y 2 + 4y 

5 


— X 

6 

xe y — e x 

7 

-x 2 

+ 2xy - 3y 2 

8 

(a + y) : + (a + 2y - 6) 

9 

X 2 + 

y 2 + z 2 -4z 

10 

(a + >■)(* + 2y - 6) 

11 

(X- 

v) 2 

12 

(I +x 2 )/(I + y 2 ) 

13 

(x + 

y) 2 -(.x + 2y) 2 

14 

sin x — cos y 


15 x 3 + y 3 - 3x 2 + 3y 2 16 8xy-x 4 -y 4 

17 A rectangle has sides on the x and y axes and a corner on 
the line x 4- 3 y — 12, Find its maximum area. 

18 A box has a corner at (0, 0, 0) and all edges parallel to the 
axes. If the opposite corner (x, y T z) is on the plane 
3x + 2y + z = 1, what position gives maximum volume? Show 
first that the problem maximizes xy — 3x 2 y — 2xy 2 , 

19 (Straight line fit, Section 11,4) Find x and y to minimize 
the error 

£ = (a + y) 2 + lx + 2 v - 5) 2 + (a + 3 v - 4) 2 . 

Show' that this gives a minimum not a saddle point, 

20 [Least squares) What numbers x, y come closest to satisfy- 
ing the three equations x - y = 1, 2x 4- y = — 1, x + 2y = 1? 

Square and add the errors, (x — y — l) 2 4- + 

. Then minimize, 

21 Minimize /= x 2 + xy + y 2 — x — y restricted by 

(a) x ^ 0 (b) y ^ 1 (c) x ^ 0 and y ^ ] , 

22 Minimize /= x 2 4- y 2 + 2x 4- 4y in the regions 

(a) all x, y (b) y ^ 0 (e) x ^ 0, y ^ 0 

23 Maximize and minimize /= x + v /3y on the circle x = 
cos f, y = sin /, 

24 Example 5 followed /= x 2 + xy + y 2 — ,v — y 4- 1 around 
the circle _x 2 — y 2 = 1. The four stationary points have x = y 
or x + y=l. Compute / at those points and locate the 
minimum. 

25 (a) Maximize /= ax 4- fry on the circle x 2 4- y 2 = 1. 

(b) Minimize x 2 4- y 2 on the line ax 4- by = 1. 

26 For/(x, y) = ^x 4 xy 4 iy 4 , what are the equations / = 
0 and /. — 0? What are their solutions? What is / min ? 

27 Choose c > 0 so that /= x 2 4 xy 4- cy 2 has a saddle point 
at (0, 0), Note that/ > 0 on the lines x - 0 and y — 0 and y = 
x and y = — x, so checking four directions does not confirm 
a minimum. 

Problems 28-42 minimize the Steiner distance /= d x + d 2 4- d 3 
and related functions, A computer is needed for 33 and 36-39, 

28 Draw the triangle with corners at (0, 0), (1, 1), and (l, - I). 

By symmetry the Steiner point will be on the x axis. Write 
down the distances to (x, 0) and find the x that 

minimizes d : +d 2 + d 2i . Check the 1 20 angles. 
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29 Suppose three unit vectors add to zero. Prove that the 
angles between them must be 120°. 

30 In three dimensions, Steiner minimizes the total distance 
j\x, y, z) — d^ + d 2 -\- d$± d± from four points. Show that 
gradd, is still a unit vector (in which direction?) At what 
angles do four unit vectors add to zero? 

31 With four points in a plane, the Steiner problem allows 
branches (Figure 13.18c). Find the shortest network connect- 
ing the corners of a rectangle, if the side lengths are (a) 1 and 
2 (b) 1 and 1 (two solutions for a square) (c) 1 and 0,1. 

32 Show that a Steiner point (120° angles) can never be out- 
side the triangle. 

33 Write a program to minimize J{x y y) = d x + d 2 + d^ by 
Newton’s method in equation (5), Fix two comers at (0, 0), 
(3, 0), vary the third from (1, I) to (2, 1) to (3, 1) to (4, 1), and 
compute Steiner points. 

34 Suppose one side of the triangle goes from (— 1, 0) to (1, 0), 
Above that side are points from which the lines to (—1, 0) and 
(1, 0) meet at a 12Q 3 angle. Those points lie on a circular arc — 
draw it and find its center and its radius. 

35 Continuing Problem 34, there are circular arcs for all three 
sides of the triangle. On the arcs* every point sees one side of 
the triangle at a 120° angle. Where is the Steiner point? 
(Sketch three sides with their arcs.) 

36 Invent an algorithm to converge to the Steiner point based 
on Problem 35. Test it on the triangles of Problem 33. 

37 Write a code to minimize f=d\ + d\ + d% by solving f x =0 
and f y = 0, Use Newton’s method in equation (5). 

38 Extend the code to allow all powers 1, not only p = 
4. Follow the minimizing point from the centroid at p = 2 to 
the Steiner point at p = 1 (try p = 1.8, 1.6, 1.4, 1,2). 

39 Follow the minimizing point with your code as p increases: 
p = 2, p = 4, p = 8, p = 16. Guess the limit at p = oo and test 
whether it is equally distant from the three corners. 

40 At p = oo we are making the largest of the distances 

di,d 2 , d 3 as small as possible. The best point for a 1, 1, 
right triangle is . 

41 Suppose the road from corner 1 is wider than the others, 
and the total cost \sj[x, y) = ^/ld t + d 2 ±d 3 . Find the gradi- 
ent of / and the angles at which the best roads meet. 

42 Solve Steiner’s problem for two points. Where is + d 2 
a minimum? Solve also for three points if only the three 
corners are allowed. 


Find all derivatives at (0, 0). Construct the Taylor series: 

43 fix, y) = (x + y) 3 44 /(x, y) = xe” 

45 f\x, y) = ln(l - xy) 

Find }\... _f yy at the basepoint. Write the quadratic 

approximation to fix* y) — the Taylor series through second- 
order terms: 

46 /= at (0, 0) 47 /= at (1, 1) 

48 /= sin x cos y at (0, 0) 49 /= x 1 + y 2 at (1, —1) 

50 The Taylor series around (x, y) is also written with steps 

ft and k:J[x + ft, y + fc) =_/(*, y) + ft + fc + 

h 2 +hk + Fill in those four blanks. 

51 Find lines along which /(x, y) is constant (these functions 
ha vef XI f„ =fl y or ac = b 2 ): 

(a) /= x 2 — 4xy + 4 y 1 (b)/= eV 

52 For_ftx, y, z ) the first three terms after /( 0, 0, 0) in the Tay- 
lor series are . The next six terms are . 

53 (a) For the error f— f L in linear approximation* the Taylor 

series at (0, 0) starts with the quadratic terms . 

(b) The graph of / goes up from its tangent plane (and 

/ > f L ) if . Then / is concave upward. 

(c) For (0 t 0) to be a minimum we also need 

54 The gradient of x 2 + 2y 2 at the point (1,1) is (2,4). 
Steepest descent is along the line x = 1 - 2s, y = 1 — 4s (minus 
sign to go downward). Minimize x 1 + 2y 2 with respect to the 

stepsize s. That locates the next point , where 

steepest descent begins again. 

55 Newton’s method minimizes x 2 + ly 1 in one step. Starling 
at (x 0 , yo) = (U U find Ax and Ay from equation (5). 

56 If f xx + f yy = 0, show that /(x, y) cannot have an interior 
maximum or minimum (only saddle points). 

57 The value of x theorems and y exercises is/ = x 2 y (maybe). 
The most that a student or author can deal with is 4x + y = 
12. Substitute y — 12 — 4x and maximize / Show that the line 
4x + y = 12 is tangent to the level curve x 2 y =/ mai . 

58 The desirability of x houses and y yachts is_ftx, y)> The 

constraint px + qy = k limits the money available. The cost of 
a house is , the cost of a yacht is . Substi- 

tute y = (k — px)jq into_ftx, y) = F(x) and use the chain rule 
for dFjdx, Show that the slope —/*//, at the best x is — pjq. 

59 At the farthest point in a baseball field, explain why the 
fence is perpendicular to the line from home plate. Assume 
it is not a rough point (corner) or endpoint (foul line). 
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WM. 13.7 Constraints and Lagrange Multipliers !■ 

This section faces up to a practical problem. We often minimize one function^*, y) 
while another function g(x, y) is fixed. There is a constraint on x and y ; given by 
g(x 9 y) — k. This restricts the material available or the funds available or the energy 
available. With this constraint, the problem is to do the best possible (/ m „ or / min ). 

At the absolute minimum of^x, y), the requirement g(x, y) - k is probably violated. 
In that case the minimum point is not allowed. We cannot use f x = 0 and f y = 0 — 
those equations don’t account for g. 

Step 1 Find equations for the constrained minimum or constrained maximum . They 
will involve f x and f y and also g x and g y , which give local information about / and g. 
To see the equations, look at two examples. 

EXAMPLE 1 Minimize /= x 2 + y 2 subject to the constraint g — 2x + y — k. 

Trial runs The constraint allows x — 0, y — fc, where /— k 2 . Also (£fc, 0) satisfies the 
constraint, and /= \k z is smaller. Also x = y = $k gives /= Ik 2 (best so far). 

Idea of solution Look at the level curves of/(x, y) in Figure 13.21. They arc circles 
x 2 + y 2 = c. When c is small, the circles do not touch the line 2 x + y = fc. There are 
no points that satisfy the constraint, when c is too small. Now increase c. 

Eventually the growing circles x 2 -I- y 2 = c will just touch the line x + 2y = fc. The 
point where they touch is the winner. It gives the smallest value of c that can be 
achieved on the line. The touching point is (x min , y min ), and the value of c 

What equation describes that point? When the circle touches the line, they are 
tangent . They have the same slope. The perpendiculars to the circle and the line go in 
the same direction. That is the key fact, which you see in Figure 13.21a. The direction 
perpendicular to/— c is given by grad / — (f x9 f y ). The direction perpendicular to g — 
k is given by grad g“ (g*, g y ). The key equation says that those two vectors are 
parallel. One gradient vector is a multiple of the other gradient vector, with a multi- 
plier k (called lambda) that is unknown: 

13N At the minimum of /(x, y) subject to g(x, y) — k, the gradient of /is 
parallel to the gradient of g — with an unknown number k as the multiplier: 

df dg df , dg 

grad/- X grad g so £ = ^ and f y = (D 

Step 2 There are now three unknowns x, y, X. There are also three equations: 
dfjdx — X dgjdx is 2x = 2/ 

df/dy = Xdg/dy is 2 >- = / (2) 

g(x, y) — k is 2x + y = k. 

In the third equation, substitute 2X for 2x and \X for y. Then 2x + y equals 
equals k. Knowing /. = jk, go back to the first two equations for x, y, and f min : 



The winning point (x min , y min ) is (|fc, $k). It minimizes the “distance squared,” 
f- x 2 + y 2 = $k 2 , from the origin to the line. 
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Question What is the meaning of the Lagrange multiplier XI 

Mysterious answer The derivative of jk 2 is \k, which equals L The multiplier 
X is the derivative of f mlri with respect to k. Move the line by Afc, and/^ changes by 
about X\k. Thus the Lagrange multiplier measures the sensitivity to k. 

Pronounce his name “Lagronge” or better “Lagrongh” as if you are French. 




Rg. 13.21 Circles f=c tangent to line g = k and ellipse g = 4: parallel gradients. 


EXAMPLE 2 Maximize and minimize /= x 2 + y 2 on the ellipse g = (jc — l) 2 + 4 y 2 = 4. 

Idea and equations The circles x 2 + y 1 = c grow until they touch the ellipse. The 
touching point is (x min , y min ) and that smallest value of c is/ mi(l . As the circles grow 
they cut through the ellipse. Finally there is a point (^,^1 where the last circle 
touches. That largest value of c is/ ma!t . 

The minimum and maximum are described by the same rule: the circle is tangent 
to the ellipse (Figure 13.21b). The perpendiculars go in the same direction. Therefore 
(fx> fy) is a multiple of (g Xt g y \ and the unknown multiplier is X: 

fc = Xg x : 2x = A2(x-\) 

f y = *gy: 2y = A8y (3) 

g = k: (jc — \) 2 + 4 y 2 = 4. 

Solution The second equation allows two possibilities: y = 0 or X ~ Following up 
y — 0, the last equation gives (x— 1) 2 = 4. Thus x~3 or x= — L Then the first 
equation gives X = 3/2 or X- 1/2. The values of / are x 1 + y 2 = 3 2 + 0 2 = 9 and 
x 2 + y 1 = (-l) 2 + 0 2 = 1. 

Now follow A - 1/4. The first equation yields x = - 1/3. Then the last equation 
requires y 1 = 5/9. Since x 1 = 1/9 we find x 2 + y 1 = 6/9 = 2/3. This is / min . 

Conclusion The equations (3) have four solutions, at which the circle and ellipse 
are tangent. The four points are (3, 0), (— 1, 0), {- 1/3, >/5/3), and (— 1/3, - >/5/3). The 
four values of / are 9, 1, 3, f . 

Summary The three equations are f x ~ Xg x and f y ~ Xg y and g ~ k> The unknowns 
are x, y, and A. There is no absolute system for solving the equations (unless they are 
linear; then use elimination or Cramer’s Rule). Often the first two equations yield x 
and y in terms of X, and substituting into g—k gives an equation for X. 

At the minimum, the level curve f{x, y) = c is tangent to the constraint curve 
g(x,y) = k. If that constraint curve is given parametrically by x(r) and y((), then 
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minimizing /(x(t), y(t)) uses the chain rule: 


dt dx dt dy dt 


or (grad /) * (tangent to curve) = 0. 


This is the calculus proof that grad / is perpendicular to the curve. Thus grad / is 
parallel to grad g. This means ( f Xi f y ) = X{g x , g y ) 

We have lost f x = 0 and f y — 0. But a new function L has three zero derivatives: 


130 The Lagrange function is Lfx, y. A) =/(x, y)-A(g(x, y)-k). Its three 
derivatives are L x = L y = L x = 0 at the solution: 

d ± = H-i d A = c\ — = K =n —= - cr^- ir = n Ml 

dx dx k dx ° dy dy X dy ° dX 8 * °' ^ 


Note that dLjdk = 0 automatically produces g = k. The constraint is ''built in” to L. 
Lagrange has included a term Mjg — Jfc), which is destined to be zero — but its derivatives 
are absolutely needed in the equations! At the solution, g—k and L=f and 
dL/dk - X. 

What is important is f x — Xg x and f y = Xg y , coming from L x = L y = 0. In words: The 
constraint g—k forces dg = g x dx + gydy — 0. This restricts the movements dx and dy . 
They must keep to the curve. The equations say that df = fjix + f y dy is equal to kdg. 
Thus df is zero in the allowed direction — which is the key point. 


MAXIMUM AND MINIMUM WITH TWO CONSTRAINTS 


The whole subject of min(max)imization is called optimization , Its applications to 
business decisions make up operations research. The special case of linear functions 
is always important' — in this part of mathematics it is called linear programming. A 
book about those subjects won’t fit inside a calculus book, but we can take one more 
step — to allow a second constraint. 

The function to minimize or maximize is now /(x, y, z). The constraints are 
g(x , y, z) = and A(x, y, z) = k 2 > The multipliers are Aj and A 2 . We need at least three 
variables x, y, z because two constraints would completely determine x and y. 


13P To minimize /[x, y s z) subject to g(x> y y z)=k l and h(x, y, z) — k 2 » solve five 
equations for x, y, z, A t , A 2 . Combine g — and h — k 2 with 


W-i d ± ®-i d A+i d l \ 

dx ‘ax 2 dx dy l dy 2 dy 


dz 






(5) 


Figure 13.22a shows the geometry behind these equations. For convenience / is 
x 2 + y 1 + z 2 s so we are minimizing distance (squared). The constraints g = x + y + z = 
9 and h = x + 2y + 3z = 20 are linear — their graphs are planes. The constraints keep 
(x, y, z) on both planes — and therefore on the line where they meet. We are finding 
the squared distance from (0, 0, 0) to a line. 

What equation do we solve? The level surfaces x 2 + y 1 + z 2 = c are spheres. They 
grow as c increases. The first sphere to touch the line is tangent to it. That touching 
point gives the solution (the smallest c). All three vectors grad / grad g , grad h are 
perpendicular to the line : 

line tangent to sphere grad/ perpendicular to line 

line in both planes => grad g and grad h perpendicular to line. 
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Thus grad /, grad g, grad h are in the same plane — perpendicular to the line. With 
three vectors in a plane, grad / is a combination of grad g and grad h : 

</*. /y> /z) = *l(gx, £y> gz ) + A 2 (Jl x , V (6) 

This is the key equation (5). It applies to curved surfaces as well as planes. 

EXAMPLE 3 Minimire x 2 + y 2 + z 2 when x 4- y + z - 9 and x + 2y + 3z = 20. 

In Figure 13.22b, the normals to those planes are gradg = (l, 1, 1) and gradlj = 
(1, 2, 3). The gradient of/— x 2 + y 2 + z 2 is (2x, 2 y, 2z). The equations (5)-(6) are 

2x = A 1 + A. 1 > 2y = A 1 + 2A 2 , 2 z = x 1 + 32 2 . 

Substitute these x, y, z into the other two equations g=x+y+z=9 and h = 20: 

A 2 + A 2 Aj + 2A 2 + 3A 2 a , Aj + A 2 Aj + 2A 2 Aj + 3A 2 _ 

2 2 2 2 2 2 

After multiplying by 2, these simplify to 3A 2 + 6A 2 = 18 and 6A* + 14A 2 = 40. The 
solutions are A 1 = 2andA 2 = : 2. Now the previous equations give (x, y, z) = (2, 3, 4). 

The Lagrange function with two constraints is L(x, y, z, A i>A 2 ) = 
/— A/g — Jki) — A 2 (/i — fc 2 ). Its five derivatives are zero — those are our five equations. 
Lagrange has increased the number of unknowns from 3 to 5, by adding Ai and A 2 . 
The best point (2, 3, 4) gives / min = 29. The A’s give df/dk — the sensitivity to changes 
in 9 and 20. 



13.22 Perpendicular vector grad / is a combination q grad g + A 2 grad h . 


INEQUALITY CONSTRAINTS 

In practice, applications involve inequalities as well as equations. The constraints 
might be g ^ and h ^ 0. The first means: It is not required to use the whole resource 
k , but you cannot use more. The second means: measures a quantity that cannot 
be negative. At the minimum pointy the multipliers must satisfy the same inequalities'. 

^ 0 and A 2 ^0. There are inequalities on the A’s when there are inequalities in the 
constraints. 

Brief reasoning: With g ^ k the minimum can be on or inside the constraint curve. 
Inside the curve, where g < fc, we are free to move in all directions. The constraint is 
not really constraining. This brings back/,-0 and/ > = 0 and A = 0 — an ordinary 
minimum. On the curve, where g = k constrains the minimum from going lower, we 
have A < 0. We don’t know in advance which to expect. 
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For 100 constraints g i ^k ii there are 100 AY Some A*s are zero (when g i < k t ) and 
some are nonzero (when g t = k t ). It is those 2 100 possibilities that make optimization 
interesting. In Unear programming with two variables, the constraints axe x ^ 0, y ^ 0: 


EXAMPLE 4 Minimize /= 5x + 6 y with g — x + y — 4 and h = x ^ 0 and H = y ^ 0. 


The constraint g= 4 is an equation, h and H yield inequalities. Each has its own 
Lagrange multiplier — and the inequalities require A 2 ^ 0 and A 3 ^ 0. The derivatives 
of f g t h, H are no problem to compute: 


8/ , 3g . 3h . 3H 

+ + yieldS + ^ 


df_, dg 


dh 


3H 


— = + A 2 — + A 3 — yields 6 = A 1 + A 3 . 

oy cy ay dy 


( 7 ) 


Those equations make A 3 larger than A 2 . Therefore A 3 > 0, which means that the 
constraint on H must be an equation. (Inequality for the multiplier means equality 
for the constraint.) In other words H = y — 0. Then x + y= 4 leads to x — A . The 
solution is at y min ) = (4, 0), where / min = 20. 

At this minimum, h = x = 4 is above zero. The multiplier for the constraint h ^ 0 
must be A 2 = 0. Then the first equation gives X x = 5. As always, the multiplier mea- 
sures sensitivity. When g= 4 is increased by Afc, the cost / min = 20 is increased by 
5AJc. In economics X x = 5 is called a shadow price — if is the cost of increasing the 
constraint . 

Behind this example is a nice problem in geometry. The constraint curve x + y-4 
is a line. The inequalities x ^ 0 and y ^ 0 leave a piece of that line — from P to Q in 
Figure 13.23. The level curves /=5x + 6y=c move out as c increases, until they 
touch the line. The first touching point is Q = (4, 0), which is the solution. It is always 
an endpoint — or a corner of the triangle PQR. It gives the smallest cost / minJ which 
is c — 20. 



13.7 EXERCISES 


Read-through questions 

A restriction g(x t y) = k is called a q The minimizing 
equations for/(x, y ) subject to g = k are b The number 
A is the Lagrange c . Geometrically, grad / is a to 
grad g at the minimum. That is because the e curve / = 


/mm is * to the constraint curve g — k. The number A 
turns out to be the derivative of fl with respect to h 
The Lagrange function is L = i and the three equations 
for x, y, A are i and * and > 
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To minimize f=x 2 — y subject to g = x — y = 0, the three 
equations for x, y , A are m . The solution is n . In this 

example the curve f[x, 3?) = / min = o is a d which is 
q to the line g = 0 at (x ffllll , y m J. 

With two constraints g(x t y, z) = k x and h(x, y, z) = k 2 there 
are r multipliers. The five unknowns arc The five 

equations are i . The level surface f=f mio is u to the 
curve where g = ki and h = k 2 . Then grad /is v to this 
curve, and so are grad g and * . Thus_x_ is a combina- 
tion of gradg and v . With nine variables and six con- 
straints, there will be * multipliers and eventually A 
equations. If a constraint is an B g^K then its multiplier 
must satisfy A < 0 at a minimum. 

1 Example 1 minimized /=x 2 + y 2 subject to 2 x + y = k> 

Solve the constraint equation for y = k — 2x t substitute into 
/ and minimize this function of x. The minimum is at (x, y) = 
, where / = . 

Note: This direct approach reduces to one unknown x, 
Lagrange increases to x t y f A. But Lagrange is belter when the 
first step of solving for y is difficult or impossible. 

Minimize and maximize /(x, y) in 2-6. Find x, y, and A, 

2 f— x 2 y with g = x 1 + y 2 = 1 

3 /= x + y with g = - ■+ - = 1 

x y 

4 /= 3x + y with g = x 2 + 9y 2 — 1 

5 /= x 2 -f y 1 with g = x 6 + y 6 = 2, 

6 /= x + y with g = x 1/3 y 2/3 = k . With x = capital and y = 
labor, g is a Cobb-Douglas function in economics. Draw two 
of its level curves, 

7 Find the point on the circle x 2 -f y 2 = 13 where /= 2x — 3y 
is a maximum. Explain the answer. 

8 Maximize ax + by + cz subject to x 2 + y 2 + z 2 = Jt 2 . Write 

your answer as the Schwarz inequality for dot products: 
(a, b> c ) - (x, y, z) ^ fc. 

9 Find the plane z = ax + by + c that best fits the points 
(x, y T z) = (0, 0, I), (1, 0, 0), (1, 1, 2), (0, 1, 2), The answer a f b, c 
minimizes the sum of (z — ax — by — c ) 2 at the four points. 

10 The base of a triangle is the top of a rectangle (5 sides, 
combined area = 1). What dimensions minimize the distance 
around? 

11 Draw the hyperbola xy= - 1 touching the circle g = 
x 2 -by 2 = 2. The minimum of /— xy on the circle is reached 

at the points , The equations f x = Xg x and f y = Xg v 

are satisfied at those points with A = . 

12 Find the maximum of f=xy on the circle g = x 2 + y 2 = 2 
by solving/* = Xg x and f y = Xg y and substituting x and y into 
/ Draw the level curve / = that touches the circle. 


13 Draw the level curves of /= x 2 + y 2 with a dosed curve C 
across them to represent g(x, y) — k. Mark a point where C 
crosses a level curve. Why is that point not a minimum of / 
on C? Mark a point where C is tangent to a level curve. Is 
that the minimum of /on C? 

14 On the circle g = x 2 + y z — 1, Example 5 of 13.6 mini- 
mized /= xy — x — y. (a) Set up the three Lagrange equations 

for x, y, X. {b) The first two equations give x = y = . 

(c) There is another solution for the special value A= — 
when the equations become . This is easy to miss 

but it gives / mill = — 1 at the point . 

Problems 15-18 develop the theory or Lagrange multipliers. 

15 (Sensitivity) Certainly L = /— A(g - fc) has dLjdk = A. 
Since L = / min and g = k at the minimum point, this seems to 
prove the key formula df Bi Jdk = X. But x min , y mlD , X, and/ mln 
all change with k. We need the total derivative of Ux, y, A, k): 

dL dL dx dL dy dL dX dL dk 

dk = dite + tydii + l)Xdk + ~dkdk' 

Equation (1) at the minim um point should now yield the 
sensitivity formula df mi Jdk = A. 

16 (Theory behind A) When g(x, y) = fc is solved for y, it 
gives a curve y = /t(x). Then minimizing J[x, y) along this 
curve yields 

dx dy dx * dx dy dx 

Those come from the rule: dfjdx = 0 at the mini- 
mum and dgjdx — 0 along the curve because g = . 

Multiplying the second equation by A = (dfldy)j(dgldy) and 

subtracting from the first gives — 0. Also dfjdy — 

Xdgjdy . These are the equations (I) for x t y, A. 

17 ( Example of failure) X = f y jg y breaks down if g y = 0 at the 
minimum point. 

(a) g = x 2 — y 3 = 0 does not allow negative y because 


(b) When g - 0 the minimum of /= x 2 + y is at the point 


(c) At that point f y = Xg y becomes which is 

impossible. 

(d) Draw the pointed curve g = 0 to see why it is not tan- 
gent to a level curve of / 

18 (No maximum) Find a point on the line g=x+y=l 
where J[x t y) = 2x + y is greater tban 100 (or 1000). Write out 
grad /— A grad g to see that there is no solution. 

19 Find the minimum of /= x 2 + 2y 2 + z 2 if (x, y, z) is 
restricted to the planes g = x + y + z = 0 and h — x — z — 1. 

20 (a) Find by Lagrange multipliers the volume V — xyz of 
the largest box with sides adding up to x + y + z = k. (b) 
Check that A = dV mai fdk. (c) United Airlines accepts baggage 
with x + y + z— 108". If it changes to 111"* approximately 
how much (by A>) and exactly how much does increase? 
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21 The planes x = 0 and y = 0 intersect in the line x = y ~ 0, 

which is the z axis. Write down a vector perpendicular to the 
plane x = 0 and a vector perpendicular to the plane y = 0. 
Find k Y times the first vector plus k 2 times the second. This 
combination is perpendicular to the line . 

22 Minimize /= x 2 + y 2 4- z 2 on the plane ax + by + cz = d — 
one constraint and o ne multiplier Compare / min with the 
distance formula \d\jsja 2 + b 2 + c 2 in Section 11.2, 

23 At the absolute minimum of J\x, y), the derivatives 

are zero. If this point happens to fall on the curve 

g(x, y) = then the equations f x = kg x and f y = Xg y hold with 
; = . 

Problems 24-33 allow inequality constraints, optional but good. 

24 Find the minimum off = lx + 5 y with the constraints g — 
x + 2y = 4 and h — x 2 0 and H = y > 0, using equations like 
(7). Which multiplier is zero? 

25 Figure 13.23 shows the constraint plane g = x + y + z = 1 
chopped off by the inequalities x^O, y >0, z^O. What are 
the three “endpoints” of this triangle? Find the minimum and 
maximum of /= 4x — 2y + 5z on the triangle, by testing / at 
the endpoints. 

26 With an inequality constraint the multiplier at the 
minimum satisfies A < 0. If k is increased, / mlrt goes down (since 
k = df mi Jdk). Explain the reasoning : By increasing fc, (more) 
(fewer) points satisfy the constraints. Therefore (more) (fewer) 
points are available to minimize f. Therefore / min goes (up) 
(down). 


27 With an inequality constraint g < k, the multiplier at a 
maximum point satisfies X > 0. Change the reasoning in 26. 

28 When the constraint h^k is a strict inequality h > k at 

the minimum, the multiplier is k = 0. Explain the reasoning'. 
For a small increase in Jc, the same minimizer is still available 
(since h > fc leaves room to move). Therefore f min is 
(changed) (unchanged), and k = df^Jdk is . 

29 Minimize /— x 2 + y 2 subject to the inequality constraint 

x + y<4. The minimum is obviously at , where f x 

and f y are zero. The multiplier is k = . A small 

change from 4 will leave f min = so the sensitivity 

dftnialdk still equals A, 

30 Minimize /= x 2 + y 2 subject to the inequality constraint 

x +y ^4. Now the minimum is at and the multi- 
plier is k = and f mix} = . A small change to 

4 + dk changes f mm by what multiple of dkl 

31 Minimize/ = 5x -f 6y with g = x + y = 4 and h = x ^ 0 and 

H — Now A 3 <0 and the sign change destroys 

Example 4. Show that equation (7) has no solution and 
choose x, y to make 5x + 6y < — 1000. 

32 Minimize / = 2x + 3y + 4z subject to £=x+y+z= 1 and 
x, y, z ^ 0. These constraints have multipliers k 2 ^0, A 3 > 0, 

a 4 ^0. The equations are 2 = A] +x 2 , , and 4 = 

k : + A 4 . Explain why A 3 > 0 and A 4 > 0 and / mjn - 2. 

33 A wire 40" long is used to enclose one or two squares 
(side x and side y). Maximize the total area x 2 +■ y 2 subject to 
x > 0, y > 0, 4x -h 4y = 40. 
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CHAPTER 14 


Multiple Integrals 


14.1 Double Integrals 


This chapter shows how to integrate functions of two or more variables. First, a 
double integral is defined as the limit of sums. Second, we find a fast way to compute 
it. The key idea is to replace a double integral by two ordinary “single” integrals . 

The double integral {j^x, y)dy dx starts with y)dy. For each fixed x we integ- 
rate with respect to y. The answer depends on x. Now integrate again, this time with 
respect to x. The limits of integration need care and attention! Frequently those limits 
on y and x are the hardest part. 

Why bother with sums and limits in the first place? Two reasons. There has to be 
a definition and a computation to fall back on, when the single integrals are difficult 
or impossible. And also — this we emphasize — multiple integrals represent more than 
area and volume . Those words and the pictures that go with them are the easiest to 
understand. You can almost see the volume as a “sum of slices” or a “double sum of 
thin sticks.” The true applications are mostly to other things, but the central idea is 
always the same: Add up small pieces and take limits . 

We begin with the area of R and the volume of V, by double integrals. 

A LIMIT OF SUMS 


The graph of z -J{x y y) is a curved surface above the xy plane. At the point (x, y) in 
the plane, the height of the surface is 2 . (The surface is above the xy plane only when 
z is positive. Volumes below the plane come with minus signs, like areas below the 
x axis.) We begin by choosing a positive function — for example 1 — 1 + x 2 + y 2 . 

The base of our solid is a region R in the xy plane. That region will be chopped 
into small rectangles (sides Ax and Ay). When R itself is the rectangle O^x ^ 1, 
0 ^ y ^ 2, the small pieces fit perfectly. For a triangle or a circle, the rectangles miss 
part of R . But they do fit in the limit, and any region with a piecewise smooth 
boundary will be acceptable. 

Question What is the volume above R and below the graph of z -f(x y y)l 
Answer It is a double integral — the integral of f{x, y) over R . To reach it we begin 
with a sum, as suggested by Figure 14.1. 
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Fig. 14.1 Base R cut into small pieces A/1. Solid V cut into thin sticks AV = zAA. 


For single integrals, the interval [a, b~\ is divided into short pieces of length Ax. 
For double integrals, ft is divided into small rectangles of area AA = (Ax)(Ay). Above 
the ft h rectangle is a "thin stick 11 with small volume. That volume is the base area 
AA times the height above it — except that this height z = /(x, >■} varies from point to 
point. Therefore we select a point (x ; , y,) in the tih rectangle, and compute the volume 
from the height above that point: 

volume of one stick =fix i> y\)AA volume of all sticks = ^/(x f , y,)A/L 

This is the crucial step for any integral — to see it as a sum of small pieces. 

Now take limits: Ax -* 0 and Ay -*■ 0. The height z = /{x, y) is nearly constant over 
each rectangle. (We assume that /is a continuous function.) The sum approaches a 
limit, which depends only on the base R and the surface above it. The limit is the 
volume of the solid, and it is the double integral of /(x, y) over R : 

j\x.y)dA = lim £/(*;- {!) 
.Jr a* - o 

To repeat: The limit is the same for all choices of the rectangles and the points (x^ yf 
The rectangles will not fit exactly into ft, if that base area is curved. The heights are 
not exact, if the surface z =f(x y y ) is also curved. But the errors on the sides and top, 
where the pieces don't fit and the heights arc wrong, approach zero. Those errors are 
the volume of the "icing” around the solid, which gets thinner as Ax -► 0 and Ay -► 0. 
A careful proof takes more space than we are willing to give. But the properties of 
the integral need and deserve attention: 

1. Linearity: jj(/ + g)dA = j[ fdA + j$g dA 

2. Constant comes outside: jj c/(x, y)dA = c /(x, y)dA 

3. ft splits into S and Tfnot overlapping): jj fdA = jj fdA + JJ/ dA. 

R S T 

In 1 the volume under /+ g has two parts. The "thin sticks” of height/ 4- g split into 
thin sticks under/ and under g. In 2 the whole volume is stretched upward by c\ In 
3 the volumes are side by side. As with single integrals, these properties help in 
computations. 

By writing dA , we allow shapes other than rectangles. Polar coordinates have an 
extra factor r in dA = r dr dO. By writing dx dy\ we choose rectangular coordinates 
and prepare for the splitting that comes now. 
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SPLITTING A DOUBLE INTEGRAL INTO TWO SINGLE INTEGRALS 


The double integral JJ f(x,y)dydx will now be reduced to single integrals in y and 
then x. (Or vice versa. Our first integral could equally well be J/(x, y)dx.) Chapter 8 
described the same idea for solids of revolution. First came the area of a slice, which 
is a single integral. Then came a second integral to add up the slices. For solids 
formed by revolving a curve, all slices are circular disks — now we expect other shapes. 

Figure 14.2 shows a slice of area 4(x). It cuts through the solid at a fixed value of 
x. The cut starts at y = c on one side of R, and ends at y = d on the other side. This 
particular example goes from y = 0 to y = 2 (R is a rectangle). The area of a slice is 
the y integral of /(x, y). Remember that x is fixed and y goes from c to d: 


A(x) = area of slice = 


/(*> y)dy 


(the answer is a function of x). 


EXAMPLE 1 A 


c 2 r v 3 >= 2 

= I (1 +x 2 + y 2 )dy= y + x 2 y+ y - 
Jy = o L * Jy = o 


2 + 2x 2 + 


This is the reverse of a partial derivative! The integral of x 2 dy, with x constant, is 
x 2 y. This “partial integral” is actually called an inner integral. After substituting the 
limits y-2 and y = 0 and subtracting, we have the area /4(x) = 2 + 2x 2 + f . Now the 
outer integral adds slices to find the volume J /l(x) dx. The answer is a number : 

volume ' I ..( 2 + ^ + s)'*’ h + T + 5 1 ! ■ 2 + M ' T 




Fig. 14.2 A slice of V at a fixed x has area A(x) — \ fix, y)dy. 

To complete this example, check the volume when the x integral comes first: 
inner integral = J (1 + x 2 + y 2 )dx = jjx + ^x 3 + y 2 xj = ^ + y 2 

. r /4, ,V R , 1 3 ~\ y=2 8 8 16 

outer integral = \ ^ [y + >’ fiv = |_F ' + F | =0 = 3 + 3 = T 
The fact that double integrals can be split into single integrals is Fubinfs Theorem. 
14A If f(x, y) is continuous on the rectangle R , then 

jo-ftr fix, dx = m: /(x, yMxJ dy. (2) 
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The inner integrals are the cross-sectional areas -4{x) and a(y) of the slices. The outer 
integrals add up the volumes A(x)dx and a(y)dy. Notice the reversing of limits. 

Normally the brackets in (2) are omitted. When the y integral is first, dy is written 
inside dx. The limits on y are inside too . I strongly recommend that you compute the 
inner integral on one line and the outer integral on a separate line. 


EXAMPLE 2 Find the volume below the plane z = x — 2y and above the base 
triangle R. 


The triangle R has sides on the x and y axes and the line x + y — 1. The strips in the 
y direction have varying lengths. (So do the strips in the x direction.) This is the main 
point of the example — the base is not a rectangle. The upper limit on the inner 
integral changes as x changes. The top of the triangle is at y — 1 — x. 

Figure 14,3 shows the strips. The region should always be drawn (except for 
rectangles). Without a figure the limits are hard to find, A sketch of R makes it easy: 

y goes from c = 0 to d = 1 — x. Then x goes from a — 0 to h = 1 . 


The inner integral has variable limits and the outer integral has constant limits : 

r: j (x-2y)rfy=[jcy-y 2 ]*Io 1 = x{l - x)-(l - x) 2 = - 1 + 3x - 2x 2 
Jy=o 


inner: 


outer 


: f ( — 1 T 3 hX — 2 x 2 )dx — - 

L 


3 , 2 % T ,32 1 


X + rX — rX 


o 1 + 2 3 6 


The volume is negative. Most of the solid is below the xy plane. To check the answer 
— do the x integral first: x goes from 0 to 1 — y. Then y goes from 0 ft? 1. 


inner: 


outer 


fl l 1 ~y t 

: L„ |X_W *‘L2* Ho = 2 0 ~ 


y) 2 - 2( 1 — y)y =T — 3y + ^ y 2 


2 r "V 


1 _ j. _ 3 5 _ 

n _ 2 2 + 6 - 6' 


Same answer, very probably right. The next example computes jj 1 dx dy = area of R. 


n ri-y 

dy dx and also dx dy. 

Jy = 0 J x = 0 

The first has vertical strips. The inner integral equals 1 - x. Then the outer integral 
(of 1 — x) has limits 0 and 1, and the area is j. It is like an indefinite integral inside 
a definite integral. 


EXAMPLE 3 The area of 


U 1 -X 

y = 0 




Fig. 14.3 Thin sticks above and below (Example 2). Reversed order (Examples 3 and 4). 
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EXAMPLE 4 


Reverse the order of integration in 


v = 0 J 


x 3 dy dx . 


Solution Draw a figure! The inner integral goes from the parabola y = x 2 up to 
the straight line y — 2x. This gives vertical strips. The strips sit side by side between 
x = 0 and x — 2. They stop where 2x equals x 2 , and the line meets the parabola* 
The problem is to put the x integral first. It goes along horizontal strips. On each 
line y = constant, we need the entry value of x and the exit value of x. From the figure, 
x goes from to yfy\ Those are the inner limits* Pay attention also to the outer 
limits, because they now apply to y , The region starts at y=0 and ends at y = 4. 
No change in the integrand x 3 — that is the height of the solid: 


'* 2 


f4 | 


x 2 dy dx is reversed to 

J x = 0 * 


Jy-n J 


x 3 dx dy. 


(3) 


EXAMPLE 5 Find the volume bounded by the planes x = 0 ? y = 0, z = 0, and 
2x +- y + z = 4. 


Solution The solid is a tetrahedron (four sides). It goes from z = 0 (the xy plane) up 
to the plane 2x + y + z = 4. On that plane z = 4 - 2x - y, This is the height function 
fix, y) to be integrated. 

Figure 14.4 shows the base R. To find its sides, set z = 0* The sides of R are the 
lines x = 0 and y = 0 and 2x T y = 4. Taking vertical strips, dy is inner: 



f4-2* 1 

(4-2x)y~\y 2 

inner: 

II 

1 

K 

1 

-rt 


o 

ii 

l 


1(4 -2x) 3 


outer: 


1(4-2 x) 2 dx = 

x - 0 ^ 


{4 - 2x) 3 
2-3-2 


2-3-2 


16 

T‘ 


Question What is the meaning of the inner integral 1(4 — 2x) 2 ^and also 

Answer The first is A(x), the area of the slice, -y is the solid volume. 

Question What if the inner integral j/(x, y)dy has limits that depend on y? 
Answer It can't. Those limits must be wrong. Find them again. 



Fig. 14.4 Tetrahedron in Example 5, semicircle in Example 6, triangle in Example 7. 


EXAMPLE 6 Find the mass in a semicircle 0 ^ y ^ yj\ — x 2 if the density is p = y* 

This is a new application of double integrals. The total mass is a sum of small masses 
(p times A A) in rectangles of area A A. The rectangles don’t fit perfectly inside the 
semicircle R, and the density is not constant in each rectangle — but those problems 
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disappear in the limit. We are left with a double integral: 


totai mass M = 




p{x, y) dx dy\ 


(4) 


Set p — y. Figure 14.4 shows the limits on x and y (try both dy dx and dx dy): 


mass M = 



y dy dx 


and also 


M = 



y dx dy. 


The first inner integral is \y 2 . Substituting the limits gives ^{1 - x 2 ). The outer integral 
of 4(1 - x 2 ) yields the total mass M = j. 

The second inner integral is xy. Substituting the limits on x gives . Then 

the outer integral is — 1( 1 — y 2 )* 2 . Substituting y = 1 and y = 0 yields M = > 


Remark This same calculation also produces the moment around the x axis, when 
the density is p= 1. The factor y is the distance to the x axis. The moment is = 
|| y dA = j. Dividing by the area of the semicircle (which is nj7) locates the centroid: 
x = 0 by symmetry and 


y = height of centroid = 


moment 

area 


2/3 

tt/2 


4 


3ji 


(5) 


This is the "average height" of points inside the semicircle, found earlier in 8,5. 


EXAMPLE 7 Integrate |^ j* Z 1 cos x 2 dx dy avoiding the impossible | cos x 2 dx. 

This is a famous example where reversing the order makes the calculation possible. 
The base R is the triangle in Figure 14.4 (note that x goes from y to 1). In the opposite 
order y goes from 0 to x. Then | cos x 2 dy = x cos x 3 contains the factor x that we 
need: 

outer integral: | x cos x 2 dx = |~i sin x 2 J* = j sin L 


14.1 EXERCISES 


Read-th rough questions 

The double integral J|*/(x, y)d/t gives the volume between R 
and a The base is first cut into small b of area A A. 
The volume above the ith piece is approximately c . The 
limit of the sum d is the volume integral. Three properties 
of double integrals are s (linearity) and > and 
_fl ■ 

If R is the rectangle 0 ^ x ^ 4, 4 ^ y ^ 6, the integral |Jx dA 
can be computed two ways. One is \jxdydx, when the 
inner integral is h ^ = i The outer integral gives 

J "I* = k When the x integral comes first it equals 

| x dx — 1 = m Then the v integral equals 

n This is the volume between o [describe V). 

The area of R is |j_E_dydx. When R is the triangle 
between x = 0, y = 2x, and y= 1, the inner limits on y are 
q This is the length of a r strip. The (outer) limits 
on x are * The area is t In the opposite order, the 


(inner) limits on x are u , Now the strip is v and the 

outer integral is w When the density is p(x. y). the total 

mass in the region R is J| x The moments are M y = 
v and M x = 1 The centroid has x — MfM. 


Compute the double integrals 1-4 by two integrations. 

f 1 f 1 

1 x 2 dx dy and j j y J dx dy 

J - 0 J ,i - 0 

p* 


2xv dx d\ and 


y~ 2 

' JI: 2 

0 

r j ri 


X - \ 


sin(x + y) dx dy and 


I' f ■’ 
r j" 

Jr 2 jx- 1 

2 P dy d; 
1 Jo 


dx dy-'xy 

dx : (x + vV 


' j ri r$ 

ve xy dx dy and dy h\ v 3 

u J 1 J ■ i J 0 


hx. v 3 + 2x + v 



14.2 Chang* to Bettor Coordinate* 
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Id 5-10, draw tbe region and compote tbe area. 


30 Find the limits and the area under > = t - x 2 : 


5 

7 

9 



6 



I 


(1 — x 3 ) dx and 


II 


1 dx dy (reversed from 29). 


31 A city inside the circle x 2 + y 2 = 100 has population den- 
sity p(jt, y) = 10(100 — x 2 — y 2 ). Integrate to find its pop- 
ulation. 


32 Find the volume bounded by the planes x = 0, y = 0 T 
z = 0, and ax + by + cz = 1. 


In 11-16 reverse the order of integration (and find the new 
limits) in 5-10 respectively. 

In 17-24 find tbe limits on JJ dy dx and Jjdx dy. Draw J? and 
compute its area. 

17 R — triangle inside the lines x = 0,y — l,y = 2x. 

18 R = triangle inside the lines x = — 1, y = 0, x + y = 0 t 

19 R = triangle inside the lines y = x, y = — x, y = 3, 

20 R — triangle inside the lines y = x T y — 2x, y = 4. 

21 R = triangle with vertices (0, 0), (4, 4), (4, 8), 

22 R — triangle with vertices (0, 0), (—2, — 1),{1* —2). 

23 R = triangle with vertices (0, 0), (2, 0), (1, b). Here b > 0. 

*24 R — triangle with vertices (0 T 0), (a, b), (c, d). The sides are 

y = bxja, y = dx/c t and y = b + (x — ct){d — b)j(c — a). Find 
A = JJ dy dx when 0<a<c,Q<d<b. 

25 Evaluate | J d 2 fjdxdy dy. 

26 Evaluate j* j* df/dxdxdy. 


In 27-28, divide the unit square R into triangles $ and T and 
verify SS R fdA = ffsfdA + fj T fdA. 

27 J «x,y) = 2x-3y+ 1 28 f(x, y) = xe> - ye* 

29 The area under y = f(x) is a single integral from a to b or 
a double integral {find the limits): 

|* /(*)*- 1| Wy dx. 


In 33-34 the rectangle with corners (l t 1), (1,3), (2, 1), (2, 3) has 
density p(x, y) = x 2 . The moments are h4 y — \\xpdA and 
M* = ||.vp dA. 

33 Find the mass, 34 Find the center of mass. 

In 35-36 tbe region is a circular wedge of radiis 1 between the 
lines y = x and y = — x. 

35 Find the area, 36 Find the centroid (x, y)> 

37 Write a program to compute JoJo/fa> y)d* dy by the mid- 
point rule ( midpoints of n 2 small squares). Which fix,y) are 
integrated exactly by your program? 

38 Apply the midpoint code to integrate x l and xy and y 2 . 
The errors decrease like what power of Ax = Ay = 1/n? 

Use the program to compute the volume under /(x, y) in 39-42. 
Check by integrating exactly or doubling n. 

39 fix, y) = 3x + 4y + 5 40 fix , y) = \jjx 2 + y 1 

41 fix, y) = x y 42 fix, y) = e* sin try 

43 In which order is JJ x y dx dy = JJ x y dy dx easier to integ- 
rate over the square 0<x^l,0<y< l?By reversing order, 
integrate (x— l)/lnx from 0 to 1 — its antiderivative is 
unknown. 

44 Explain in your own words the definition of the 
double integral of fix, y) over the region I?. 

45 Y, A might not approach JJ y dA if we only know that 

A>1 -*0. In the square 0<x, y ^ I, take rectangles of sides 
Ax and 1 (not Ax and Ay). If (x lf y t ) is a point in the rectangle 
where yj= 1, then A A =, ♦ But JJ y dA = 



14.2 Change to Better Coordinates 



You don’t go far with double integrals before wanting to change variables. Many 
regions simply do not fit with the x and y axes. Two examples are in Figure 14.5, 
a tilted square and a ring. Those are excellent shapes — in the right coordinates. 
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We have to be able to answer basic questions like these: 


Find the area dA and moment xdA and moment of inertia x 2 dA . 


The problem is: What is dAl Wc are leaving the xy variables where dA = dx dy. 

The reason for changing is this: The limits of integration in the y direction are 
miserable, I don't know them and I don't want to know them. For every x we would 
need the entry point P of the line x = constant, and the exit point Q. The heights of 
P and Q are the limits on jdy, the inner integral. The geometry of the square and 
ring are totally missed, if we stick rigidly to x and y. 




Fig, 14.5 Unit square turned through angle y. Ring with radii 4 and 5. 


Which coordinates are better? Any sensible person agrees that the area of the tilted 
square is 1. "Just turn it and the area is obvious?' But that sensible person may not 
know the moment or the center of gravity or the moment of inertia. So we actually 
have to do the turning. 

The new coordinates u and v are in Figure 14.6a. The limits of integration on l arc 
0 and 1. So are the limits on u. But when you change variables, you don't just change 
limit a. Two other changes come with new variables: 

1. The small area dA = dx dy becomes dA = du di\ 

2. The integral of x becomes the integral of 

Substituting u = v .v in a single integral, we make the same changes. Limits x = 0 and 
x = 4 become u — 0 and u = 2. Since x is tr, dx is 2u du. The purpose of the change 
is to find an antiderivative. For double integrals, the usual purpose is to improve the 
limits — but wc have to accept the whole package. 

To turn the square, there are formulas connecting x and y to u and r The geometry 
is clear — rotate axes by y . — but it has to be converted into algebra: 

u = x cos y. + v sin y x = u cos i — v sin y. 

and in reverse ( 1 J 

v = - x sin y. + y cos y. y = a sin y + t: cos x. 

Figure 14.6 shows the rotation. As points move, the whole square turns. A good way 
to remember equation (1) is to follow the corners as they become (1,0) and (0, 1), 

The change from jj x dA to J| dude is partly decided by equation (l). It 

gives x as a function of u and i. We also need dA. For a pure rotation the first guess 
is correct: The area dx dy equals the area dudv, For most changes of variable this is 
false . The general formula for dA comes after the examples. 


14.2 Change to Belter Coordinates 
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Fig. Id, 6 Change of coordinates- axes turned by 3. For rotation dA is du dv. 


EXAMPLE 1 Find jj dA and jj x dA and x and also jj x 2 dA for the tilted square. 

Solution The area of the square is da dr = 1. Notice the good limits. Then 

jj_\ dA = {o cos a — v s ' n dv = \ cos 7 — -} sin z, (2) 

This is the moment around the y axis. The factors \ come from \u 2 and jv 1 . The x 
coordinate of the center of gravity is 




dA = (f cos y. 


-J- sin 7 ) 1 . 


Similarly the integral of y leads to y. The answer is no mystery — the point {x, y) is 
at the center of the square! Substituting x = u cos z — v sin 7 made x dA look worse, 
but the limits 0 and 1 are much better. 

The moment of inertia I y around the y axis is also simplified: 


re r j r i 

x 2 dA= I (u 

J J J o J o 


cos ol - v sin z) 2 du dv = 


cos 2 7 


cos x sin a 


(3) 


You know this next fact but I will write it anyway: The answers don't contain u or v. 
Those are dummy variables like x and y. The answers do contain 7 , because the 
square has turned, (The area is fixed at 1.) The moment of inertia I x = jj y 2 dA is the 
same as equation (3) but w r ith all plus signs. 

Question The sum I x + / v simplifies to j (a constant). Why no dependence on 7? 
Answer I x + / equals 7 0 . This moment of inertia around (0, 0) is unchanged by 
rotation. We are turning the square around one of its corners. 


CHANGE TO POLAR COORDINATES 


The next change is to r and 0. A small area becomes dA = r dr dO (definitely not dr dO f 
Area always comes from multiplying two lengths, and dO is not a length. Figure 14.7 
shows the crucial region — a “polar rectangle' 1 cut out by rays and circles. Its area 
A A is found in two ways, both leading to r dr d0\ 


{Approximate) The straight sides have length Ar. The circular arcs are 
dose to rAO. The angles are 90''. So A A is close to (Ar)(rAfJ). 

(Exact) A wedge has area \r 2 A0. The difference between wedges is A A : 



Ar 

i 


A0 = rAr AO. 
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The exact method places r dead center (see figure), The approximation says; Forget 
the change in rA0 as you move outward, Keep only the first -order terms. 

A third method is coming, which requires no picture and no geometry. Calculus 
always has a third method! The change of variables x = r cos 9, y = r sin 9 will go 
into a general formula for dA, and out will come the area r dr d6. 




Fig. 14.7 Ring and polar rectangle in xy and rfl, with stretching factor r = 4.5, 


EXAMPLE 2 Find the area and center of gravity of the ring. Also find jjx 2 dA. 

Solution The limits on r are 4 and 5. The limits on 6 a re 0 and 2n. Polar coordinates 
are perfect for a ring. Compared with limits like x = y/25 - y 2 , the change to rdrdO 
is a small price to pay; 

area = f | r dr d9= 2n:[ jr 2 !* - ?:5 2 - 7t4 2 = 9 n. 

0 4 J 

The 8 integral is 27t (full circle). Actually the ring is a giant polar rectangle. We could 
have used the exact formula r A r AO, with A 8 = 2k and Ar = 5 - 4. When the radius 
r is centered at 4.5, the product r ArAd is (4.5)(l)(2tt) = 9 jt as above. 

Since the ring is symmetric around {0, 0), the integral of x dA must be zero'. 

jj x dA = j" | (rcos (?)r dr d8 = [$r 3 ]^ [sin ft]*" = 0. 

Notice rcos 9 from x — the other r is from dA. The moment of inertia is 

f f x 2 dA = J f r 2 cos 2 £J r dr d9 = \ jr 4 l* f cos 2 f? d8 — j(5 4 - 4 4 )7t, 

R 0 4 L J O 

This 0 integral is tt not 2 tt, because the average of cos 2 f? is \ not 1. 

For reference here are the moments of inertia when the density is p{x, y)\ 

l y = jjx 2 p dA I x = jj y 2 p dA I 0 = ^r 2 pdA= polar moment = / x + f y . (4) 

EXAMPLE 3 Find masses and moments for semicircular plates; p - 1 and p = 1 — r. 

Solution The semicircles in Figure 14.8 have r= 1. The angle goes from 0 to ti 
( the upper half-circle). Polar coordinates are best. The mass is the integral of the 
density p: 

M = J \ r dr d8 = (i)(n) and M = | | (1 - r)r dr dd = (i)(Jt). 




14,2 Change to Better Coordinates 


The first mass ji/ 2 equals the area {because p = 1), The second mass nj 6 is smaller 
(because p < 1). Integrating p = 1 is the same as finding a volume when the height is 
r — 1 {part of a cylinder). Integrating p = 1 — r is the same as finding a volume when 
the height is z = 1 - r {part of a cone). Volumes of cones have the extra factor j. 

The center of gravity involves the moment M x = j"jyp dA. The distance from the 
x axis is y. the mass of a small piece is p dA , integrate to add mass times distance. 
Polar coordinates are still best, with y = r sin 6. Again p = 1 and p—\ — r. 


The height of the center of gravity is y = MJM = moment divided by mass: 


Fig. 14.6 Semicircles with density piled above them, 


Fig. 14.9 Bell-shaped curve. 


Question Compare y for p = 1 and p = other positive constants and p = 1 — r. 
Answer Any constant p gives y - 4/3jt. Since 1 - r is dense at r = 0, y drops to 1/n. 

Question How is y = 4/3jt related to the “average” of y in the semicircle? 

Answer They are identical. This is the point of y. Divide the integral by the area: 


The average value of a function is 


The integral of/is divided by the integral of 1 (the area). In one dimension J* r(x) dx 
was divided by J* 1 dx (the length b - a). That gave the average value of u(x) in 
Section 5.6, Equation (5) is the same idea for/(x, y). 


EXAMPLE 4 Compute A 


A is the area under a “bell-shaped curve” — see Figure 14.9. This is the most important 
definite integral in the study of probability. It is difficult because a factor 2x is not 
present. Integrating 2xe~ xi gives -e“*\ but integrating e~ xl is impossible — except 
approximately by a computer. How can we hope to show that A is exactly yji r? 

The trick is to go from an area integral A to a volume integral A 2 . This is unusual 
(and hard to like), but the end justifies the means: 


The double integrals cover the whole plane. The r 2 comes from x 2 + y 2 , and the 
key factor r appears in polar coordinates. It is now possible to substitute u = r 2 . 
The r integral is e~“du = %. The 0 integral is 2 ji. The double integral is (3)(2je). 
Therefore A 2 = n and the single integral is A = y/n. 


S’ 

r-v _ >2 

jr area - 
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EXAMPLE 5 Apply Example 4 to the “ normal distribution'' p(x) = e 

Section 8.4 discussed probability. It emphasized the importance of this particular p(x). 
At that time we could not verify that j p(x)dx = 1, Now we can: 


Question 

Answer 


x = yfiy yields 


/•co 




^ X= 7«J 


1 dy— 1. 


( 7 ) 


Why include the 2’s in p(jc)? The integral of e x *!^fn also equals 1. 
With the 2’s the “variance" is J x 2 p(x) dx = 1. This is a convenient number. 


CHANGE TO OTHER COORDINATES 


A third method was promised, to find r dr dO without a picture and without geometry. 
The method works directly from x — r cos 0 and y = r sin 6. It also finds the 1 in 
du dv, after a rotation of axes. Most important, this new method finds the factor J in 
the area dA — J du dv , for any change of variables. The change is from xy to uu. 

For single integrals, the “stretching factor " J between the original dx and the new 
du is (not surprisingly) the ratio dx/du . Where we have dx, we write (dxjdu)du. Where 
we have (dujdx)dx, we write du. That was the idea of substitutions — the main way 
to simplify integrals. 

For double integrals the stretching factor appears in the area: dxdy becomes 
| J\ du dv . The old and new variables are related by x — jc(u, v) and y = y(u, u). The point 
with coordinates u and v comes from the point with coordinates x and y. A whole 
region 5, full of points in the uv plane, comes from the region R full of corresponding 
points in the xy plane. A small piece with area \ J\ du dv comes from a small piece with 
area dx dy . The formula for J is a two-dimensional version of dxjdu. 


14B The stretching factor for area is the 2 by 2 Jacobian determinant J(u , v): 

dx/du dxjdv I 


J = 


dx dy ^ dx dy 
du dv dv du 


( 8 ) 


dyjdu dy/dv 

An integral over R in the xy plane becomes an integral over S in the uv plane: 


f(x, J?) dx dy = 


f[x(u, v), y(u, d)) |J| du dv. 


(9) 


The determinant J is often written d(x,y)/e)(u, u), as a reminder that this stretching 
factor is like dxjdu . We require J ^ 0. That keeps the stretching and shrinking under 
control. 

You naturally ask: Why take the absolute value \J\ in equation (9)? Good 
question — it wasn’t done for single integrals. The reason is in the limits of integration. 
The single integral dx is 1 ( —du ) after changing x to — u. We keep the minus sign 
and allow single integrals to run backward. Double integrals could too, but normally 
they go left to right and down to up. We use the absolute value \J\ and run forward. 


EXAMPLE 6 Polar coordinates have x = u cos u = r cos 8 and y = u sin v = r sin 6. 


With no geometry: 


J = 


dx/dr 

dxjdd 


cos 8 

— r sin 8 

dy/dr 

8yjd8 


sin 0 

r cos 8 


= r. 


( 10 ) 
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EXAMPLE 7 Find J for the linear change to x = au + hv and y = cu + dv. 


Ordinary determinant: 


dxidu 

Sxjcv ' 


a b 

cyjcu 

dyjcv 


c d 


UD 


Why make this simple change, in which a, h, c, d are all constant? It straightens 
parallelograms into squares (and rotates those squares). Figure 14.10 is typical. 

Common sense indicated J = 1 for pure rotation — no change in area. Now J = 1 
comes from equations (1) and (11), because ad — be is cos 2 2 + sin 2 a. 

' In practice, xy rectangles generally go into nr rectangles. The sides can be curved 
(as in polar rectangles) but the angles are often 90 u + The change is 11 orthogonal T The 
next example has angles that are not 90°, and J still gives the answer. 


(3,3) 



v 



Fig. 14.1 1 Curved areas are also 
dA = \J\du dv. 


EXAMPLE 8 Find the area of R in Figure 14.10. Also compute jj e x dx dy . 


Solution The figure shows x = and y = + 5 v ♦ The determinant is 


dx/du dx/dv 


2/3 

1/3 

dyjdu vyjdv 


1/3 

2/3 


The area of the xy parallelogram becomes an integral over the uv square: 

3 3 

dx dy = || \ J\du dv = { J 3 du dv = -y ■ 3 * 3 — 3. 
r s 00 


The square has area 9, the parallelogram has area 3. I don’t know if 7 = 3 is a 
stretching factor or a shrinking factor. The other integral jj e x dx dy is 



plu} 3 V; 3 


r du dv = 




u/3 


Jo 



\^ 2 ~ l)(e~ 1). 


Main point: The change to u and v makes the limits easy (just 0 and 3). 

Why is the stretching factor J a determinant ? With straight sides, this goes back to 
Section 1 1.3 on vectors. The area of a parallelogram is a determinant. Here the sides 
are curved, but that only produces (du) 2 and (dv) 2 , which we ignore. 

A change du gives one side of Figure 14.11 — it is (dx/duiT dyjdu\)du. Side 2 is 
(dxldv i + dyjc uj)di;. The curving comes from second derivatives. The area (the cross 
product of the sides) is \ J\du dv. 
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Final remark I can’t resist looking at the change in the reverse direction. Now the 
rectangle is in x y and the parallelogram is in uv. In all formulas, exchange x for u 
and y for v : 


new J = 


dufdx 

dvjdx 


du/dy 

dvjdy 


d(u , v) __ 1 

d(.x, y) old J 


(12) 


This is exactly like du(dx= 1 j(dxjdu). It is the derivative of the inverse function. 
The product of slopes is 1 — stretch out, shrink back- From x y to uv we have 2 by 2 
matrices, and the identity matrix / takes the place of 1 : 


dxdu . , \dxj3u dx/dv 

— — =1 becomes 

dadx dyjdu dyjdv 


du/dx 

dvjdx 



(13) 


The first row times the first column is (dx/3u)(dujdx) + (dxjdv)(dvjdx)— dxjdx = 1, 
The first row times the second column is (dxjdu)(dujdy) + (dxjdv)(dvjdy) = dxjdy = 0. 
The matrices are inverses of each other. The determinants of a matrix and its inverse 
obey our rule: old J times new J — 1. Those J’s cannot be zero, just as dxjdu and 
dujdx were not zero. (Inverse functions increase steadily or decrease steadily.) 

In two dimensions, an area dx dy goes to J du dv and comes back to dx dy. 


14.2 EXERCISES 


Read-through questions 

We change variables to improve the o of integration. 
The disk x 2 + y 2 < 9 becomes the rectangle 0 ^ r u . 

0 < 0 < c The inner limits on [j dy dx are y = ± d 

In polar coordinates this area integral becomes e = 
f 

A polar rectangle has sides dr and q . Two sides are 
not h but the angles are still i The area between 
the circles r = 1 and r = 3 and the rays 0 = 0 and 0 = ?r/4 is 
I The integral jjxdydx changes to JJ fc This is 
the l around the m axis. Then x is the ratio n 
This is the x coordinate of the *> . and it is the p 

value of x ♦ 

In a rotation through ac, the point that reaches (u, u) starts 
at x = ucos 3 — i? sin a. v = Q . A rectangle in the uv plane 
comes from a r in xy. The areas are * so the stretch- 
ing factor is J = t This is the determinant of the matrix 
u containing cos a and sin a. The moment of inertia 
jj x z dx dy changes to JJ * du dv. 

For single integrals dx changes to w du. For double 
integrals dxdy changes to Jdudv with J = * The 

stretching factor J is the determinant of the 2 by 2 matrix 
V The functions t>) and y(u, t>) connect an xy region 
R to a uv region S , and dy = * = area of * . 

For polar coordinates x = B y = c For x = u, y = 
u + 4u the 2 by 2 determinant is J = p A square in the 
uv plane comes from a £ in xy. In the opposite direction 

the change has u = x and v = i(y — x) and a new J = F 

This J is constant because this change of variables is s . 


in 1-12 R is a pie-shaped wedge: 0 < r < 1 and rc/4 < 0 < 3a/4. 

1 What is the area of R? Check by integration in polar 
coordinates. 

2 Find limits on dy dx to yield the area of J?, and integ- 
rate. Extra credit: Find limits on J j dx dy. 

3 Equation (1) with x = ?r/4 rotates R into the uv region S = 
. Find limits on JJ du dv. 

4 Compute the centroid height y of R by changing JJ y dx dy 
to polar coordinates. Divide by the area of R. 

5 The region R has x = 0 because . After rotation 

through i = jt/ 4, the centroid {x, y) of R becomes the centroid 
of S. 

6 Find the centroid of any wedge 

7 Suppose R* is the wedge R moved up so that the sharp 
point is at x = 0, y = 1, 

(a) Find limits on dydx to integrate over R*. 

(b) With .x* = v and y* = y— 1, the xy region R * corres- 
ponds to what region in the x*y* plane? 

(c) After that change dx dy equals dx*dy *. 

8 Find limits on [J r dr dO to integrate over R* in Problem 7. 

9 The right coordinates for R* are r* and 6* t with x ~ 
r* cos 0* and y = r* sin 0* + 1. 

(a) Show that J = r* so dA = r*dr*d0*. 

(i») Find limits on §jr*dr*d0* to integrate over R *. 
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10 If the centroid of R is (0, y), the centroid of R* is . 

The centroid of the circle with radius 3 and center (1, 2) is 
. The centroid of the upper half of that circle is 


11 The moments of inertia I X9 I y , J 0 of the original wedge R 

are * 

12 The moments of inertia J x , / y , / 0 of the shifted wedge R* 

are . 

Problems 13-16 chaoge four-sided regions to squares. 

13 R has straight sides y = 2x, x = 1> y — 1 + 2x, x — 0. Locate 
its four comers and draw R. Find its area by geometry. 

14 Choose a, b, c,d so that the change x = au + bt>, y = 
cu + dv takes the previous R into S, the unit square 0 ^ u < I, 
0 ^ v < 1. From the stretching factor J = ad — be find the area 
of R. 

15 The region R has straight sides x = Q, x = 1, y = Q, y = 
2x + 3. Choose a , b, c so that x = u and y = au 4- bv -f cuv 
change R to the unit square S. 

16 A nonlinear term uv was needed in Problem 15. Which 
regions R could change to the square S with a linear x = 
au + hi), y = cu 4- dt>? 

Draw the xy region R that corresponds in 17-22 to (be uv 
square S with corners (0,0), (1,0), (0, 1), (1, 1). Locate the 
corners of R and then its sides (like a jigsaw puzzle). 

17 x = 2u + i\ y = u + 2v 

18 x = 3u + 2u, y = u + v 

19 x = e 2u+v t y = e* +2v 

20 x- uv, y = v 2 — u 2 

21 x = u y y = u(l -f u 2 ) 

22 x = u cos v, y = u sin v (only three corners) 

23 In Problems 17 and 19, compute J from equation (8). Then 

find the area of R from dv. 

24 In 18 and 20, find J = d{x, y)/d(u, v) and the area of R. 

25 If R lies between x = 0 and x = 1 under the graph of y — 
j\x)> 0, then x — u, y = tf[u) takes R to the unit square S. 
Locate the comers of R and the point corresponding to 
u = i, v = 1. Compute J to prove what we know: 

area of R = f(x)dx = J du dv. 


26 From r = y/x 1 4- y 2 and 0 = tan" L (y/x), compute dr/dx> 
drfdy , BO/dx , BO/dy , and the determinant J = d(r, 0)jd{x , y). 
How is this J related to the factor r = d(x, y)/d(r, 0) that enters 
r dr dOl 

27 Example 4 integrated e'* 1 from 0 to oo (answer ^/n). Also 
B = $le~* 2 dx leads to B 2 = j l 0 e~ xl dx jje _y2 dy. Change this 
double integral over the unit square to r and 0 — and find 
the limits on r that make exact integration impossible. 

28 Integrate by parts to prove that the standard normal 
distribution p(x) = e ' has <r 2 = $*^x 2 p(x)dx = 1. 

29 Find the average distance from a point on a circle to the 

points inside. Suggestion: Let (0, 0) be the point and let 
0 ^ r < 2a cos 0, 0 < 0 < n be the circle (radius a). The distance 
is r, so the average distance is r = JJ j\\ . 

30 Draw the region R: Q < x < 1, 0< y < ao and describe it 
with polar coordinates (limits on r and 0). Integrate 
|f*(x 2 + y 2 )~* i2 dx dy in polar coordinates. 

31 Using polar coordinates, find the volume under z = 
x 2 + y 2 above the unit disk x 2 + y 2 ^ 1. 

32 The end of Example 1 stated the moment of inertia 
\\y 2 dA. Check that integration. 

33 In the square — 1 < x ^ 2, — 2 <y < 1, where could you 
distribute a unit mass (with JJ p dxdy = 1) to maximize 

(a )jjx 2 pdA (b) jjy 2 pdA (c) \\r 2 p dAI 

34 True or false , with a reason: 

(a) If the uv region S corresponds to the xy region R, then 
area of S = area of R. 

(b) jjx dA*Z$$x 2 dA 

(c) The average value ofJ\x y y) is Jj/l(x, y)dA 

(d) ^„xe~ xl dx = 0 

(e) A polar rectangle has the same area as a straight-sided 
region with the same corners. 

35 Find the mass of the tilted square in Example I if the 
density is p — xy. 

36 Find the mass of the ring in Example 2 if the density is 
p = x 2 -f y 2 . This is the same as which moment of inertia with 
which density? 

37 Find the polar moment of inertia / 0 of the ring in 
Example 2 if the density is p = x 2 + y 2 . 

38 Give the following statement an appropriate name: 

y)d/l — f[P) times (area of R), where P is a point in R. 
Which point P makes this correct for /= x and /— y? 

39 Find the xy coordinates of the top point in Figure 14,6a 
and check that it goes to (u, v) = (1, I). 
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14.3 Triple Integrals 


At this point in the book, f fed I can speak to you directly. You can guess what triple 
integrals are like. Instead of a small interval or a small rectangle, there is a small box. 
Instead of length dx or area dx dy, the box has volume dV— dx dy dz. That is length 
times width times height. The goal is to put small boxes together (by integration). 
The main problem will be to discover the correct limits on x t y, z. 

We could dream up more and more complicated regions in three-dimensional 
space. But I don’t think you can see the method clearly without seeing the region 
clearly. In practice six shapes are the most important: 

box prism cylinder cone tetrahedron sphere. 

The box is easiest and the sphere may be the hardest (but no problem in spherical 
coordinates). Circular cylinders and cones fall in the middle, where xyz coordinates 
are possible but rQz are the best. I start with the box and prism and xyz. 


EXAMPLE 1 By triple integrals find the volume of a box and a prism (Figure 14.12). 

I J J dxdydz and fjf dV= f J f dxdydz 

box £ = 0 y — 0 x -0 prism z = G y= 0 x = Q 

The inner integral for both is j dx = 2, Lines in the x direction have length 2 , cutting 
through the box and the prism. The middle integrals show the limits on y (since dy 
comes second): 


After two integrations these are areas , The first area 6 is for a plane section through 
the box. The second area 6 - 6z is cut through the prism. The shaded rectangle goes 
from y = 0 to y — 3 - 3z — we needed and used the equation y + 3z - 3 for the bound- 
ary of the prism. At this point z is still constant ! But the area depends on z, because 
the prism gets thinner going upwards. The base area is 6 - 6z = 6, the top area is 


The outer integral multiplies those areas by dz , to give the volume of slices. They 
are horizontal slices because z came last. Integration adds up the slices to find the 
total volume: 

6 prism volume = f (6 “ 6z)dz = Tfiz — 3z 2 l^ = 3. 


box volume 


The box volume 2*3-1 didn't need calculus. The prism is half of the box, so its 
volume was sure to be 3“ but it is satisfying to see how 6z - 3z 2 gives the answer. 
Our purpose is to see how a triple integral works. 


area 3/2 
volume 3/2 dx 


area 


Fig, 14,12 Box with sides 2, 3, 1. The prism is half of the box: volume |(6— 6z)dz or f i dx. 
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Question Find the prism volume in the order dz dy dx (six orders are possible). 

**"' m: 


To find those limits on the z integral, follow a line in the z direction. It enters 
the prism at z = 0 and exits at the sloping face y + 3z = 3. That gives the upper limit 
z = (3 - y)j 3. It is the height of a thin stick as in Section 14.1. This section writes out 
j dz for the height, but a quicker solution starts at the double integral. 

What is the number f in the last integral? It is the area of a vertical slice , cut by a 
plane x - constant. The outer integral adds up slices. 


Iff 


f(x, y, z) dV is computed from three single integrals 


J [J 


That step cannot be taken in silence — some basic calculus is involved. The triple 
integral is the limit of AV, a sum over small boxes of volume AV. Here f is any 
value offlx, y, z) in the ith box. {In the limit, the boxes fit a curved region.) Now take 
those boxes in a certain order . Put them into lines in the x direction and put the lines 
of boxes into planes. The lines lead to the inner x integral, whose answer depends on 
y and z. The y integral combines the lines into planes. Finally the outer integral 
accounts for all planes and all boxes. 

Example 2 is important because it displays more possibilities than a box or prism. 


EXAMPLE 2 Find the volume of a tetrahedron (4-sided pyramid). Locate (x, y, z). 

Solution A tetrahedron has four flat faces, all triangles. The fourth face in 
Figure 14.13 is on the plane x + y 4- z = 1. A line in the x direction enters at x = 0 
and exits at x = 1 - y - z. (The length depends on y and z. The equation of the 
boundary plane gives x.) Then those lines are put into plane slices by the y integral: 

f f dxdy- f (1 -y~z)dy = [y-jy 2 ~zy]l * = $(! -z) 2 . 

Jy-0 Jx= 0 Jy= 0 

What is this number ^(1 — z) 2 ? It is the area at height z. The plane at that height 
slices out a right triangle, whose legs have length 1 - z . The area is correct, but look 
at the limits of integration. If x goes to 1 — y — z, why does y go to 1 - z? Reason: 
We are assembling lines, not points. The figure shows a line at every y up to 1 - z. 




Rg. 14,t3 Lines end at plane x + y + z = L Triangles end at edge y + z= 1. The average 
height is z = fi\zdVI\l\dV. 
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Adding the slices gives the volume: i(l - z) 2 dz = [£(z - 1) 3 ]£ = This agrees 
with y(base times height), the volume of a pyramid. 

The height z of the centroid is “z avtra , e We compute Jjj z dV and divide by the 
volume. Each horizontal slice is multiplied by its height z, and the limits of integration 
don’t change: 


* J* J* i*l_y i* i —y—2 A 1 

z dV= zdxdy dz = 

J J J jo jo Jo Jo 


2 24 


This is quick because z is constant in the x and y integrals. Each triangular slice 
contributes z times its area j(\ - z) 2 times dz. Then the z integral gives the moment 
1/24. To find the average height, divide 1/24 by the volume: 


z = height of centroid = 


fff zdV _ 1/24 _ 1 
fff dV 1/6 4' 


By symmetry x = j and y = The centroid is the point {*, {, *). Compare that with 
(j, i), the centroid of the standard right triangle. Compare also with the center of 
the unit interval. There must be a five-sided region in four dimensions centered at 

li M h 
\5 » 5 ' 5f' 

For area and volume we meet another pattern. Length of standard interval is 1, 
area of standard triangle is volume of standard tetrahedron is hyper volume in 
four dimensions must be . The interval reaches the point x = t, the triangle 

reaches the line x + y= 1, the tetrahedron reaches the plane x + y + z = 1. The four- 
dimensional region stops at the hyperplane 1. 


EXAMPLE 3 Find the volume jfj dx dy dz inside the unit sphere x 2 4- y 2 + z 2 = 1. 

First question: What are the limits on x? If a needle goes through the sphere in the 
x direction, where does it enter and leave? Moving in the x direction, the numbers y 
and z stay constant. The inner integral deals only with x. The smallest and largest x 
are at the boundary where x 2 -+■ y 2 + i 2 = L This equation does the work — we solve 
it for x. Look at the limits on the x integral: 

1 ? v 1 ~yl - ■ : 2 ? ? 

volume of sphere — j j J dx dy dz = j j 2^/l — y 2 — z 2 dy dz. ( 1 ) 

-j i - v 1 - y 3 -z a i 7 

The limits on y are ~ 1 - z 2 and + yf\ - z 2 . You can use algebra on the boundary 

equation x 2 + y 1 + i 2 — 1 . But notice that x is gone! We want the smallest and largest 
y, for eaeh z. It helps very much to draw the plane at height z, slic ing through the 
sphere in Figure 14,14. The slice is a circle of radius r ~ 1 - z 2 . So the area is 

7 rr\ which must come from the y integral: 

| 2sJ) - y 1 - z 2 dy - area of slice = n{\ - z 2 ). (2) 

I admit that I didn't integrate. Is it cheating to use the formula nr 2 l I don’t think so. 
Mathematics is hard enough, and we don’t have to work blindfolded. The goal is 
u nderstanding , and if you know the area then use it. Of course the integral of 
yf 1 — y 2 — z 2 can be done if necessary — use Section 7.2, 

The triple integral is down to a single integral. We went from one needle to a cirele 
of needles and now to a sphere of needles. The volume is a sum of slices of area 
rr(l — z 2 \ The South Pole is at z = — 1, the North Pole is at z — +1, and the integral 
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is the volume 4n/3 inside the unit sphere: 



tt( 1 — z 2 )dz = n 




( 3 ) 


Question 1 A cone also has circular slices. How is the last integral changed? 
Answer The slices of a cone have radius 1 z. Integrate (1 — z} 2 not v /l z 2 . 

Question 2 How does this compare with a circular cylinder (height 1. radius 1)? 
Answer Now all slices have radius 1. Above z = 0. a cylinder has volume n and a 
half-sphere has volume \n and a cone has volume \ n . 

For solids with equal surface area, the sphere has largest volume. 


Question 3 

Answer 


What is the average height z in the cone and half-sphere and cylinder? 

_ f zfslice areajdz 1,3 ,1 

} (slice area)dz 4 an 8 an 2’ 



Fig. 14,14 Jdx = length of needle, 


z= 1 




dx dy = area of slice. Ellipsoid is a stretched sphere. 


EXAMPLE 4 Find the volume JJJ dx dy dz inside the ellipsoid x 2 ja 2 4 y 2 /b 2 4 z 2 jc 2 - 

The limits on x arc now ±yf\ —y l fb x - z z jc z . The algebra looks terrible. The geom- 
etry is better— all slices are ellipses. A change of variable is absolutely the best. 

Introduce u = x/a and v-yjb and w = z/c . Then the outer boundary becomes 
u 2 4- v 2 + w 2 = 1. In these new variables the shape is a sphere. The triple integral for 
a sphere is jj’j du dvdw = 4tt/ 3. But what volume dV in xyz space corresponds to a 
small box with sides du and dv and dw? 

Every uvw box comes from an xyz box. The box is stretched with no bending or 
twisting. Since u is x/a, the length dx is a du . Similarly dy = b dv and dz = c dw . The 
volume of the xyz box (Figure 14,14) is dx dy dz = ( abc ) du dv dw. The stretching factor 
J = ahe is a constant, and the volume of the ellipsoid is 

bad limits better limits A n 

||| dx dy dz — IJj (abc) du dv dw = — abc. (4) 

ellipsoid sphere 3 

You realize that this is special — other volumes are much more complicated. The 
sphere and ellipsoid are curved, but the small xyz boxes are straight. The next section 
introduces spherical coordinates, and we can finally write “ good limits f But then we 
need a different J . 
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14.3 EXERCISES 


Read-through questions 

Six important solid shapes are o . The integral 
||| dx dy dz adds the volume b of small c , For com- 

putation it becomes d single integrals. The inner integral 
|dx is the e of a line through the solid. The variables 
t and g are held constant. The double integral 
dx dy is the b of a slice, with < held constant. 
Then the z integral adds up the volumes of 1 . 

If the solid region V is bounded by the planes x = 0, y = 0, 
z = 0, and x + 2y + 3z = ! , the limits on the inner x integral 
are k . The limits on y are i The limits on z are 
m , In the new variables u = x, v = 2y, w = 3z, the equation 
of the outer boundary is n The volume of the tetrahe- 
dron in uvw space is o From dx = du and dy — dv/2 and 
dz = P . the volume of an xyz box is dx dy dz = 
q du dv dw. So the volume of V is _j_. 

To find the average height z in V we compute » / t 

To find the total mass in V if the density is p = e* we compute 
the integral u To find the average density we compute 
v / w , In the order [|| dz dx dy the limits on the inner 
integral can depend on x , The limits on the middle integ- 
ral can depend on v The outer limits for the ellipsoid 
x 2 + 2 y 1 + 3z 2 ^ 8 are * , 

1 For the solid region G^x^y<z<l, find the limits in 
Iff dx dy dz and compute the volume, 

2 Reverse the order in Problem 1 to f|f dzdydx and find 

the limits of integration. The four faces of this tetrahedron 
are the planes x = 0 and y — x and 

3 This tetrahedron and five others like it fill the unit cube. 
Change the inequalities in Problem 1 to describe the other five. 

4 Find the centroid (x, y , z) in Problem 1. 

Find the limits of integration in ||J dx dy dz and the volume of 
solids £-16* Draw a very rough picture. 

5 A cube with sides of length 2, centered at (0, 0, 0). 

6 Half of that cube, the box above the xy plane. 

7 Part of the same cube, the prism above the plane z = y. 

8 Part of the same cube, above z = y and z = 0. 

9 Part of the same cube, above z = x and below z - y. 

10 Part of the same cube, where x^y^z. What shape is 
this? 

H The tetrahedron bounded by planes x = 0, y = 0, z = 0, 
and x + y + 2z = 2. 

12 The tetrahedron with comers (0, 0, 0), (2, 0, 0), (0, 4, 0), 
(0, 0, 4). First find the plane through the last three corners. 


13 The part of the tetrahedron in Problem 11 below z = j, 

14 The tetrahedron in Problem 12 with its top sliced off by 
the plane z= 1. 

15 The volume above z— 0 below the cone ^/x 2 + y 2 — 1 —z. 

*16 The tetrahedron in Problem 12, after it falls across the 
x axis onto the xy plane. 

In 17-20 find the limits in JJJ dx dy dz or ||j dz dy dx . Compute 
the volume. 

17 A circular cylinder with height 6 and base x 2 + y 2 =£ 1. 

18 The part of that cylinder below the plane z = x. Watch the 
base. Draw a picture. 

19 The volume shared by the cube (Problem 5) and cylinder. 

20 The same cylinder lying along the x axis. 

21 A cube is inscribed in a sphere: radius 1, both centers at 
(0, 0, 0). What is the volume of the cube? 

22 Find the volume and the centroid of the region bounded 
by x — 0, y = 0, z = 0, and xfa + yjb + zjc = 1. 

23 Find the volume and centroid of the solid 
0 ^ z ^ 4 — x 2 — y 2 . 

24 Based on the text, what is the volume inside 
x 2 + 4y 2 4- 9z 2 = 16? What is the “hypervolume” of the 
4-dimensional pyramid that stops at x + y + z-hw — 1? 

25 Find the partial derivatives dl{6x y dl/dy , d 2 lfdydz of 

/ = 1 1 dx dy and 1 = 1 1 f /(x, y, z) dx dy dz . 
oo oo o 

26 Define the average value of /{x, y, z) in a solid V. 

27 Find the moment of inertia ||| l 2 dV of the cube |x| < 1, 
|y| ^ 1, \z\ ^ 1 when l is the distance to 

(a) the x axis (b) the edge y = z = 1 (c) the diagonal x = y = z. 

28 Add upper limits to produce the volume of a unit cube 
from small cubes: V= ]T JT X (Ax) 3 = 1. 

i= i ;= 1 *= 1 

3/Ax 2 1 Ax j 

*29 Find the limit as Ax of ]T £ X (Ax) 3 . 

f=i j= i *=i 

30 The midpoint rule for an integral over the unit cube 
chooses the center value f{\ y i). Which functions / = x^yz'* 
are integrated correctly? 

31 The trapezoidal rule estimates |J|o|o /{x, y, z) dx dy dz 
as £ times the sum of f{x t y y z) at 8 corners. This correctly 
integrates x m y"z p for which m, n, p? 

32 Propose a 27-point “Simpson's Rule” for integration over 
a cube. If many small cubes fill a large box, why are there 
only 8 new points per cube? 





14.4 Cylindrical and Spherical Coordinates 


541 


■ 14.4 Cylindrical and Spherical Coordinates ■ 

Cylindrical coordinates are good for describing solids that are symmetric around an 
axis. The solid is three-dimensional, so there are three coordinates rj) y z: 

r: out from the axis 0: around the axis z: along the axis. 

This is a mixture of polar coordinates rO in a plane, plus z upward. You will not find 
rOz difficult to work with. Start with a cylinder centered on the z axis: 

solid cylinder: 0 ^ r ^ 1 flat bottom and top : 0 ^ z ^ 3 half cylinder: 

Integration over this half-cylinder is j* \ l 0 ? dr dO dz. These limits on r, z are 

especially simple. Two other axially symmetric solids are almost as convenient: 

cone integrate for + z= 1 sphere: integrate to r 2 + z 1 — R 2 

I would not use cylindrical coordinates for a box. Or a tetrahedron. 

The integral needs one thing more — the volume dV. The movements dr and dO 
and dz give a "curved box"' in xyz space, drawn in Figure 14. 1 5c, The base is a polar 
rectangle, with area r dr dO, The new part is the height dz. The volume of the curved 
box is r dr dO dz , Then r goes in the blank space in the triple integral— the stretching 
factor is J - r. There are six orders of integration (we give two): 



0-0 O axis) 

Fig. 14,15 Cylindrical coordinates for a point and a half-cylinder Small volume r dr dO dz. 

EXAMPLE 1 (Volume of the half-cylinder). The integral of rdr from 0 to 1 is j. The 
0 integral is n and the z integral is 3. The volume is 3rr/2, 

EXAMPLE 2 The surface r = 1 — z encloses the cone in Figure 14.16. Find its volume. 

First solution Since r goes out to 1 - z, the integral of r dr is j(l - z) 2 . The 0 integral 
is 2 tt (a full rotation). Stop there for a moment. 

We have reached J'f r dr dO = j(! - z) 2 2 n. This is the area of a slice at height z. The 
slice is a circle, its radius is 1 — z, its area is tt( 1 — z) 3 . The z integral adds those slices 
to give jt/ 3. That is correct, but it is not the only way to compute the volume. 
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Second solution Do the z and 6 integrals first. Since z goes up to 1 — r, and 0 goes 
around to 2n 7 those integrals produce JJ r dz d9= r{\ - r)2n. Stop again — this must 
be the area of something. 

After the z and 9 integrals we have a shell at radius r. The height is 1 - r (the outer 
shells are shorter). This height times 2nr gives the area around the shell. The choice 
betweeen shells and slices is exactly as in Chapter 8. Different orders of integration 
give different ways to cut up the soSd. 

The volume of the shell is area times thickness dr. The volume of the complete 
cone is the integral of shell volumes: rf 1 — r)2?i dr = n/3. 

Third solution Do the r and z integrals first: JJ rdrdz = £. Then the 9 integral is 
j £ d9 t which gives £ times 2 n. This is the volume *r/3 — but what is % d9 ? 

The third cone is cut into wedges. The volume of a wedge is £ d6. It is quite common 
to do the 9 integral last, especially when it just multiplies by In, It is not so common 
to think of wedges. 

Question Is the volume £ d9 equal to an area z times a thickness d0? 

Answer No! The triangle in the third cone has area \ not £. Thickness is never dO, 




Fig. 14,16 A cone cut three ways: slice at height z, shell at radius r, wedge at angle 0 . 


This cone is typical of a solid of revolution. The axis is in the z direction. The 8 
integral yields 2 n 7 whether it comes first, second, or third. The r integral goes out to 
a radius^z), which is 1 for the cylinder and 1 - 2 for the cone. The integral jj r dr d9 
is n{f{z)) 2 = area of circular slice. This leaves the z integral j n (f[z)) 2 dz. That is our 
old volume formula j n{/[x)) 2 dx from Chapter 8, where the slices were cut through 
the * axis. 

EXAMPLE 3 The moment of inertia around the z axis is jjj r z dr d9 dz. The extra r 2 
is (distance to axis) 2 . For the cone this triple integral is 7r/10. 

EXAMPLE 4 The moment around the z axis is jjj r 2 dr d8 dz. For the cone this is 7t/6. 
The average distance r is (moment)/(volume) - (ti/6}/(7i/3) - 

EXAMPLE 5 A sphere of radius R has the bou ndary r 2 + z 2 = R 2 < in cylindrical 
coordinates. The outer limit on the r integral is ^ /R 2 — z 2 . That is not acceptable in 
difficult problems. To avoid it we now change to coordinates that are natural for a 
sphere. 
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SPHERICAL COORDINATES 

The Earth is a solid sphere (or near enough). On its surface we use two coordinates — 
latitude and longitude. To dig inward or fly outward, there is a third coordinate — 
the distance p from the center. This Greek letter rko replaces r to avoid confusion 
with cylindrical coordinates. Where r is measured from the z axis, p is measured from 
the origin. Thus r 1 = x 2 + y 2 and p 2 = x 2 + y 2 + z 2 . 

The angle 6 is the same as before. It goes from 0 to 2 n. It is the longitude, which 
increases as you travel east around the Equator. 

The angle <j> is new. It equals 0 at the North Pole and n (not 2n) at the South Pole. 
It is the polar angle, measured down from the z axis. The Equator has a latitude of 
0 but a polar angle of rc/2 (halfway down). Here are some typical shapes: 

solid sphere (or ball): 0 < p ^ Jl surface of sphere: p = R 

upper half-sphere: 0 < <j> ^ rc/2 eastern half-sphere: 0 ^ 8 ^ n 





Fig. 14.17 Spherical coordinates p4>B. The volume dV= p 2 sin <P dp dtp dO of a spherical box. 


The angle <j> is constant on a cone from the origin. It cuts the surface in a circle 
(Figure 14.17b), but not a great circle. The angle 8 is constant along a half-circle 
from pole to pole. The distance p is constant on each inner sphere, starting at the 
center p = 0 and moving out to p = Jl. 

In spherical coordinates the volume integral is jj j p 2 sin <j> dp dtp dd. To explain that 
surprising factor J = p 2 sin <j > , start with x = r cos 8 and y = r sin 8. In spherical coor- 
dinates r is p sin 0 and z is p cos o — see the triangle in the figure. So substitute 
p sin <f> for r: 

x = p sin <j> cos 0, y = p sin <j> sin 0, z= p cos <j>. (1) 

Remember those two steps, p4>8 to r8z to xyz. We check that x 2 + y 2 + z 2 = p 2 : 
p 2 (sin 2 0 cos 2 0 + sin 2 <£ sin 2 0 + cos 2 <j>) = p 2 (sin 3 0 + cos z <t>) = p 2 . 

The volume integral is explained by Figure 14.17c. That shows a “spherical box” 
with right angles and curved edges. Two edges are dp and pd<j>. The third edge is 
horizontal. The usual rd8 becomes p sin 4> d8. Multiplying those lengths gives dV. 

The volume of the box is dV = p 2 sin <j> dp d<f> d8. This is a distance cubed, from p 2 dp. 
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EXAMPLE 6 A solid ball of radius R has known volume V ~ jJtK 3 . Notice the limits: 


jo lo jo p2S ' n ^ d(> ^ = ^ p3 ]o cos ^]o Mo* = {iK 3 H2)(27t). 

Question What is the volume above the cone in Figure 14.17? 

Answer The integral stops at [-cos 4*]^ 1 The volume is (iK 3 )(^){27i), 


EXAMPLE 7 The surface area of a sphere is A = AnR 2 . Forget the p integral; 


A- P* P JJ 2 sin0^d0=R 2 [-cos0j" [fl]J*= R a (2)(2»t). 

Jo Jo 


After those examples from geometry, here is the real thing from science. I want to 
compute one of the most important triple integrals in physics — “the gravitational 
attraction of a solid sphere/’ For some reason Isaac Newton had trouble with this 
integral. He refused to publish his masterpiece on astronomy until he had solved it. 
I think he didn’t use spherical coordinates — and the integral is not easy even now* 

The answer that Newton finally found is beautiful. The sphere acts as if all its mass 
were concentrated at the center . At an outside point {0, 0, D\ the force of gravity is 
proportional to IjD 2 . The force from a uniform solid sphere equals the force from a 
point mass, at every outside point P. That is exactly what Newton wanted and 
needed, to explain the solar system and to prove Kepler’s laws. 

Here is the difficulty* Some parts of the sphere are closer than D , some parts are 
farther away. The actual distance q, from the outside point P to a typical inside point, 
is shown in Figure 14.18. The average distance q to ail points in the sphere is not D. 
But what Newton needed was a different average, and by good luck or some divine 
calculus it works perfectly: The average of\jq is 1 ID, This gives the potential energy: 


potential at point P = 

sphere 



volume of sphere 
_ 


( 2 ) 


A small volume dV at the distance q contributes dVjq to the potential (Section 8.6, 
with density 1). The integral adds the contributions from the whole sphere. Equation 
(2) says that the potential at r = D is not changed when the sphere is squeezed to 
the center. The potential equals the whole volume divided by the single distance D. 

Important point: The average of 1 jq is IjD and not \jq. The average of j and £ is 
not y* Smaller point: I wrote “sphere” where I should have written “ball.” The 
sphere is solid: 0 ^ 2 tt. 


What about the force? For the small volume it is proportional to dVjq 2 {this is the 
inverse square law). But force is a vector , pulling the outside point toward dV — not 
toward the center of the sphere. The figure shows the geometry and the symmetry* 
We want the i component of the force. (By symmetry the overall x and y components 
are zero*) The angle between the force vector and the z axis is ot, so for the z component 
we multiply by cos j. The total force comes from the integral that Newton discovered: 


„ ( n rrr cos 2 lmf volume of sphere 

force at point P = |JJ — — dV = — — - — - 

sphere q D 


( 3 ) 


I will compute the integral (2) and leave you the privilege of solving {3). 1 mean that 

word seriously. If you have come this far, you deserve the pleasure of doing what at 
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Fig. 14.18 Distance q from outside point to inside point. Distances q and Q to surface. 


one time only Isaac Newton could do. Problem 26 offers a suggestion (just the law 
of cosines) but the integral is yours. 

The law of cosines also helps with (2). For the triangle in the figure it gives q 2 — 
D 2 - 2pD cos <j> + p 2 . Call this whole quantity «. We do the surface integral first 
and dB with p fixed). Then q 2 = u and q = and du — 2 pD sin 4> dtp: 


"2k 

** p 2 sin <p dtp d9 f 

Jo . 

o q 


2pD 



(4) 


2jt came from the 9 integral. The integral of duf^/u is 2 Ju. Since cos <f>~ — 1 at the 
upper limit, u is D 2 + 2 pD + p 2 . The square root of u is D + p. At the lower limit 
cos 4> = + 1 and u = D 2 - 2 pD + p 2 . This is another perfect square — its square root 
is D - p. The surface integral (4) with fixed p is 


Last comes the p integral: 4np 2 dpjD = ^rcR 3 //). This proves formula (2): 

potential equals volume of the sphere divided by D . 

Note 1 Physicists are also happy about equation (5). The average of 1 jq is IjD not 
only over the solid sphere but over each spherical shell of area 4irp 2 . The shells can 
have different densities, as they do in the Earth, and still Newton is correct. This also 
applies to the force integral (3)— each separate shell acts as if its mass were concen- 
trated at the center. Then the final p integral yields this property for the solid sphere. 


Note 2 Physicists also know that force is minus the derivative of potential. The 
derivative of (2) with respect to D produces the force integral (3). Problem 27 explains 
this shortcut to equation (3). 


EXAMPLE 6 Everywhere inside a hollow sphere the force of gravity is zero . 

When D is smaller than p, the lower limit y/u in the integral (4) changes from D — p 
to p - D, That way the square root stays positive. This changes the answer in (5) to 
47rp 2 /p, so the potential no longer depends on />. The potential is constant inside the 
hollow shell Since the force comes from its derivative, the force is zero. 

A more intuitive proof is in the second figure. The infinitesimal areas on the surface 
are proportional to q 2 and Q 2 . But the distances to those areas are q and Q , so the 
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forces involve \jq 2 and \/Q 2 (the inverse square law). Therefore the two areas exert 
equal and opposite forces on the inside point, and they cancel each other. The total 
force from the shell is zero, 

I believe this zero integral is the reason that the inside of a car is safe from lightning. 
Of course a car is not a sphere. But electric charge distributes itself to keep the surface 
at constant potential. The potential stays constant inside— therefore no force. The 
tires help to prevent conduction of current (and electrocution of driver). 

RS. Don't just step out of the car. Let a metal chain conduct the charge to the 
ground. Otherwise you could be the conductor, 

CHANGE OF COORDINATES— STRETCHING FACTOR J 

Once more we look to calculus for a formula. We need the volume of a small curved 
box in any uvw coordinate system. The r8z box and the p<t>8 box have right angles, 
and their volumes were read off from the geometry (stretching factors J = r and J — 
p 2 sin in Figures 14,15 and 14,17). Now we change from xyz to other coordinates 
uvw — which are chosen to fit the problem. 

Going from xy to uv t the area dA = J du dv was a 2 by 2 determinant. In three 
dimensions the determinant is 3 by 3. The matrix is always the “Jacobian matrix,” 
containing first derivatives. There were four derivatives from xy to uv, now there are 
nine from xyz to uuw. 


14C Suppose x 7 y , z are given in terms of u, v * w. Then a small box in uvw 
space (sides du , dv , dw) comes from a volume dV= J dudvdw in xyz space; 


J - 


Sxjdu dxjdv 
dyjdu dyjdv 
dzfdu dzjdv 


dx/dw 

dy/dw 

dzjdw 


, , , 3(jc, y, z) 

— stretching factor — -. 

M* f\ W) 


( 6 ) 


The volume integral JJj dx dy dz becomes \J\ du dv dw t with limits on uvw , 


Remember that a 3 by 3 determinant is the sum of six terms (Section 1 1.5). One term 
in J is ( dxjdu){dy/dv){dzldw ), along the main diagonal. This comes from pure stretch- 
ing, and the other five terms allow for rotation. The best way to exhibit the formula 
is for spherical coordinates — where the nine derivatives are easy but the determinant 
is not: 

EXAMPLE 9 Find the factor J for x = p sin <t> cos 0, y = p sin <t> sin 8, z — p cos <£♦ 

sin cos 8 p cos cos 8 — p sin sin 8 

sin sin 8 p cos ^ sin 8 p sin cos 8 , 

cos <t> — p sin ^ 0 

The determinant has six terms, but two are zero — because of the zero in the corner. 
The other four terms are p 2 sin cos 2 <£ sin 2 f? and p 2 sin 4> cos 2 4> cos 2 f? and 
p 1 sin 3 <£ sin 2 0 and p 2 $in 3 4> cos 2 0. Add the first two (note sin 2 f? + cos 2 0) and separately 
add the second two. Then add the sums to reach J = p 2 sin <j>. 

Geometry already gave this answer. For most uvw variables, use the determinant. 


c{x, y> z) 
3(p, 8 ) 
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Read-through questions 

The three o coordinates are rdz. The point at x = y — 
7—1 has r= b . 9 — c . z — d The volume 
integral is JJJ a . The solid region 1 ^ r < 2, 0 < 6 < 2n, 
0 < z < 4 is a I . Its volume is g . From the r and 0 
integrals the area of a h equals I From the z and 
6 integrals the area of a I equals k . In rOz coordi- 
nates the shapes of I are convenient, while m are 

not 

The three n coordinates are pp9 , The point at x = y = 

z = 1 has p = __o , <t>= p . 9= q . The angle p is 

measured from t . 9 is measured from t , p is the 
distance to t where r was the distance to u If p<£0 

are known then jc = v y — w , z = x The 

stretching factor J is a 3 by 3 y and volume is Jf[ * 

The solid region l<p<2, 0<<£<7r, O^0^2nisa a . 
Its volume is b . From the p and 6 integrals the area of 
a c at radius p equals o Newton discovered that 
the outside gravitational attraction of a E is the same as 
for an equal mass located at f . 

Convert the xyz coordinates in 1-4 to rOz and pp9. 


S«Cl P 


m l 

0 

C2n f*/3 r 

" J. J. J. 

m sii 

rn: 
m: 
m: 


p 2 sin </> dp d(j> dO 


17 


18 


19 


p 2 sin p dp dp dQ 
p 2 sin p dp dp dO 
p 2 sin p dp dtp dO 
p 2 sin p dp dp dO 
p 2 sin p dp dp dO 


21 Example 5 gave the volume integral for a sphere in rOz 
coordinates. What is the area of the circular slice at height z? 
What is the area of the cylindrical shell at radius r? Integrate 
over slices (dz) and over shells (dr) to reach 4nR 3 /3. 

22 Describe the solid with 0 ^ p < 1 — cos p and find its 
volume. 


1 (A 0,0) 2 (0, -D, 0) 

3 {0, 0, D) (watch 9) 4 (3, 4, 5) 

Convert the spherical coordinates in 5-7 to xyz and rOz. 

5 p = 4, p = a/4, 9 = — n/4 

6 p — 2, P = 7r/3, 6 = nib 

7 p = \ y <t> = n t 9 = anything. 

8 Where does x-r and y = 0? 

9 Find the polar angle p for the point with cylindrical 
coordinates rdz. 


23 A cylindrical tree has radius a. A saw cuts horizontally, 
ending halfway in at the x axis. Then it cuts on a sloping 
plane (angle a with the horizontal), also ending at the x axis. 
What is the volume of the wedge that falls out? 

24 Find the mass of a planet of radius R , if its density at each 
radius p is <5 = (p -h 1)/p, Notice the infinite density at the 
center, but finite mass M = dV. Here p is radius, not 
density. 

25 For the cone out to r = I — z, the average distance from 
the z axis is r — For the triangle out to r = 1 — z the average 
is f = 3 . How can they be different when rotating the triangle 
produces the cone? 


10 What are *(t), y(t\ 2 (f) on the great circle from p = l, 
p = 77/2, 0 = 0 with speed 1 to p — 1, p = tt/4, 0 = 77 / 2 ? 


From the limits of integration describe each region in 11-20 
and find its volume. The inner integral has the inner limits. 



Problems 26-32, on the attraction of a sphere, use Figure 14,18 

and the law of cosines q 2 — D 2 — 2 pD cos p 4- p 2 - u. 

26 Newtons achievement Show that j]J(cos j)dV/q 2 equals 
volume/D 2 . One hint only : Find cos j from a second law of 
cosines p 2 = D 2 — 2^0 cos a + q 2 . The p integral should 
involve 1 jq and 1 (q 3 . Equation (2) integrates \/q, leaving 
jjj dV/q 2 still to do. 

27 Compute dq/dD in the first cosine law and show from 
Figure 14.18 that it equals cos a. Then the derivative of 
equation (2) with respect to D is a shortcut to Newton’s 
equation (3), 

28 The lines oflength D and q meet at the angle a. Move the 
meeting point up by AD. Explain why the other line stretches 
by Aq * AD cos 3. So dq/dD = cos 1 as before. 
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29 Show that the average distance is q — ARj3, from the 
North Pole (/) = ft) to points on the Earth’s surface (p = ft). 
To compute: q = }J<jft 2 sin d<j> d0/(area AnR 2 ). Use the same 
substitution u. 

30 Show as in Problem 29 that the average distance is 

q = D + from the outside point (0, 0, D ) to points on 

the shell of radius p. Then integrate jjlqdV and divide by 
4 *iJ? 3 /3 to find q for the solid sphere. 

31 In Figure 14.18b, it is not true that the areas on the surface 
are exactly proportional to q 1 and Q 1 . Why not? What hap- 
pens to the second proof in Example 8? 

32 For two solid spheres attracting each other (sun and 
planet), can we concentrate both spheres into point masses at 
their centers? 

*33 Compute JJjcos a dVjq 2 to find the force of gravity at 
(0, 0, D) from a cylinder x 2 + y 2 < a 2 , 0 ^ z < h. Show from a 
figure why q 1 = r 2 + {D — z} 2 and cos tr = (D — z)fq. 

34 A linear change of variables has x = au + bv + cw, y = 
du-\-ev-\-fw, and z=gu + hv + iw. Write down the six terms 
in the determinant J. Three terms have minus signs, 

35 A pure stretching has x = au, y = bv , and z - cw. Find the 
3 by 3 matrix and its determinant J. What is special about 
the xyz box in this case? 

36 (a) The matrix in Example 9 has three columns. Find the 
lengths of those three vectors (sum of squares, then square 
root). Compare with the edges of the box in Figure 14T7* 
(b) Take the dot product of every column in J with every 
other column. Zero dot products mean right angles in the 
box. So J is the product of the column lengths. 


37 Find the stretching factor J for cylindrical coordinates 
from the matrix of first derivatives. 

38 Follow Problem 36 for cylindrical coordinates — find the 
length of each column in J and compare with the box in 
Figure 14.15. 

39 Find the moment of inerlia around the z axis of a spherical 

shell (radius p, density 1). The distance from the axis to a 
point on the shell is r — . Substitute for r to find 

Up) = Jo* Jo r Vsin 0 d<f> do. 

Divide by mr 2 (which is 4 np A ) to compute the number J for 
a hollow ball in the rolling experiment of Section 8 + 5 + 

40 The moment of inertia of a solid sphere (radius ft, 

density 1) adds up the hollow spheres of Problem 39: 
/ = Divide by mft 2 (which is f*ft 5 ) to 

find J in the rolling experiment. A solid ball rolls faster than 
a hollow ball because * 

41 Inside the Earth, the force of gravity is proportional to 
the distance p from the center. Reason: The inner ball of 

radius p has mass proportional to (assume constant 

density). The force is proportional to that mass divided by 

. The rest of the Earth (sphere with hole) exerts no 
force because . 

42 Dig a tunnel through the center to Australia. Drop a ball 
in the tunnel at y = ft; Australia is y= — R. The force of 
gravity is — cy by Problem 41. Newton’s law is my" = — cy. 
What does the ball do when it reaches Australia? 
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CHAPTER 15 


Vector Calculus 


Chapter 14 introduced double and triple integrals. We went from | dx to |j dx dy and 
HI dx dy dz. All those integrals add up small pieces, and the limit gives area or volume 
or mass. What could be more natural than that? I regret to say, after the success of 
those multiple integrals, that something is missing. It is even more regrettable that 
we didn't notice it. The missing piece is nothing less than the Fundamental Theorem 
of Calculus. 

The double integral || dx dy equals the area. To compute it, we did not use an 
antiderivative of L At least not consciously. The method was almost trial and error, 
and the hard part was to find the limits of integration. This chapter goes deeper, to 
show how the step from a double integral to a single integral is really a new form of 
the Fundamental Theorem — when it is done right. 

Two new ideas are needed early, one pleasant and one not. You will like vector 
fields. You may not think so highly of line integrals. Those are ordinary single integrals 
like | u(x)dx, but they go along curves instead of straight lines. The nice step dx 
becomes the confusing step ds. Where | dx equals the length of the interval, | ds is 
the length of the curve. The point is that regions are enclosed by curves, and we have 
to integrate along them. The Fundamental Theorem in its two-dimensional form 
(Green's Theorem) connects a double integral over the region to a single integral along 
its boundary curve . 

The great applications are in science and engineering, where vector fields are so 
natural. But there are changes in the language. Instead of an antiderivative, we speak 
about a potential function . Instead of the derivative, we take the “divergence" and 
“curl" Instead of area, we compute flux and circulation and work. Examples come 
first. 



15.1 Vector Fields 



For an ordinary scalar function, the input is a number x and the output is a number 
/(x). For a vector field (or vector function), the input is a point (x, y ) and the output 
is a two-dimensional vector F(x, y ). There is a “field” of vectors, one at every point. 
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In three dimensions the input point is (x, y,z) and the output vector F has three 
components- 

DEFINITION Let R be a region in the xy plane- A vector field F assigns to every point 
(x, y) in R a vector F(x, y) with two components: 

F(x,y)=M(x,y)i + A r (.x,yM. (1) 

This plane vector field involves two functions of two variables. They are the compo- 
nents M and N , which vary from point to point. A vector has fixed components, a 
vector field has varying components. 

A three-dimensional vector field has components M(x, y, z) and N(x f y,z) and 
P(x, y, z). Then the vectors are F = Mi + Nj + Pk, 

EXAMPLE 1 The position vector at (x, y) is R = xi 4- yj. Its components are M = x 
and N =>\ The vectors grow larger as we l eave the origin (Figure 15-la). Their 
direction is outward and their length is |R| = Jx 1 + y 2 = r. The vector R is boldface, 
the number r is lightface. 

EXAMPLE 2 The vector field R jr consists of unit vectors u„ pointing outward. We 
divide R = xi + yj by its length, at every point except the origin- The components 
of R jr are M = x/r and N = y/r. Figure 15.1 shows a third field R/r 2 , whose length 
is l/r. 



/■ ^ 





\ 


fc * 

4 4 

if * 


Fig. 15.1 The vector fields R and R/r and R/r 2 are radial. Lengths r and 1 and l/r. 


EXAMPLE 3 The spin field or rotation field or turning field goes around the origin 
instead of away from it. The field is S. Its components are Xf = — y and N — x: 


S = — yi 4- xj also has length |S| = % /(— y) 2 4- x 1 = r. (2) 

S is perpendicular to R — their dot product is zero: S* R = (— y)(x) + (x)(y) = 0, The 
spin fields S/r and S/r 2 have lengths 1 and l/r: 


S y . x 

- = — i h — i has 
r r r 




x 2 + y 2 


1 + 


x 2 +y il 


has 


The unit vector S/r is u 0 . Notice the blank at (0, 0), where this field is not defined. 





Fig. 15.2 The spin fields S and Sir and S/r 2 go around the origin. Lengths r and l and 1 r. 
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EXAMPLE 4 A gradient field starts with an ordinary function /(*, y). The components 
Af and N are the partial derivatives dffdx and dffdy. Then the field F is the gradient 
of/: 

F = grad /= V/= dflux i H- cfidy ] (3) 

This vector field grad / is everywhere perpendicular to the level curves f(x, y) = c. The 
length |grad/| tells how fast /is changing (in the direction it changes fastest). Invent 
a function like / = x 2 y , and you immediately have its gradient field F = 2xy\ + x 2 j. 
To repeat, M is dffdx and N is dffdy. 

For every vector field you should ask two questions: Is it a gradient field ? If so, 
what is ft Here are answers for the radial fields and spin fields: 


15A The radial fields R and R/r and R jr 2 are all gradient fields. 
The spin fields S and S fr are not gradients of any /( x, y). 

The spin field S fr 2 is the gradient of the polar angle 6 — tan' 


The derivatives of/ - ?{x 2 + y 2 ) are x and y, Thus R is a gradient field. The gradient 
of f=r is the unit vector R/r pointing outwards. Both fields are perpendicular to 
circles around the origin. Those are the level curves oif—jr 2 and /= r. 

Question Is every R/r" a gradient field? 

Answer Yes , But among the spin fields, the only gradient is S/r 2 . 

A major goal of this chapter is to recognize gradient fields by a simple test. The 
rejection of S and S/r will be interesting. For some reason — yi + xj is rejected and 

yi + xj is accepted. (It is the gradient of .) The acceptance of S/r 2 as the 

gradient of /= 9 contains a surprise at the origin (Section 1 5,3). 

Gradient fields are called conservative. The function / is the potential function . 
These words, and the next examples, come from physics and engineering. 

EXAMPLE 5 The velocity field is V and the flow field is pV. 

Suppose fluid moves steadily down a pipe. Or a river flows smoothly (no waterfall). 
Or the air circulates in a fixed pattern. The velocity can be different at different points, 
but there is no change with time. The velocity vector V gives the direction of flow 
and speed of flow at every point. 

In reality the velocity field is V(x, y, z), with three components M, N y P, Those are 
the velocities v li v 2 ,v$ in the x,y,z directions. The speed |V| is the length: |V| 2 = 
t?i + In a “plane flow” the k component is zero, and the velocity field is 
Pi i + 



gravity 


\ / 


/ \ 

F = - R/r - 1 


Fig. 15.3 A steady velocity field V and two force fields F. 
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For a compact disc or a turning wheel, V is a spin field (V = o>S, o> = angular 
velocity). A tornado might be closer to V = S jr 2 (except for a dead spot at the center). 
An explosion could have V = R/r 2 . A quieter example is flow in and out of a lake 
with steady rain as a source term. 

The flow field p\ is the density p times the velocity field. While V gives the rate of 
movement, pV gives the rate of movement of mass . A greater density means a greater 
rate |pV| of “mass transport ” It is like the number of passengers on a bus times the 
speed of the bus. 


EXAMPLE 6 Force fields from gravity: F is downward in the classroom, F is radial 
in space. 


When gravity pulls downward, it has only one nonzero component: F = — mgk. This 
assumes that vectors to the center of the Earth are parallel — almost true in a class- 
room , Then F is the gradient of —mgz (note dfjdz — — mg). 

in physics the usual potential is not — mgz but -\~mgz . The force field is minus grad / 
also in electrical engineering. Electrons flow from high potential to low potential. 
The mathematics is the same, but the sign is reversed. 

In space, the force is radial inwards: F = -mA/GR/r 3 . Its magnitude is propor- 
tional to 1/r 2 (Newton’s inverse square law). The masses are m and M, and the 
gravitational constant is G = 6.672 x 10“ :1 — with distance in meters, mass in kilo- 
grams, and time in seconds. The dimensions of G are (force) (distance) 2 /(mass) 2 . This 
is different from the acceleration g = 9.8m/sec 2 , which already accounts for the mass 
and radius of the Earth. 

Like all radial fields, gravity is a gradient field . It comes from a potential/: 


mMG J df 
and 

dx 


mMGx 
t — and 


ft 

dy 


mMGy 


and 


ft 

di 


mMGz 

< 4 > 


EXAMPLE 7 (a short example) Current in a wire produces a magnetic field B. It is 
the spin field S/r 2 around the wire, times the strength of the current. 


STREAMLINES AND LINES OF FORCE 


Drawing a vector field is not always easy. Even the spin field looks messy when the 
vectors are too long (they go in circles and fall across each other). The circles give a 
clearer picture than the vectors. In any field, the vectors are tangent to ‘field lines"— 
which in the spin case are circles. 

DEFINITION C is a field line or integral curve if the vectors F{jc, y) are tangent to C. 
The slope dyjdx of the curve C equals the slope N/M of the vector F = Afi + Nj: 

*!>.!!!*£ f = -- lot the .pin Held\ (6) 

dx M(x, jO V y } 

We are still drawing the field of vectors, but now they are infinitesimally short. 
They are connected into curves! What is lost is their length, because S and S/r and 
S/r 2 all have the same field lines (circles). For the position field R and gravity field 
R/r\ the field lines are rays from the origin. In this case the “curves” are actually 
straight. 

EXAMPLE 8 Show that the field lines for the velocity field V = yi + xj are hyperbolas. 
dy N x 

dx M y 


y dy = x dx 


?y 2 ” I* 2 = constant. 
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Fig. 15.4 Velocity fields are tangent to streamlines. Gradient fields also have equipotentials. 

At every point these hyperbolas line up with the velocity V. Each particle of fluid 
travels on a field line . In fluid flow those hyperbolas are called streamlines. Drop a 
leaf into a river, and it follows a streamline. Figure 15.4 shows the streamlines for a 
river going around a bend 

Don't forget the essential question about each vector field. Is it a gradient held? 
For V = yi + xj the answer is yes , and the potential is /= xy : 

the gradient of xy is (df/dx) i + (df/dy) j = yi + xy (7) 

When there is a potential, it has level curves. They connect points of equal potential, 
so the curves /(x, y) — c are called equipotentials . Here they are the curves xy = c — 
also hyperbolas. Since gradients are perpendicular to level curves, the streamlines are 
perpendicular to the equipotentials. Figure 1 5.4 is sliced one way by streamlines and 
the other way by equipotentials, 

A gradient field F = dfjdx i + df/dy j is tangent to the field lines (stream- 
lines) and perpendicular to the equipotentials (level curves of /). 

In the gradient direction / changes fastest. In the level direction / doesn't change at 
all. The chain rule along/(x,y) = c proves these directions to be perpendicular: 

df dx df dy n t t . 

— — + — — = 0 or (grad /) ’(tangent to level curve) = 0. 
ox dt cy dt 

EXAMPLE 9 The streamlines of S/r 2 are circles around (0, 0). The equipotentials are 
rays 8 = c. Add rays to Figure 1 5,2 for the gradient field S/r 2 . 

For the gravity field those are reversed. A body is pulled in along the field lines (rays). 
The equipotentials are the circles where/ = l/r is constant. The plane is crisscrossed 
by “orthogonal trajectories” — curves that meet everywhere at right angles. 

If you bring a magnet near a pile of iron filings, a little shake will display the field 
lines. In a force field, they are “lines of force.” Here are the other new words. 

Vector field F(x, y> z) = Mi + + Pk Plane field F = M{x , y)i + N(x, y\ j 

Radial field: multiple of R =*= xi + yj + zk Spin field: multiple of S = — yi + xj 

Gradient field = conservative field: M =; dfjdx , N = df/dy t P =*= dfjdz 

Potential /(x, y) (not a vector) Equipotential curves /(x, y) — c 

Streamline ~ field line = integral curve: a curve that has F(x, y) as its tangent 
vectors. 
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15.1 EXERCISES 


Read-through questions 

A vector field assigns a q to each point ( x , y) or (x, y, z). 

In two dimensions F(x,y) = , b , i 4 c j. An example is 

the position field R = a . Its magnitude is |R| = e 
and its direction is t It is the gradient field for /= 
g The level curves are h and they are i to 
the vectors R. 

Reversing this picture, the spin field is S= J Its mag- 
nitude is |S| = k and its direction is I It is not a 
gradient field, because no function has dfjdx ~ m and 

dfjdy = n . S is the velocity field for flow going o . 

The streamlines or P lines or integral q are r . 

The flow field pV gives the rate at which » is moved 
by the flow. 

A gravity field from the origin is proportional to F = t 
which has |FI = u . This is Newton’s y , , square law. 
It is a gradient field, with potential / = * . The equipoten- 

tial curves /(x, y) = c are They are v to the field 

lines which are This illustrates that the A of a 

function f(x, y) is b to its level curves* 

The velocity field yi4xj is the gradient of /=_£_. Its 
streamlines are D . The slope dyjdx of a streamline equals 
the ratio e of velocity components* The field is f to 
the streamlines. Drop a leaf onto the flow, and it goes along 
G . 

Find a potential /(x, y) for the gradient fields 1-8. Draw tbe 
streamlines perpendicular to the equipotentials /(x, y) = c. 

1 F = i + 2j (constant field) 2 F = xi + j 


Find equations for the streamlines in 19-24 by solving dyjdx - 
NjM (including a constant C)* Draw the streamlines. 

19 F = i — j 20 F — i + xj 

21 F = S (spin field) 22 F = S jr (spin field) 

23 F = grad (x/y) 24 F = grad (2x + y). 

25 The Earth’s gravity field is radial, but in a room the field 

lines seem to go straight down into the floor. This is because 
nearby field lines always look . 

26 A line of charges produces the electrostatic force field F — 
R/r 2 — (xi 4 yj)/(x 2 + y 2 )* Find the potential /(x, y). (F is also 
the gravity field for a line of masses.) 

In 27-32 write down tbe vector fields Mi 4- Njj. 

27 F points radially away from the origin with magnitude 5. 

28 The velocity is perpendicular to the curves x 3 4 y 3 = c and 
the speed is 1. 

29 The gravitational force F comes from two unit masses at 
(0,0) and (1,0). 

30 The streamlines are in the 45° direction and the speed is 4. 

31 The streamlines are circles clockwise around the origin 
and the speed is 1. 

32 The equipotentials are the parabolas y = x 2 4 c and F is 
a gradient field. 

33 Show directly that the hyperbolas xy = 2 and x 2 —y 2 = 3 
are perpendicular at the point (2, 1), by computing both slopes 
dyjdx and multiplying to get — L 


3 F = cos(x 4 y)i 4cos(x 4y)i 4 F = (1/y)i — (x/y 2 )j 

5 F = (2xi 4 2yj)/(x 2 4 y 2 ) 6 F = x 2 i 4 y 2 j 

7 F = xyi 4 j 8 F = yfyi 4 j 

9 Draw the shear field F = xj. Check that it is not a gradient 
field: If dfjdx — 0 then dfjdy = x is impossible* What are the 
streamlines (field lines) in the direction of F? 

10 Find all functions that satisfy dfjdx = — y and show that 
none of them satisfy dfjdy = x. Then the spin field S = 
— yi 4 xj is not a gradient field. 


Compute dfjdx and dfjdy in 11-18. Draw the gradient field 
F = grad / and the equipotentials f[x, y) = c: 

12 f—x — 3y 

14 /= (x — l) 2 4 y 2 

16 f=e x cos y 


11 /= 3x 4 y 
13 /=x 4y 2 
tS f—x 2 — y 1 


17 f=e*~ 


1 Sf**yfx 


34 The derivative of /(x, y) = c\sf x + fadyjdx) = 0. Show that 
the slope of this level curve is dyjdx = — MjN. It is perpendic- 
ular to streamlines because (— MjN){NjM) = 

35 The x and y derivatives of f(r) are dfjdx = and 

dfjdy = by the chain rule* (Test f=r 2 .) The equi- 
potentials are „ 

36 F = (ax 4 fry)i 4 (bx 4 cy)j is a gradient field. Find the 
potential / and describe the equipotentials. 

37 True or false: 

L The constant field i 4 2k is a gradient field* 

2* For non -grad ient fields, equipotentials meet stream- 
lines at non- right angles* 

3. In three dimensions the equipotentials are surfaces 
instead of curves. 

4. F = x 2 i 4 y 2 j 4 2 2 k points outward from (0, 0, 0)— 
a radial field. 

38 Create and draw / and F and your own equipotentials 
and streamlines. 
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39 How can different vector fields have the same streamlines? 
Can they have the same equi potentials? Can they have the 
same/? 


40 Draw arrows at six or eight points to show the direction 
and magnitude of each field: 

(a) R + S (b) R/r — S/r (c) x 2 i + x 2 \ (d)yi. 


15.2 Line Integrals 


A line integral is an integral along a curve. It can equal an area, but that is a special 
case and not typical. Instead of area, here are two important line integrals in physics 
and engineering: 




Work along a curve = 


FT ds 
c 


Flow across a curve = 


F ■ n ds. 

c 


In the first integral, F is a force field ♦ In the second integral, F is a flow field. Work 
is done in the direction of movement, so we integrate F • T Flow is measured through 
the curve C, so we integrate F • n. Here T is the unit tangent vector, and F • T is the 
force component along the curve. Similarly n is the unit normal vector, at right angles 
with T Then F • n is the component of flow perpendicular to the curve. 

We will write those integrals in several forms. They may never be as comfortable 
as j y(x) dx , but eventually we get them under control. I mention these applications 
early, so you can see where we are going. This section concentrates on work, and 
flow comes later. (It is also called flux — the Latin word for flow.) You recognize ds 
as the step along the curve, corresponding to dx on the x axis. Where j dx gives the 
length of an interval (it equals b — a), j ds is the length of the curve. 


EXAMPLE 1 Flight from Atlanta to Los Angeles on a straight line and a semicircle. 


According to Delta Airlines, the distance straight west is 2000 miles. Atlanta is at 
(1000,0) and Los Angeles is at (—1000,0), with the origin halfway between. The 
semicircle route C has radius 1000. This is not a great circle route . It is more of a 
“flat circle,” which goes north past Chicago, No plane could fly it (it probably goes 
into space). 

The equation for the semicircle is + y 2 = 1000 2 * Parametrically this path is x = 
1000 cos f, y = 1000 sin t ♦ For a line integral the parameter is better. The plane leaves 
Atlanta at t = 0 and reaches L.A. at r = tt, more than three hours later. On the straight 
2000-mile path, Delta could almost do it. Around the semicircle C, the distance is 
IOOOtt miles and the speed has to be 1000 miles per hour. Remember that speed is 
distance ds divided by time dt: 

dsldt = yf{, dxjdi ) 2 + (dyidi) 1 = 1 000^/( — sin t) 2 + (cos t) 2 — 1000. (1) 

The tangent vector to C is proportional to (dx/dt, dyjdt) = (— 1000 sin t, 1000 cos r). 
But T is a unit vector, so we divide by 1000— which is the speed. 

Suppose the wind blows due east with force F = Mi. The components are M and 
zero. For Af = constant, compute the dot product F*T and the work —2000 M: 


F ■ T = Mi * (— sin r i + cos r j) = M(— sin r) + 0(cos t) = — M sin r 


F-T ds = 

c 


{- M sin f ) 


= o 



- IT 

— 1000 A/ sin t dt = — 2000 Atf. 

. o 
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Work is force times distance moved. It is negative, because the wind acts against the 
movement. You may point out that the work could have been found more simply — 
go 2000 miles and multiply by — M + 1 would object that this straight route is a 
different path. But you claim that the path doesn't matter — the work of the wind is 
— 2000M on every path. I concede that this time you are right (but not always). 

Most line integrals depend on the path. Those that don't are crucially important. 
For a gradient field, we only need to know the starting point P and the finish Q, 


15B When F is the gradient of a potential function f(x , y ), the work J c F • T ds 
depends only on the endpoints P and Q, The work is the change in /: 


if F = df/dx i + dfjdy j then 


F-T ds =f{Q) -f(P). 


( 2 ) 


When F = Mi, its components M and zero are the partial derivatives of/= Mx. To 
compute the line integral, just evaluate/ at the endpoints. Atlanta has x = 1000, Los 
Angeles has x = — 1000, and the potential function f— Mx is like on antiderivative : 

work =f(Q) -f(P) = M(-1000) - M(1000) = - 2000M. (3) 



LAX 
- 1000 



AIL 

1000 


depends on path 

Eg. 15.5 Force Mi, work — 2000M on all paths, f orce Myi, no work on straight path. 


May I give a rough explanation of the work integral J F ■ T ds? It becomes clearer 
when the small movement T ds is written as dx i + dy j, The work is the dot product 
with F: 

F ■ T(/s = \-f-i + T^j J ■ (e/jf i — v j) = f dx + °f dy = df (4) 

V<va- ay j ' cx c y 

The infinitesimal work is df The total work is J df= f(Q) —f(P). This is the Fundamen- 
tal Theorem for a line integral. Only one warning: When F is not the gradient of any 
/ (Example 2), the Theorem does not apply. 


EXAMPLE 2 Fly these paths against the non-constant force field F = Myi. Compute 
the work. 


There is no force on the straight path where y = 0. Along the x axis the wind does 
no w r ork. But the semicircle goes up where y = 1000 sin i and the wind is strong: 

F - T = (Myi) * (— sin ;i + cos t j) = — My sin t = — 1000M sin 2 £ 


r x 


FT ds = 


(— 1000M sin 2 ;) — dt = 

at 


C* n 

-10 fi M sin 2 ; dt= - -10 ft M, 

Jo 2 


This work is enormous (and unrealistic). But the calculations make an important 
point — everything is converted to the parameter t. The second point is that F = Myi 
is not a gradient field. First reason : The work was zero on the straight path and 
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nonzero on the semicircle. Second reason : No function has dfjdx = My and dfjdy = 
0. {The first makes / depend on y and the second forbids it. This F is called a shear 
force.) Without a potential we cannot substitute P and Q — and the work depends 
on the path. 


THE DEFINITION OF LINE INTEGRALS 


We go back to the start, to define j F -T ds . We can think of F -T as a function g(x, y ) 
along the path, and define its integral as a limit of sums: 

| & = limit of £ gix it y^ASj as (As) m „-*0. (5) 

Jc i= 1 

The points (x h y t ) lie on the curve C, The last point Q is (x N , y N ); the first point P is 
(x 0t Vo). The step A is the distance to (x h y t ) from the previous point As the steps 
get small (As 0) the straight pieces follow the curve. Exactly as in Section 8,2, the 
special case g = 1 gives the arc length. As long as g(x, y) is piecewise continuous 
(jumps allowed) and the path is piecewise smooth (corners allowed), the limit exists 
and defines the line integral. 

When g is the density of a wire, the line integral is the total mass. When g is F “T s 
the integral is the work. But nobody does the calculation hy formula (5), We now 
introduce a parameter t — which could be the time, or the arc length s, or the distance 
x along the base. 

The differential ds becomes [ds/dt)dt Everything changes over to t. 


I 


g(x, )¥s = 


«(x(0. JM) y/{dx/dt) 2 + (dy/dt) 2 dt. 

t = a 


( 6 ) 


The curve starts when t = a, runs through the points (x(r), y(r)), and ends when t = b. 
The square root in the integral is the speed dsjdt. In three dimensions the points on 
C are (x(f), and ( dzjdt ) 2 is in the square root. 


EXAMPLE 3 The points on a coil spring are {jc, y, z) = {cos t, sin t, r). Find the mass 
of two complete turns (from t = 0 to t = 4it) if the density is p = 4. 

Solution The key is (dx/dt) 1 + (dyjdt) 2 + {dzjdt) 1 — sin 2 ( + cos 2 t + 1=2. Thus 
dsjdt — v /2. To find the mass, integrate the mass per unit length which is g = p = 4: 

f 4- * ds f 4ir r~ 
mass = I p—dt = j 4^/2 dt = I6yj2n. 

That is a line integral in three-dimensional space. It shows how to introduce t. But 
it misses the main point of this section, because it contains no vector field F, This 
section is ahout w or It, not just mass. 


DIFFERENT FORMS OF THE WORK INTEGRAL 


The work integral j F -T ds can be written in a better way. The force is F = Af i + JVj. 
A small step along the curve is dx i + dy j. Work is force times distance, but it is only 
the force component along the path that counts. The dot product F *Tds finds that 
component automatically. 
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15C The vector to a point on C is R = 

xi 4 yj. Then dR = T ds = dx i 4 dy j: 

work — 

A 

bdu = 

c 

* 

M dx + N dy. (7) 

c 

Along a space curve the work is j F * T ds = J F • dR = | M dx 4 N dy 4 P di. 


The product M dx is (force in x direction ((movement in x direction). This is zero if 
either factor is zero. When the only force is gravity, pushing a piano takes no work. 
It is friction that hurts. Carrying the piano up the stairs brings in Pdz , and the total 
work is the piano weight P times the change in z. 

To connect the new j F • JR with the old J F * T ds , remember the tangent vector 
T, It is dRjds. Therefore Tds is d R. Th e best for comput ations is d R, because the 
unit vector T has a division by dsjdt = {dxjdt) 2 + (dyjdt) 2 . Later we multiply by this 
square root, in converting ds to (dsjdt)dt. It makes no sense to compute the square 
root, divide by it, and then multiply by it. That is avoided in the improved form 
j M dx 4 N dy. 


EXAMPLE 4 Vector field F = - yi 4 xj, path from (1,0) to (0, 1): Find the work. 

Note 1 This F is the spin field S. It goes around the origin, while R = xi 4 yj goes 
outward. Their dot product is F • R = — yx 4 xy = 0. This does not mean that 
F • dR = 0. The force is perpendicular to R, but not to the change in R. The work to 
move from (1,0) to (0, 1), x axis to y axis, is not zero. 

Note 2 We have not described the path C That must be done. The spin field is not 
a gradient field, and the work along a straight line does not equal the work on a 
quarter-circle: 


straight line x = \ — t, y = t 

Calculation of work Change F* dR 


Straight line: 


— y dx + x dy 


Quarter-circle: 


— y dx 4 x d r = 


quarter-circle x = cos r, y = sin t. 

M dx 4 jV dy to the parameter f: 

* i 

i — f(— df) + ( 1 -r)df = 1 
o 

' TT; 2 

sin t(— sin t dt) 4- cos f(co$ t dt) = 


71 

y 


General method The path is given by x(t) and y(t). Substitute those into M(x, y) 
and /V(x, y ) — then F is a function of L Also find dxjdt and dyjdt. Integrate 
M dxjdt + N dyjdt from the starting time t to the finish. 



Fig. 15.6 Three paths for f F * dR = j —y tlx 4 x dy = 1, tt/2, 0. 


* 
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For practice, take the path down the x axis to the origin {x = 1 — t, y — 0), Then 
go up the y axis (x = 0, y = t — 1). The starting time at (1, 0) is t = 0. The turning time 
at the origin is f = 1, The finishing time at (0, 1) is t = 2. The integral has two parts 
because this new path has two parts: 

Bent path: J — y dx + xdy = 0 + 0 (y = 0 on one part, then x = 0). 

Note 3 The answer depended on the path, for this spin field F = S, The answer did 
not depend on the choice of parameter. If we follow the same path at a different 
speed, the work is the same. We can choose another parameter t, since (dsjdt)dt and 
(ds/di)di both equal ds. Traveling twice as fast on the straight path (x = 1 — 2 t, 
y = 2t) we finish at z = | instead of t = L The work is still 1 : 

r r ur 

— y dx + x dy = (— 2t)(— 2 dx) + (1 — 2r)(2dx) = 2 dx = 1. 

J Jo Jo 

CONSERVATION OF TOTAL ENERGY (KINETIC + POTENTIAL} 

When a force field does work on a mass m, it normally gives that mass a new velocity, 
Newton’s Law is F = ma = m dy/dt, (It is a vector law. Why write out three compo- 
nents?) The work J F * dR is 


JV dvjdt) ■ (y dt) = fay • y]° = *m] y(Q) | 2 - WHP)\ 2 - 


( 8 ) 


The work equals the change in the kinetic energy im|v| 2 . But for a gradient field the 
work is also the change in potential — with a minus sign from physics: 


work = |f • dR = - | df=f{P) —f[Q). 


(9) 


Comparing (8) with (9), the combination jm|v| 2 +/ is the same at P and Q. The total 
energy , kinetic plus potential, is conserved, 

INDEPENDENCE OF PATH: GRADIENT FIELDS 


The work of the spin field S depends on the path. Example 4 took three paths — 
straight line, quarter-circle, bent line. The work was 1, n/2, and 0, different on each 
path. This happens for more than 99.99% of all vector fields. It does not happen for 
the most important fields. Mathematics and physics concentrate on very special 
fields — for which the work depends only on the endpoints. We now explain what 
happens, when the integral is independent of the path. 

Suppose you integrate from P to Q on one path, and back to P on another path. 
Combined, that is a closed path from P to P (Figure 15.7). But a backward integral 
is the negative of a forward integral, since dR switches sign. If the integrals from P 
to Q are equal , the integral around the closed path is zero : 


r p ro rp ro r c 

d> F *dR = F*dR + F 1 dR = F-dR- 
Jp J P JQ JP JP 


F-dR=0. 


( 10 ) 


closed 


path 1 back path 2 path 1 


path 2 


The circle on the first integral indicates a closed path. Later we will drop the P's. 

Not all closed path integrals are zero! For most fields F, different paths yield 
different work. For “ conservative ” fields, all paths yield the same work. Then zero 
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work around a dosed path conserves energy. The big question is: How to decide 
which fields are conservative, without trying all paths ? Here is the crudal information 
about conservative fields, in a plane region R with no holes: 

15D F = M(x, y) i + N(x , y)j is a conservative field if it has these properties: 

A. The work |F*JR around every dosed path is zero. 

B. The work depends only on P and Q , not on the path. 

C F is a gradient field : M = dfjdx and N = df/dy for some potential /(x, y), 

D. The components satisfy dMjdy = dN}dx. 

A field with one of these properties has them all, D is the quick test 


These statements A D bring everything together for conservative fields (alias gradient 
fields). A closed path goes one way to Q and back the other way to P. The work 
cancels, and statements A and B are equivalent. We now connect them to C Note: 
Test D says that the “curl" of F is zero. That can wait for Green's Theorem in the 
next section — the full discussion of the curl comes in 15.6. 

First, a gradient field F — grad f is conservative. The work is f(Q)—f(P\ by the 
fundamental theorem for line integrals. It depends only on the endpoints and not the 
path. Therefore statement C leads back to B. 

Our job is in the other direction, to show that conservative fields Mi + N\ are 
gradients. Assume that the work integral depends only on the endpoints. We must 
construct a potential / so that F is its gradient. In other words, cfjcx must be M and 
efidy must be N t 

Fix the point P. Define j\Q) us the work to reach Q. Then F equals grad / 

Check the reasoning. At the starting point T, / is zero. At every other point Q % f is 
the work j M dx + N dy to reach that point. All paths from P to Q give the same j\Q ), 
because the field is assumed conservative. After two examples wc prove that grad / 
agrees with F — the construction succeeds. 



Fig. 15.7 Conservative fields: JfFw/R = () and fj?F*dR = /(Q) f{P) r Her cf(P} = 0. 


EXAMPLE 5 Find f(x. y) when F = Mi + N\ = 2.x yi + v 2 j, We w r ant vf cx = 2xy 
and cficy = x 2 . 

Solution 1 Choose P = {0, 0). Integrate M dx + N dy along to (x, 0) and up to (w y): 

2xy dx = 0 {since y = 0) x 2 dy = x 2 \ (which is /). 

JlO.Ol JtA'.Dj 

Certainly /= x 2 y meets the requirements:/* = 2 xy and /. = x 2 . Thus F = grad / Note 
that dy = 0 in the first integral (on the x axis). Then dx = 0 in the second integral 
(x is fixed). The integrals add to/= x 2 y\ 
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Solution 2 Integrate 2xy dx + x 2 dy on the straight line (xt, yt ) from r - 0 to r — 1 : 

’i p 

2(xf)(yt)(x dt) + (xf) 2 (y df) ~ 3 x 2 yt 2 dt - x 2 yr 3 ]o = x 2 y. 

Jo Jo 

Most authors use Solution 7, I use Solution 2, Most students use Solution 3 : 

Solution 3 Directly solve dfjdx = M = 2 xy and then fix up df/dy = N = x 2 : 

dfjdx = 2 xy gives / = x 2 y ( plus any function of y). 

In this example x 2 y already has the correct derivative dfjdy = x 2 . No additional 
function of y is necessary. When we integrate with respect to x , the constant of 
integration (usually C) becomes a function C(y ). 

You will get practice in finding / This is only possible for conservative fields! I 
tested M = 2 xy and N = x 2 in advance (using D) to be sure that dMjdy — dNjdx . 

EXAMPLE 6 Look for /(x, y) when Mi + iVj is the spin field — yi -I- xj. 

Attempted solution 1 Integrate — y dx -f xdy from (0, 0) to (x, 0) to (x, y): 

n*. o) r(jc.» 

— y dx = 0 and x dy = xy (which seems like /). 

JfO p O) J [x* 0) 

Attempted solution 2 Integrate — y dx + xdy on the line (x£, yt) from £ = 0 to 1 : 

* I 

—(yt)(xdt) + (x£)(ydt) = 0 (a different / also wrong). 

Jo 

Attempted solution 3 Directly solve dfjdx — — y and try to fix up dfjdy = x: 

dfjdx — — y gives f — — xy (plus any function C(y)), 

The y derivative of this / is — x + dC/dy. That does not agree with the required 
dfjdy = x. Conclusion : The spin field -yi + xj is not conservative . There is no / 
Test D gives dMjdy — — 1 and dNjdx = + 1. 

To finish this section, we move from examples to a proof. The potential f{Q) is 
defined as the work to reach Q , We must show that its partial derivatives are M and 
N. This seems reasonable from the formula f{Q) — j\ldx + Ndy , but we have to 
think it through. 

Remember statement A, that all paths give the same /{()). Take a path that goes 
from P to the left of Q. It comes in to Q on a line y = constant (so dy = 0). As the 
path reaches Q t we are only integrating Mdx. The derivative of this integral, at Q , is 
dfjdx = AT That is the Fundamental Theorem of Calculus. 

To show that dfjdy = N, take a different path. Go from P to a point below Q. The 
path comes up to Q on a vertical line (so dx = 0). Near Q we are only integrating 
N dy , so dfjdy — N, 

The requirement that the region must have no holes will be critical for test D. 

EXAMPLE 7 Find /(x, y) = ^ xdx + y dy. Test D is passed: dNjdx = 0 = dMjdy. 

Solution 1 JJ^xdx = jx 2 is added to J[*[o>ydy — \y 2 . 

Solution 2 J* (xf)(x df) + (yr)(y dt) = (x 2 + y 2 )t dr = i(x 2 + y 2 ). 

Solution 3 dfjdx — x gives / = ^x 2 + C(y). Then dfjdy — y needs C(y) = jy 2 . 
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15.2 EXERCISES 


Read-through questions 

Work is the o of F *dR, Here F is the and R is 

the c . The d product finds the component of o 

in the direction of movement 4R = dxi + dy], The straight 
path (x s y) = {£, 2t) goes from > at £ = Q to fl at t = 
1 withdR = dt i + ft . The work of F = 3i + j is jF*dR = 
f ' dt - I . 

Another form of dR is T ds, where T is the * vector to 
the path and ds = ./ < . For the path (£, 2t), the unit vector 

T is ™ and ds= n dr. For F = 3i + j, F ’Tds is still 
o df. This F is the gradient of f— P . The change in 
/= 3x + y from {0, 0) to (1, 2) is q . 

When F = grad/, the dot product F*dR is (dffdx)dx + 
r = df. The work integral from P to Q is $df= J . 
In this case the work depends on the t but not on the 
_u_, Around a closed path the work is v . The field is 
called w . F = (l+y)i + xi is the gradient of * , 

The work from (0, 0) to (1 , 2) is v . the change in potential. 

For the spin field S = * the work (does) (does not) 

depend on the path. The path (x, y) = (3 cos t t 3 sin r) is a 
circle with S-dR= A . The work is & around the 
complete circle. Formally jgfx,y)ds is the limit of the sum 
c . 

The four equivalent properties of a conservative field F = 
Mi + N\ are A: P . B: _i_, C: f . and D: Q , 
Test D is (passed)(not passed) by F = (y + l)i + xj. The work 
j F * dR around the circle (cos £ t sin t) is h . The work on 
the upper semicircle equals the work on i . This field is 
the gradient of so the work to (—1, 0} is * > 

Compute the line integrals in 1-6, 

1 J c ds and j c dy: x = r* y = 2t, 0 < £ ^ I. 

2 J c xds and J c xyds: x = cos £, y — sin U 0 *£ £ *£?i/2, 

3 f c xyds: bent line from (0, 0) to (1, 1) to (1, 0). 

4 j c ydx -xdy: any square path, sides of length 3, 

5 jcdx and jcyd x: any dosed circle of radius 3. 

6 \ c {d$!dt)dt: any path of length 5. 

7 Does J^xydy equal ixy 2 ]p? 

8 Does jpxdx equal 

9 Does l\ c ds) 2 = (jc^) 2 + (JcdjO 2 ? 

10 Does j c (ds) 2 make sense? 

In 11-16 find the work in moving from (1, 0) to (0, 1), When F 
is conservative, construct / Choose your own path when F is 
not conservative. 

11 F = i + yj 12 F = yi+j 


13 F = xy 2 j + yx 2 j 14 F = e p t + xe y i 

15 F = (x/r)i + {y/r)j 16 F=-y 2 i + x 2 j 

17 For which powers n is S/r" a gradient by test D? 

18 For which powers n is R/r" a gradient by test D? 

19 A wire hoop around a vertical circle x 2 + z 2 = a 2 has 
density p = a + z. Find its mass M = j pds. 

20 A wire of constant density p lies on the semicircle 
x 2 + y a = fl 2 , y^Q. Find its mass M and also its moment 
M x = f py ds. Where is its center of mass x = M,/M, y - MJ 
M? 

21 If the density around the circle x 2 + y 2 = a 2 is p — x 2 , what 
is the mass and where is the center of mass? 

22 Find |F*dR along the space curve x = f, y = f 2 , z = £ 3 , 
0<f < 1, 

(a) F = grad (xy + xz) (b) F = yi - xj + zk 

23 (a) Find the unit tangent vector T and the speed dsfdt 
along the path R = 2ri + t 2 j. 

(b) For F = 3xi + 4j, find F'Tds using (a) and F-</R 
directly. 

(c) What is the work from (2, 1} to (4, 4)? 

24 If M(x, y, z ) i + N{x, y, z)j is the gradient of f(x> y> z\ show 
that none of these functions can depend on z . 

25 Find all gradient fields of the form M(y)i + N(x)j. 

26 Compute the work W(x, = dx + N dy on the 
straight line path (x£, y£) from r = 0 to f = 1, Test to see if d Wj 
dx = M and dWjdy = N, 

(a) M = y\N = 3xy 2 (b) M = x\N = 3yx 2 

(c) M = x/y, N = y/x (d) M = N = e* +y 

21 Find a field F whose work around the unit square (y = 0 
then x = 1 then y = 1 then x = 0) equals 4, 

28 Find a nonconservative F whose work around the unit 
circle x 2 + y 2 = 1 is zero. 

In 29-34 compute jF*dR along tbe straight line R = d + fj 
and the parabola R = ft + f 2 j, from (0, 0) to (1, I)l When F is a 
gradient field, use its potential /(x> y). 

29 F = i — 2 j 30 F = x 2 j 

31 F = 2xy 2 t + 2yx 2 j 32 F = x 2 yi + xy 2 j 

33 F = yi — x] 34 F - (xi + y])/(x 2 + y 2 + 1) 

35 For which numbers a and b is F = uxyi + (x 2 + fiy)j a 
gradient field? 

36 Compute j-ydx + xdy from (1,0) to (0, 1) on the fine 
x = 1 — t 2 , y = t 2 and the quarter-circle x = cos 2t, y — sin 2£, 
Example 4 found 1 and nfl with different parameters. 
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Apply the test N x = M y to 37-42. Find/ when test D m passed. 

37 F ^ y 2 e~ x i — 2ye _J j 38 F = yVi — 2 

grad jcj' 

I grad xy| 

42 F = (a* + by) i + (c* + rfylj 


39 F = 


xi + yi 


l^ + rfl 

41 F = R + S 


43 Around the unit circle find §ds and and $xds. 

44 True or false, with reason: 

(a) When F = yi the line integral jF*JR along a curve 
from P to Q equals the usual area under the curve. 

(b) That line integral depends only on P and Q t not on the 
curve. 

(c) That line integral around the unit circle equals it. 



15.3 Green's Theorem 


This section contains the Fundamental Theorem of Calculus, extended to two dimen- 
sions. That sounds important and it is. The formula was discovered 1 SO years after 
Newton and Leibniz, by an ordinary mortal named George Green. His theorem 
connects a double integral over a region R to a line integral along its boundary C. 

The integral of dfjdx equals f(b) — f{a). This connects a one-dimensional integral 
to a zero-dimensional integral. The boundary only contains two points a and b\ The 
answer f(b) — f{a) is some kind of a “point integral.” It is this absolutely crucial idea — 
to integrate a derivative from information at the boundary — that Green’s Theorem 
extends into two dimensions. 

There are two important integrals around C. The work is J F • T ds = J Af dx + N dy. 
The flux is J F • n ds = J M dy — N dx (notice the switch). The first is for a force field, 
the second is for a flow field. The tangent vector T turns 90 s clockwise to become 
the normal vector n. Green’s Theorem handles both, in two dimensions. In three 
dimensions they split into the Divergence Theorem (15.5) and Stokes’ Theorem (15.6). 

Green’s Theorem applies to “smooth” functions M(x,y) and N(x,y), with con- 
tinuous first derivatives in a region slightly bigger than II. Then all integrals are well 
defined. M and N will have a definite and specific meaning in each application — to 
electricity or magnetism or fluid flow or mechanics. The purpose of a theorem is to 
capture the central ideas once and for all. We do that now, and the applications 
follow. 


15E Green's Theorem Suppose the region R is bounded by the simple 
closed piecewise smooth curve C. Then an integral over R equals a line integral 
around C: 




a) 


A curve is “simple” if it doesn’t cross itself (figure 8's are excluded). It is “closed” if 
its endpoint Q is the same as its starting point P. This is indicated by the closed circle 
on the integral sign. The curve is “smooth” if its tangent T changes continuously — 
the word “piecewise” allows a finite number of comers. Fractals are not allowed, but 
all reasonable curves are acceptable (later we discuss figure 8’s and rings). First comes 
an understanding of the formula, by testing it on special cases. 
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I x 7 dy 


Fig. 15.8 Area of R adds up strips: Jx dy 



jj dx dy and | y dx = — JJ dy dx. 


Special case 1: M = 0 and N = x. Green’s Theorem with dN/dx = 1 becomes 



1 dxdy 


(which is the area of R ). 


( 2 ) 


The integrals look equal, because the inner integral of dx is x. Then both integrals 
have xdy — but we need to go carefully. The area of a layer of R is dy times the 
difference in x (the length of the strip). The line integral in Figure 15.8 agrees. It has 
an upward dy times x (at the right) plus a downward — dy times x (at the left). The 
integrals add up the strips, to give the total area. 

Special case 2: M = y and N = 0 and § c ydx = J j R (— 1 ) dx dy = — (area of R). 

Now Green’s formula has a minus sign, because the line integral is counterclockwise. 
The top of each slice has dx <0 (going left) and the bottom has dx > 0 (going right). 
Then ydx at the top and bottom combine to give minus the area of the slice in 
Figure 15.8b. 

Special case 3: § 1 dx = 0. The dx' s to the right cancel the dx's to the left (the curve 
is closed). With M = 1 and N = 0, Green’s Theorem is 0 = 0. 


Most important case: Mi + N] is a gradient field. It has a potential function /(x, y). 
Green’s Theorem is 0 = 0, because dM/dy = dN/dx . This is test D: 


c_M 

dy 



. , dN 

is the same as — = 

dx 



( 3 ) 


The cross derivatives always satisfy f yx = J xy . That is why gradient fields pass test D. 

When the double integral is zero, the line integral is also zero: M dx + N dy = 0. 
The work is zero. The field is conservative ! This last step inA=>B=>C=>D=>A will 
be complete when Green's Theorem is proved. 

Conservative examples are §xdx = 0 and §ydy = 0. Area is not involved. 

Remark The special cases §xdy and —§ydx led to the area of R. As long as 
1 = dN/dx — dM/dy\ the double integral becomes JJ 1 dxdy. This gives a way to com- 
pute area by a line integral. 

The area of R is (j) xdy = — (j) ydx = ^<j) (xdy — ydx). (4) 


EXAMPLE 1 The area of the triangle in Figure 15.9 is 2. Check Green's Theorem. 

The last area formula in (4) uses £S, half the spin field. N =$x and M - — \y yield 
N x — M y = ^ -h \ = 1 . On one side of Green’s Theorem is jj 1 dx dy = area of triangle. 
On the other side, the line integral has three pieces. 
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(0, h) 


t 


( 2 . 0 ) 

Fig. 15.9 Green’s Theorem: Line integral 


around triangle, area integral for ellipse. 


Two pieces are zero: x dy — ydx = 0 on the sides where x = 0 and y = 0. The sloping 
side x = 2 — y has dx=—dy. The line integral agrees with the area, confirming 
Green’s Theorem: 


\_ 

2 


xdy — ydx = 



(2 - y)dy + ydy = 


1 C 2 

2* o 


2dy = 2. 


EXAMPLE 2 The area of an ellipse is nab when the semiaxes have lengths a and b ♦ 

This is a classical example, which all authors like. The points on the ellipse are 
x = a cos f, y = h sin f, as t goes from 0 to 2 n. (The ellipse has (xja) 2 + (y/b) 2 = F) 
By computing the boundary integral, we discover the area inside. Note that the 
differential xdy - ydx is just ab dt : 

(a cos t)(5 cos l dt) — (h sin r)(—a sin t dt) = ab(cos 2 t + sin 2 f)df = ahdt. 

The line integral is jjl* ab dt = nab , This area nab is nr 2 , for a circle with a = b = r. 

Proof of Green’s Theorem: In our special cases, the two sides of the formula were 
equal. We now show that they are always equal. The proof uses the Fundamental 
Theorem to integrate ( dN/dx)dx and (dM/dy)dy. Frankly speaking, this one-dimen- 
sional theorem is all we have to work with — since we don’t know M and N. 

The proof is a step up in mathematics, to work with symbols M and N instead of 
specific functions. The integral in (6) below has no numbers. The idea is to deal with 
M and N in two separate parts, which added together give Green’s Theorem: 

f ff dN 

dxdy and separately 0 N dy = — dxdy. (5) 

Jc Jin®* 


0 M dx 
c 


-H 


dM 

dy 


Start with a “very simple” region {Figure 15.10a). Its top is given by y=f(x) and 
its bottom by y = g(x). In the double integral, integrate —OM/dy first with respect to 
y. The inner integral is 


r/<*> 

J*<*> 


<5M 

dy 


dy = 




- M(xJ( x)) + M(x, g{x)). 


( 6 ) 


The Fundamental Theorem (in the y variable) gives this answer that depends on x. 
If we knew M and/ and g, we could do the outer integral — from x = a to x = b. But 
we have to leave it and go to the other side of Green’s Theorem — the line integral: 


®Mdx=f M(x, y)dx + j M(x, y)dx = f M(x,f{x)]dx+ j M(x, g(x))dx. (7) 
J J lop J bouom J b Ja 
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Compare (7) with (6). The integral of Af(x, g(x)) is the same for both. The integral of 
Af(x,/{x)) has a minus sign from (6). In (7) it has a plus sign but the integral is from 
h to a. So life is good. 

The part for N uses the same idea. Now the x integral comes first, because 
(8Nldx)dx is practically asking to be integrated — from x — G{y) at the left to x = F(y) 
at the right. We reach N(F(y) i y ) — N(G(y), y). Then the y integral matches § Ndy and 
completes (5). Adding the two parts of (5) proves Green's Theorem. 

Finally we discuss the shape of R . The broken ring in Figure 15.10 is not “very 
simple,” because horizontal lines go in and out and in and out. Vertical lines do the 
same. The x and y strips break into pieces. Our reasoning assumed no break between 
y —f(x) at the top and y-g(x) at the bottom. 

There is a nice idea that saves Green's Theorem. Separate the broken ring into 
three very simple regions R lt The three double integrals equal the three line 
integrals around the R' s. Now add these separate results , to produce the double 
integral over all of R> When we add the line integrals, the crosscuts CC are covered 
twice and they cancel. The cut between R x and R z is covered upward (around Ri) 
and downward (around R 2 ). That leaves the integral around the boundary equal to 
the double integral inside — which is Green's Theorem, 

When R is a complete ring, including the piece R 4 , the theorem is still true. The 
integral around the outside is still counterclockwise. But the integral is clockwise 
around the inner circle. Keep the region R to your left as you go around C. The 
complete ring is “doubly” connected, not “simply” connected. Green's Theorem 
allows any finite number of regions R; and crosscuts CC and holes. 

EXAMPLE 3 The area under a curve is y dx> as we always believed. 

In computing area we never noticed the whole boundary. The true area is a line 
integral — §ydx around the closed curve in Figure 15.11a. But y = 0 on the x axis. 
Also dx — 0 on the vertical lines (up and down at h and a). Those parts contribute 
zero to the integral of ydx , The only nonzero part is back along the curve — which 
is the area — jlydx or J* y dx that we know well. 

What about signs, when the curve dips below the x axis? That area has been 
counted as negative since Chapter l. 1 saved the proof for Chapter 15. The reason 
lies in the arrows on C. 

The line integral around that part goes the other way. The arrows are clockwise, 
the region is on the rights and the area counts as negative. With the correct rules, a 
figure 8 is allowed after all. 
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Fig. 15.11 Closed path gives the sign of the area. Nonconservative field because of hole. 


CONSERVATIVE FIELDS 


We never leave gradients alone! They give conservative fields — the work around a 
dosed path is /(P) — j\P) = 0. But a potential function f(x, y) is only available when 
test D is passed: Ifdfjdx = M and df/dy = N then SM/dy = dN/dx. The reason is that 

-fc- 

Some applications prefer the language of “differentials/’ Instead of looking for 
fix, y\ we look for df : 

DEFINITION The expression M(x, y)dx + N(x, y)dy is a differentia l form . When it 
agrees with the differential df = {dfjdx)dx + (8f(8y)dy of some function, the form 
is called exact. The test for cm exact differentia! is D: dN/dx = SM/dy . 

Nothing is new but the language* Is ydx an exact differential? No, because M y — 1 
and N x = 0* Is ydx + xdy an exact differential? Yes , it is the differential of f=xy. 
That is the product rule! Now comes an important example, to show why R should 
be simply connected ( a region with no holes). 


EXAMPLE 4 The spin field $/r 2 ={— yi + xj)/(x 2 + y 2 ) almost passes test D, 

st <L( x \ * 2 +.y 2 -*( 2 *) w d ( ~ y \ 

x 8x\x 2 + y 2 ) (x 2 + y 2 ) 2 y 8y\x 2 + y 2 ) (x 2 + y 2 ) 2 


( 8 ) 


Both numerators are y 2 — x 2 . Test D looks good. To find / integrate M — Sf/Sx : 


/(-v,y) = 


f 


-ye/x/(x 2 + y 2 ) = tan ‘(y/x) + C(y). 


The extra part C(y) can be zero — the y derivative of tan '(y/x ) gives N with no help 
from C(y). The potential f is the angle 0 in the usual x, y, r right triangle. 


Test D is passed and F is grad 6. What am I worried about? it is only this, 
that Green's Theorem on a circle seems to give 2n — 0. The double integral is 
ft (N x — M y )dx dy. According to (8) this is the integral of zero. But the line integral is 
2jt: 

<j)F * d R = (f)(-y dx + x dy)/(x 2 + y 2 ) - 2(area of circle)/ a 2 = 2na 2 /a 2 - 27t. (9) 


With x = a cos f and y = a sin ( we would find the same answer. The work is 2 n (not 
zero!) when the path goes around the origin. 

We have a paradox. If Green’s Theorem is wrong, calculus is in deep trouble. Some 
requirement must be violated to reach 2 tt = 0. Looking at S/r 2 , the problem is at the 
origin. The field is not defined when r = 0 (it blows up). The derivatives in (8) are not 
continuous, Test D does not apply at the origin, and was not passed. We could remove 
(0, 0), but then the region where test D is passed would have a hole. 
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It is amazing how one point can change everything- When the path circles the 
origin, the line integral is not zero. The potential function f= 0 increases by 2 n. That 
agrees with f F 'dR — 2n from (9). It disagrees with ft 0 dxdy. The 2n is right, the zero 
is wrong. N x — M y must be a “ delta function of strength 2n ” 

The double integral is 2 n from an infinite spike over the origin — even though N x = 
M y everywhere else. In fluid flow the delta function is a “vortex ” 

ROW ACROSS A CURVE: GREEN'S THEOREM TURNED BY 90° 

A flow field is easier to visualize than a force field, because something is really there 
and it moves. Instead of gravity in empty space, water has velocity M(x, y)i + N(x, y]j. 
At the boundary C it can flow in or out. The new form of Green’s Theorem is a 
fundamental “balance equation” of applied mathematics: 

Flow through C (out minus in) = replacement in R (source minus sink). 

The flow is steady. Whatever goes out through C must be replaced in R, When there 
are no sources or sinks (negative sources), the total flow through C must be zero. 
This balance law is Green’s Theorem in its “normal form” (foT n) instead of its 
“tangential form” (forT): 


15F For a steady flow field F - M(x , y ) i + N( jc, y] j, the flux f F *n ds through 
the boundary C balances the replacement of fluid inside R: 

(10) 


Figure 15.12 shows the 90° turn. T becomes n and “circulation” along C becomes 
flux through C In the original form of Green’s Theorem, change N and M to Af and 
— N to obtain the flux form: 


<j)Mdx + N dy -► O — N dx + Mdy — MJidxdy -► JJ{Af JC + N y )dxdy< (11) 


Playing with letters has proved a new theorem! The two left sides in (1 1) are equal, 
so the right sides are equal — which is Green’s Theorem (10) for the flux. The compo- 
nents M and N can be chosen freely and named freely. 

The change takes Mi + Nj into its perpendicular field - Ni + Afj. The field is turned 
at every point (we are not just turning the plane by 90°), The spin field S = — yi + xj 
changes to the position field R = xi + yj, The position field R changes to — S, Stream- 
lines of one field are equi potentials of the other field. The new form (10) of Green's 



Fig. 15.12 


circulation 


- rfT ds 

x 

/ \dx 




-jdr nds 

The perpendicular component F -n flows through C. Note n ds = dy i — dx j. 
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Theorem is just as important as the old one— in fact I like it better It is easier to 
visualize flow across a curve than circulation along it. 

The change of letters was just for the proof. From now on F — Mi 4* A/j. 

EXAMPLE 5 Compute both sides of the new form (10) for F — 2xi + 3yj. The region 
R is a rectangle with sides a and b. 

Solution This field has dM/dx + dN/dy = 2 + 3. The integral over R is 5 dx dy — 
Sab. The line integral has four parts, because R has four sides. Between the left and 
right sides, M = 2x increases by 2a. Down the left and up the right, \Mdy^2ab 
(those sides have length b). Similarly N = 3y changes by 3 h between the bottom and 
top. Those sides have length a, so they contribute 3 ab to a total line integral of 5 ah. 

Important: The "divergence” of a flow field is dM/dx + dN/dy. The example has 
divergence =5. To maintain this flow we must replace 5 units continually — not just 
at the origin but everywhere. {A one-point source is in example 7,) The divergence is 
the source strength, because it equals the outflow. To understand Greet Ts Theorem 
for any vector field M\ 4- Ny look at a tiny rectangle (sides dx and 

Flow out the right side minus flow in the left side = (change in M) times dy 
Flow out the top minus flow in the bottom = (change in N) times dx 
Total flow out of rectangle: dM dy -I- dN dx = (dM/dx 4 dN/dy)dx dy , 

The divergence times the area dxdy equals the total flow out . Section 15.5 gives more 
detail with more care in three dimensions. The divergence is M x + N y 4- P z . 


flux 3 ah 





Fig. 15.13 M x -h N y = 2 + 3 = 5 yields llux = 5(area) = Sab. The flux is dM dy + dN dx — 
(M x + N y ) dx dy. The spin field has no flux. 


EXAMPLE 6 Find the flux through a closed curve C of the spin field S = — yi 4 xj. 

Solution The field has M = — y and N — x and M x + Ny = 0. The double integral is 
zero. Therefore the total flow (out minus in) is also zero — through any closed curve. 
Figure 15.13 shows flow entering and leaving a square. No fluid is added or removed. 
There is no rain and no evaporation. When the divergence M x + N y is zero , there is 
no source or sink . 

FLOW FIELDS WITHOUT SOURCES 

This is really quite important. Remember that conservative fields do no work around 
C, they have a potential f and they have “zero curl.” Now turn those statements 
through 90°, to find their twins. Source-free fields have no flux through C, they have 
stream functions g , and they have L zero divergence T The new statements E-F-G-H 
describe fields without sources. 
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15G The field F — M(x, y ) i + N(x > y)j is source-free if it has these properties: 
E The total flu* § F • n ds through every closed curve is zero. 

F Across all curves from P to Q , the flux F • n ds is the same. 

G There is a stream function g(x 9 y\ for which M - 8g/8y and N ~ - dgjdx . 
H The components satisfy 8M/8x + 8N/8y — 0 (the divergence is zero). 

A field with one of these properties has them all. H is the quick test. 


The spin field — yi + xj passed this test (Example 6 was source-free). The field 
2xi + 3yj does not pass (Example 5 had M x + N y = 5). Example 1 almost passes. 


EXAMPLE 7 The radial field R/r 2 = (xi + yj)/{x 2 -f y 2 ) has a point source at (0, 0). 
The new test H is divergence = dMjdx + 8N/dy = 0. Those two derivatives are 


d( x \ x 2 + y 2 -> 
dx \x 2 + y 1 ) (x 2 + y 1 ' 


~x(2x) 

) 2 


and 


dy\x 2 + y 2 J 


x 2 + y 2 - j<2y) 


(x + y 2 ) 4 


2\2 


(12) 


They add to zero. There seems to be no source (if the calculation is correct). The flow 
through a circle x 2 + y 2 = a 2 should be zero. But it's not: 

(j)M dy — N dx = (j)(x dy — y dx)/(x 2 + y 1 ) = 2(area of circle)/a 2 — In. (13) 


A source is hidden somewhere. Looking at R/r 2 , the problem is at (0, 0). The field is 
not defined when r = 0 (it blows up). The derivatives in (12) are not continuous. Test 
H does not apply, and was not passed. The divergence M x + N y must be a “delta 
function” of strength 2 7 t. There is a point source sending flow out through all circles. 

1 hope you see the analogy with Example 4. The field S/r 2 is curl-free except at r = 0. 
The field R/r 2 is diver gence-free except at r = 0. The mathematics is parallel and the 
fields are perpendicular A potential /and a stream function g require a region without 
holes. 


THE BEST FIELDS: CONSERVATIVE AND SOURCE-FREE 


What if F is conservative and also source-fretf Those are outstandingly important 
fields. The curl is zero and the divergence is zero. Because the field is conservative, it 
comes from a potential. Because it is source-free, there is a stream function: 


df^dg 
cx cy 


and 



dg 

ex' 


(14) 


Those are the Cauchy-Riemann equations , named after a great mathematician of his 
time and one of the greatest of all time, I can't end without an example. 


EXAMPLE 8 Show that yi + xj is both conservative and source-free. Find / and g. 

Solution With M = y and N = x, check first that oMjdy = \ = dNjdx. There must 
be a potential function. It is / - xy, which achieves cfjcx=y and df/dy = x. Note 

that f xx -\-fy y — 0. 

Check next that dMjdx + dNjdy = 0 + 0. There must be a stream function. It is 
g = ±(y 2 - x 2 ), which achieves dgjdy - y and dgjdx - -x. Note that g xx + g yy - 0. 
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The curves / — constant are the equipotentials. The curves g = constant are the 
streamlines (Figure 15.4). These are the twin properties— a conservative field with 
a potential and a source-free field with a stream function. They come together into 
the fundamental partial differential equation of equilibrium — Laplace's equation 

fxx +fyy = 0 * 


15H There is a potential and stream function when M y - N x and M x = — N y . 
They satisfy Laplace's equation: 

fxx +fyy ~M x + N y = G and gxx + g yy ~ - N x + M y = 0. (15) 


If we have / without g> as in /=x 2 +y 2 and M = 2x and AT = 2y, we don’t have 
Laplace’s equation: f xx + f yy = 4. This is a gradient field that needs a source. 
If we have g without / as in g = x 2 + y 2 and Af = 2y and N = — 2x, we don’t have 
Laplace’s equation. The field is source-free hut it has spin. The first field is 2R and 
the second field is 2S. 

With no source and no spin, we are with Laplace at the center of mathematics and 
science. 


Green's Theorem: Tangential form ‘*T d$ and normal form $F ••nds 
^Md* + Ndy = JJ (N x —M y )dxdy ^ hf dy— Ndx = if. (M x +N y )dxdy 

work curl flux divergence 

Conservative: work - zero, N x — M y9 gradient of a potential: M-f x and N — f y 
Source-free: flux = zero, M x = — N y , has a stream function : M — g y and N — ~ g x 
Conservative 4* source-free: Cauchy- Riemann + Laplace equations for/ and g . 


15.3 EXERCISES 


Read-through questions 

The work integral $ M dx + N dy equals the double integral 
a by b *s Theorem. For F = 3i + 4j the work is c 
For F = _d and e .the work equals the area of R. When 
M — df/dx and N = df/dy , the double integral is zero because 
j The line integral is zero because q An example is 
F — h The direction on C is I around the outside and 

I around the boundary of a hole. If R is broken into very 
simple pieces with crosscuts between them, the integrals of 
* cancel along the crosscuts. 

Test D for gradient fields is [ A field that passes this 
test has|F -tiR = m . There is a solution to f* = n and 
fy— ° _ > Then df= M dx + N dy is an p differential. 
The spin field S/r 2 passes test D except at q . Its potential 
f= f increases by » going around the origin. The 
integral (iV x - M y )dx dy is not zero but t . 

The flow form of Green’s Theorem is u = v The 
norma] vector in F * nds points w and [n| ^ x and n ds 


equals dy i — dx j. The divergence of Mi + iVj is y . For F = 
xi the double integral is * There (is)(is not) a source. For 
F = yi the divergence is a . The divergence of R/r 2 is zero 

except at B This field has a C source. 

A field with no source has properties E — D « F - _E_, 
G = F . H = zero divergence. The stream function g 
satisfies the equations 6 . Then dMjdx + 

dN/dy — 0 because d 2 g/dxdy — h The example F = yi has 
g = i There (is)(is not) a potential function. The example 
F = xi — yj has g = J and also / = < This / satisfies 

Laplace’s equation L because the field F is both M 

and N . The functions/and g are connected by the o 
equations df/dx — dgjdy and P 

Compute the line integrals 1-6 and (separately) the double integ- 
rals in Green's Theorem (1). The circle has x = a cos r, 
y = a sin t. The mangle has sides x = 0, y = 0* x + y = L 

I $ x dy along the circle 2 §x 2 ydy along the circle 
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3 jxdx along the triangle 4 §ydx along the triangle 
5 | x 2 y dx along the circle 6 $ x 2 y dx along the triangle 

7 Compute both sides of Green’s Theorem in the form (10): 

(a) F ~ xi 4 >i, R = upper half of the disk x 2 4 y 2 < L 

(b) F = xH 4 xyj, C = square with sides y = 0, x = 1, y = 1, 

x = 0. 

8 Show that | c (x 2 y 4 2x)dy 4- xy 2 dx depends only on the 
area of R. Does it equal the area? 


23 Evaluate |a(x, y)dx 4 b(x , y)dy by both forms of Green’s 
Theorem, The choice A i = a t N = b in the work form gives 

the double integral . The choice M =b, N = — a in 

the flux form gives the double integral There was 

only one Green. 

24 Evaluate $ cos 3 y dy — sin 3 x dx by Green’s Theorem. 

25 The field R/r 2 in Example 7 has zero divergence except at 
r = 0. Solve dg/dy = x/(x 2 4 y 2 ) to find an attempted stream 
function g. Does g have trouble at the origin? 


9 Find the area inside the hypocycloid x — cos 3 t, y — sin 3 t 
from $$xdy — ydx. 

10 For constants b and c t how is $bydx + cxdy related to 
the area inside Cl If b = 7, which c makes the integral zero? 

11 For F = grad v ^x 2 4 y 2 , show in three ways that $F *dR = 
0 around x = cos t y y = sin t. 

(a) F is a gradient field so . 

( b ) Compute F and directly integrate F • 

(c) Compute the double integral in Green’s Theorem. 

12 Devise a way to find the one-dimensional theorem 
[ b a {dfjdx)dx = /(£>) —/(a) as a special case of Green’s Theorem 
when R is a square, 

13 (a) Choose x(t) and y(t ) so that the path goes from (1,0) 
to (1, 0) after circling the origin twice , 

(b) Compute $ ydx and compare with the area inside your 
path. 

(c) Compute £(y dx — x dy)j{x 2 4 y 1 ) and compare with 2n 
in Example 7. 

14 In Example 4 of the previous section, the work |S*dR 
between (1,0) and (0, l) was 1 for the straight path and njl 
for the quarter-circle path. Show that the work is always twice 
the area between the path and the axes. 

Compute both sides of £F • n d$ = j|(Af x 4 N y ) dxdy in 15-20. 

15 F = yi 4 x] in the unit circle 

16 F — xyi in the unit square 0 < x, y ^ 1 

17 F ~ R/r in the unit circle 

18 F = S/r in the unit square 

19 F = x 2 y\ in the unit triangle (sides x = 0,y = 0, x4y=I) 

20 F = grad r in the top half of the unit circle. 

21 Suppose divF = 0 except at the origin. Then the flux 
$F*n ds is the same through any two circles around the 

origin, because , (What is jj (M* 4 N y )dx dy between 

the circles?) 

22 Example 7 has div F = 0 except at the origin. The flux 

through every circle x 2 4y 2 = a 2 is 2n. The flux through a 
square around the origin is also 2n because . (Com- 

pare Problem 21,) 


26 Show that S/r 2 has zero divergence (except at r = 0). Find 
a stream function by solving dgjdy = y/(x 2 4 y 2 ). Does g have 
trouble at the origin? 

27 Which differentials are exact: y dx — x dy, x 2 dx + y 2 dy, 
y 2 dx 4 x 2 dyl 

28 If M x 4 iVy = 0 then the equations dgjdy — and 

dgjdx = yield a stream function. If also N x — 

show that g satisfies Laplace’s equation. 


Compute the divergence of each field in 29-36 and solve g y = 
Af and g x = — N for a stream function (if possible). 

29 2xyi — y 2 j 30 3xy 2 i-y 3 j 

31 x 2 i + y 2 j 32 y*i + x 2 } 


33 e* cos yi — e x sin yj 34 e ,x+y (i — j) 
35 2yi/x4y 2 j/x 2 36 xyi-xyj 


37 Compute N x — M y for each field in 29-36 and find a 
potential function /when possible. 

38 The potential f(Q) is the work j^F’Tds to reach Q from 
a fixed point P (Section 15.2). In the same way, the stream 

function g(Q) can be constructed from the integral 

Then g{Q) — g{P) represents the flux across the path from P to 
Q. Why do all paths give the same answer? 

39 The real part of (x 4 iy) 3 = x 3 4 3 ix 2 y - 3 xy 2 — iy 3 is /- 

x 3 — 3xy 2 . Its gradient field is F = grad /= « The 

divergence of F is . Therefore /satisfies Laplace’s 

equation f xx 4 f yy = 0 (check that it does). 

40 Since div F = 0 in Problem 39, we can solve dgjdy = 

and dgjdx = . The stream function is g = 

. It is the imaginary pari of the same (x 4 iy) 3 . Check 

that / and g satisfy the Cauchy-Riemann equations. 


41 The real pari/ and imaginary part g of (x 4 iy )* satisfy the 
Laplace and Cauchy-Riemann equations for n=l,2, .... 
(They give all the polynomial solutions.) Compute / and g for 
n = 4. 


42 When is M dy — N dx an exact differential dgl 

43 The potential /= e* cos y satisfies Laplace’s equation. 
There must be a g. Find the field F = grad / and the stream 
function g{x , y). 
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44 Show that the spin field 5 does work around every simple 
dosed curve. 

45 For F=/(x)j and /l = unit square 0<x<l, Os^y^l, 
integrate both sides of Green’s Theorem (1). What formula is 
required from one-variable calculus? 

46 A region K is “ simply connected " when every dosed curve 


inside R can be squeezed to a point without leaving Jl. Test 
these regions: 

1, xy plane without (0 3 0) 2. xyz space without (0, 0, 0) 

3. sphere r 2 + / + z 2 = 1 4. a torus (or doughnut) 

5. a sweater 6. a human body 

7. the region between two spheres 

8, xyz space with circle removed. 


15.4 Surface Integrals 

The double integral in Green’s Theorem is over a flat surface JR, Now the region 
moves out of the plane. It becomes a curved surface S, part of a sphere or cylinder 
or cone. When the surface has only one z for each (x, y\ it is the graph of a function 
z(x t y). In other cases S can twist and close up — a sphere has an upper z and a lower 
2 . In all cases we want to compute area and flux. This is a necessary step (it is our 
last step) before moving Green’s Theorem to three dimensions. 

First a quick review. The basic integrals are j dx and jj dxdy and fjjdx dydz. The 
one that didn’t fit was j ds — the length of a curve. When we go from curves to 
surfaces, ds becomes dS, Area is JjdS and flux is JjF-n dS, with double integrals 
because the surfaces are two-dimensional. The main difficulty is in dS. 

AU formulas are summarized in a table at the end of the section. 

There are two ways to deal with ds (along curves). The same methods apply to dS 
(on surfaces). The first is in xyz coordinates; the second uses parameters. Before this 
subject gets complicated, I will explain those two methods. 

Method i is for the graph of a function: curve y(x) or surface z{x, y ), 

A small piece of the curve is almost straight. It goes across by dx and up by dy : 

length ds = ^/{dxf + (dy) 1 = + {dyjdxf dx . (1) 

A small piece of the surface is practically flat. Think of a tiny sloping rectangle. One 
side goes across hy dx and up by ( dzldx)dx , The neighboring side goes along by dy 
and up by ( dzfdyjdy , Computing the area is a linear problem (from Chapter 11), 
because the flat piece is in a plane. 

Two vectors A and B form a parallelogram. The length of their cross product is the 
area . In the present case, the vectors are A = i + (dz/dx) k and B = j -I- (dz/dy) k. Then 
Adx and Bdy are the sides of the small piece, and we compute A x B: 

i j k 

A x B = 1 0 dz/dx = — dz/dxi — dz/dy\ + k. (2) 

0 l dz/dy 

This is exactly the normal vector N to the tangent plane and the surface, from 
Chapter 13. Please note: The small flat piece is actually a parallelogram (not always 
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a rectangle). Its area dS is much like ds, but the length of N = A x B involves two 
derivatives: 

area dS = |Arfx x fefyj = |N| dx dy = + {dzjdxf + {dz/dy) 2 dx dy. (3) 

EXAMPLE 1 Find the area on the plane z = x + 2 y above a base area A . 

This is the example to visualize. The area down in the xy plane is A . The area up on 
the sloping plane is greater than A. A roof has more area than the room underneath 
it. If the roof goes up at a 45° angle, the ratio is % /2. Formula (3) yields the correct 
ratio for any surface — including our plane z = x + 2y. 



y 


Roof area = base area 



times |N|. Cone and cylinder with parameters u and v , 


The derivatives are dz/dx — 1 and dz/dy = 2. They are constant (planes are easy). 
The square root in (3) contains I + l 2 + 2 2 - 6, Therefore dS — v /6 dx dy. An area in 
the xy plane is multiplied by y^6 up in the surface (Figure 1 5.14a), The vectors A and 
B are no longer needed— their work was done when we reached formula (3) — but 
here they are: 

A - i + (dz/dx)k - i + k B = j + {dz/dy) k = j + 2k N = — i — 2| -h k. 

The length of N = A x B is y'6, The angle between k and N has cos 0 — \/J 6. That 

is the angle between base plane and sloping plane. Therefore the sloping area is V ' 6 
times the base area. For curved surfaces the idea is the same, except that the square 
root in |N| - !/cos 6 changes as we move around the surface. 

Method 2 is for curves x(f ), yf f) an d surfaces x{u, r), y(w, u), z(u t v) with parameters . 

A curve has one parameter r. A surface has two parameters u and v (it is two- 
dimensional). One advantage of parameters is that x, y , z get equal treatment instead 
of picking out z as fix, y). Here are the first two examples: 

cone x — u cos v f y — u sin i\ z — u cylinder x — cos v, y — stn v, z — u. (4) 

Each choice of u and v gives a point on the surface. By making all choices, we get 
the complete surface. Notice that a parameter can equal a coordinate, as in z — w. 
Sometimes both parameters are coordinates, as in x = u and y — v and z=/(w, v). 
That is just z = /(x, y) in disguise — the surface without parameters. In other cases we 
find the xyz equation by eliminating u and v: 

cone {u cos v) 2 + (u sin c) 2 — u 2 or x 2 q- y 2 — z 2 or z — ^fx 2 + y 2 

cylinder (cos t?) 2 + (sin v) 2 = 1 or x 2 + y 2 — T 


= COS V 

- sin v 

- u 
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The cone is the graph of /= *Jx l + y 2 . The cylinder is not the graph of any function. 
There is a line of z ' s through each point on the circle x 2 + y 2 = L That is what z = 
u tells us: Give u all values, and you get the whole line. Give u and v all values, and 
you get the whole cylinder. Parameters allow a surface to close up and even go 
through itself — which th e graph of /( x s y) can never do. 

Actually z = N jx 2 + y 1 gives only the top half of the cone. {A function pr oduces 
only one z) The parametric form gives the bottom half also. Similarly y = y/\ — x 2 
gives only the top of a circle, while x = cos f, y = sin f goes all the way around. 

Now we find dS , using parameters. Small movements give a piece of the surface, 
practically flat. One side comes from the change du t the neighboring side comes from 
dt\ The two sides are given by small vectors A du and Bdv: 


t dx, 8y . dz , 

A ~a: l+ a: J + ai k 


, ^ dx r dy . dz % 

and B = —i + — j + — k. 

dv dv ov 


(5) 


To find the area dS of the parallelogram, start with the cross product N = A x B: 


i J k 


N = 


^ y* 2 v 


f dy dz dz dy\ / 3z dx 
\5u dv du dv / + dv 


?y 

cvf 


OX Cl 

~ li + 

CM C V 


dx dy 
du dv 


du dv J 


( 6 ) 


Admittedly this looks complicated — actual examples are often fairly simple. The area 
dS of the small piece of surface is |N| du dv. The length |N| is a square root: 


dS = 


j fey vz _ d_^ dyV /dz dx _ dx ccY fex ry _ cxY 

V V u dv du dv J \cw cv du cvj \cu dv du dv J 


dud v. 


(7) 


EXAMPLE 2 Find A and B and N — A x B and dS for the cone and cylinder. 

The cone has x = u cos i\ y = u sin v t 2 = u. The u derivatives produce A — SRjdu = 
cos v I + sin v j + k. The v derivatives produce the other tangent vector B — SR/dv = 
— u sin v i + u cos v j. The normal vector is A x B = - u cos v i - u sin v j + u k. Its 
length gives dS: 

dS = | A x B| du dv = x f{u cos v) 2 + (u sin v) 2 + u 2 du dv = y/l u du dv. 

The cylinder is even simpler: dS-dudv. In these and many other examples, A is 
perpendicular to B, The small piece is a rectangle. Its sides have length |A|du and 
|B|^u. (The cone has |A| = u and |B| = the cylinder has |A| = |B[ = l) t The cross 
product is hardly needed for area, when we can just multiply |A|du times 

Remark on the two methods Method 1 also used parameters, but a very special 
choice — u is x and v is y. The parametric equations are x = x, y = y, z=f{x y y). If 
you go through the long square root in (7), changing u to x and v to y, it simplifies 
to the square root in (3). (The terms eyfex and dxidy are zero; dx/dx and cyicy are 
1.) Still it pays to remember the shorter formula from Method 1. 

Don't forget that after computing dS , you have to integrate it. Many times the 
good way is with polar coordinates. Surfaces are often symmetric around an axis or 
a point. Those are the surfaces of revolution — which we saw in Chapter 8 and will 
come back to. 

Strictly speaking, the integral starts with AS (not dS). A flat piece has area 
|A x B|AxAy or |A x B|AuAr\ The area of a curved surface is properly defined as a 
limit. The key step of calculus, from sums of AS to the integral of dS , is safe for 
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smooth surfaces. In examples, the hard part is computing the double integral and 
substituting the limits on x, y or u, v , 


EXAMPLE 3 Find the surface area of the cone z = % fx 2 + y 2 up to the height z = a. 


We use Method 1 (no parameters). The derivatives of z are computed, squared, and 
added; 


dz 

dx 


dz 


sfx 1 + y 2 Sy ^Jx 2 + y 2 


|N| 2 = I + 


x 2 +y 2 


x 2 + y 2 


= 2 . 


Conclusion: jN| = ^/l and dS = -Jl dx dy. The cone is on a 45° slope, so the area 
dxdy in the base is multiplied by yfl in the surface above it (Figure 15.15). The 
square root in dS accounts for the extra area due to slope. A horizontal surface has 
dS = yi dx dy, as we have known all year. 

Now for a key point. The integration is down in the base plane. The limits o n x and 
y are given by the “shadow” of the cone. To locate that shadow set z = ^Jx 1 + y 2 
equal to z = a. The plane cuts the cone at the circle x 2 + y 2 = a 2 . We integrate over 
the inside of that circle (where the shadow is): 


surface area of cone = j] -Jl dx dy - Jl na 2 . 

shadow 


EXAMPLE 4 Find the same area using dS = v /2 u du dv from Example 2. 

With parameters, dS looks different and the shadow in the base looks different. The 
circle x 2 + y 2 = a 2 becomes u 2 cos 2 t) + u 2 sin 2 u = a 2 . In other words u — a. (The cone 
has i = u, the plane has z = a, they meet when u = a.) The angle parameter v goes 
from 0 to 2k. The effect of these parameters is to switch us “automatically” to polar 
coordinates, where area is r dr d9: 


surface area of cone ■ 


ff«- 

m 2n r 

J J W 

0 


y/l u du dv = yfl no. 2 . 



Fig. 15.15 Cone cut by plane leaves shadow in the base. Integrate over the shadow. 


EXAMPLE 5 Find the area of the same cone up to the sloping plane z=l-jx. 

Solution The cone still has dS — y/l dx dy, but the limits of integration are changed. 
The plane cuts the cone in an ellipse. Its shadow down in the xy plan e is another 
ellipse (Figure 15.15c). To find the edge of the shadow , set z = ^Jx 2 + y 1 equal to z = 

1 - jx. We square both sides: 

x 2 +y 2 = 1 -x + £x 2 or |(x + f) 2 + y 2 = f . 
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This is the ellipse in the base — where height makes no difference and i is gone. The 
area of an ellipse is nab , when the equation is in the form (xja) 2 + (yib) 2 = 1. After 
multiplying by 3/4 we find a = 4/3 and b = ^4/3. Then f j^/2 dx dy = jl nab is the 
surface area of the cone. 

The hard part was finding the shadow ellipse (I went quickly). Its area nab came 
from Example 15.3.2. The new part is y/l from the slope. 


EXAMPLE 6 Find the surface area of a sphere of radius a (known to be 4 na 2 }. 

This is a good example, because both methods al most work. The equation of the 
sphere is x 2 4- y z + z 2 — a 2 . Method l writes z = ^fa 2 — x 2 — y 2 . The x and y deriva- 
tives are — xjz and —yjz\ 



The square root gives dS = a dxdyj^Ja 2 — x 2 — y 2 t Notice that z is gone (as it should 
be). Now integrate dS over the shadow of the sphere, which is a circle. Instead of 
dx dy\ switch to polar coordinates and r drdO: 


11 ds = 


ardrdO 

Ja 2 ~ r 2 


— — 2 jia v / a 2 —r 2 ^ = 2i ia 2 . 


( 8 ) 


This calculation is successful but wrong. 2 na 2 is the area of the half-sphere above the 
xy plane. The low'er half takes the negative square root of z 2 = a 2 — x 2 — y 2 , This 
shows the danger of Method 1, when the surface is not the graph of a function. 


EXAMPLE 7 (same sphere by Method 2: use parameters) The natural choice is spheri- 
cal coordinates. Every point has an angle u = 4> down from the North Pole and an 
angle v = 0 around the equator. The xyz coordinates from Section 14.4 are x = 
a sin <f> cos 0 „ y — a sin 0 sin 8,z = a cos </>, The radius p = a is fixed (not a parameter). 
Compute the first term in equation (6). noting cz/cO = 0: 


(cyic<p}(dz;c'0) — (czic<t>){<'yidO)= — (—a sin <P)(a sin <p cos G) = a 2 sin 2 0 cos 8. 

The other terms in (6) are a 2 sin 2 ^ sin 0 and a 2 sin <f> cos tfi. Then dS in equation (7) 
squares these three components and adds. We factor out a 4 and simplify: 

u 4 (sin 4 0 cos 2 # + sin 4 $ sin 2 # + sin 2 ^ cos 2 ^) = a 4 (sin 4 d> + sin 2 <f) cos z <p) = a 4 sin 2 0. 

Conclusion: dS = a 1 sin <j> d<f> dO. A spherical person will recognize this immediately. 
It is the volume dement dV = p 2 sin </> dp d<p d#, except dp is missing. The small box 
has area dS and thickness dp and volume d K Here we only want dS\ 


area of sphere = 



fir 

a 2 sin 4? dO = Ana 2 . 
, o 


(9) 


Figure 15.16a shows a small surface with sides a d<p and a sin <p dO. Their product is 
dS. Figure 15.1 6b goes back to Method l, where equation (3) gave dS = (ajz) dxd\\ 

I doubt that you will like Figure 15. 16c — and you don’t need it. With parameters 
<p and #, the shadow of the sphere is a rectangle. The equator is the line down the 
middle, w r here <f> = tt/ 2. The height is z = a cos <p. The area d<p d$ in the base is the 
shadow of dS = a 2 sin <p d(j> d8 up in the sphere. Maybe this figure shows what we 
don’t have to know about parameters. 
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Fig. 15.16 Surface area on a sphere: (a) spherical coordinates (b> xyz coordinates (c> (j>0 space. 


EXAMPLE 8 Rotate y = x 2 around the x axis. Find the surface area using parameters. 

The first parameter is x (from a to h). The second parameter is the rotation angle 0 
(from 0 to 2 tt). The points on the surface in Figure 15.17 are x = x, y~ x 2 cos 9, 
z = x 2 sin 6. Equation (7) leads after much calculation to dS = x 2 . 1 - Ax 1 dx dO . 

Main point : dS agrees with Section 8.3, where the area was j 2ny*J\ + (dyjdx) 2 dx. 
The 27t comes from the 0 integral and y is x 2 . Parameters give this formula auto- 
matically. 

VECTOR FIELDS AND THE INTEGRAL OF F • n 

Formulas for surface area are dominated by square roots. There is a square root in 
dS, as there was in ds. Areas are like arc lengths, one dimension up. The good point 
about line integrals J F ■ n ds is that the square root disappears. It is in the denominator 
of n, where ds cancels it: F • nds = M dy - N dx. The same good thing will now 
happen for surface integrals Jf F ■ ndS. 


151 Through the surface z - /(x, y), the vector field F(x, y, z) = Mi+ (Vj + Pk 
has 

flux = ff F ■ ndS = ff { -M^f- N^ + P)dxdy. (10) 

surface shadow \ OX 8y J 


This formula tells what to integrate, given the surface and the vector field (/ and F). 
The xy limits come from the shadow. Formula (10) takes the normal vector from 
Method 1: 


N = - dfjdx i - dffdy j + k and |N| = ^1 + [8 flex) 1 + (<3//5y) 2 . 


For the unit normal vector n, divide N by its length: n = N/|N|. The square root is in 
the denominator, and the same square root is in dS. See equation (3): 


F • ndS — 


FN 






dxdy 


■(- 


M 


d l. 

dx 


Nf y + e)d*i». 


(H) 


That is formula (10), with cancellation of square roots. The expression F ■ n dS is often 
written as F -dS, again relying on boldface to make dS a vector. Then dS equals n dS, 
with direction n and magnitude dS . 
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Fig. 15*17 Surface of revolution: parameters x, 0. Fig* 15.18 F - n dS gives flow through dS. 


EXAMPLE 9 Find j\4S for the plane z = x + 2 y* Then find F • n dS for F = k. 
This plane produced y/6 in Example 1 {for area). For flux the ^76 disappears: 

ndS = — dS = — 1 ■■■ J-— - dx dy = (— i — 2j + k) dx dy. 

| N | j 6 


For the flow field F = k, the dot product k ■ n dS reduces to 1 dxdy. The slope of the 
plane makes no difference! The flow through the base also flows through the plane * The 
areas are different, but flux is like rain* Whether it hits a tent or the ground below, 
it is the same rain (Figure 15.18). In this case JJF ■ ndS = jjdxdy = shadow area in 
the base* 


EXAMPLE 10 Find the flux of F = x\ + yj + zk through the cone z 


Solution F • ndS = 


— x 




, - >-1 




+ vV + . 


= V * 2 + v 2 . 
dx dy = 0. 


The zero comes as a surprise, but it shouldn’t. The cone goes straight out from the 
origin, and so docs F. The vector n that is perpendicular to the cone is also perpendic- 
ular to F. There is no flow through the cone, because F*n = 0, The flow travels out 
along rays. 


jj F • ndS FOR A SURFACE WITH PARAMETERS 


In Example 10 the cone was z = f(x, y) = y/x 2 + y 2 . We found dS by Method 1. 
Parameters were not needed (more exactly, they were x and y). For surfaces that fold 
and twist, the formulas with u and v look complicated but the actual calculations can 
be simpler* This was certainly the case for dS = dudv on the cylinder 

A small piece of surface has area dS = |A x B| du dv. The vectors along the sides arc 
A — x u [ + yj + z u k and B = x t j + y v \ + z ( ,k. They are tangent to the surface. Now we 
put their cross product N = A x B to another use, because F *n dS involves not only 
area but direction. We need the unit vector n to see how much flow goes through. 

The direction vector is n = N/|N|, Equation (7) is dS = |N| dudv f so the square root 
|N| cancels in nJS. This leaves a nice formula for the “normal component” of flow: 


15J Through a surface with parameters w and 

u, the field F = M\ + N\ + Pk 

has 




A 

flux = 

' 

F-ndS = 

' J* 

F * N dudv = 

F-(AxB) dudv. (12) 

■ 

J 

J 

J 
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EXAMPLE 11 Find the flux of F = *i + jj+zk through the cylinder x 1 + y 2 — i n 
0 ^ z ^ b. 

Solution The surface of the cylinder is x = cos u> y = sin u, z = v. The tangent vectors 
from (5) are A - (-sin w)i + (cos w) j and B = k. The normal vector in Figure 15,19 
goes straight out through the cylinder; 


TV = A x B = cos u i + sin u j (check A ■ N = 0 and B * IN - 0), 

To find F - N, switch F = xi -f >j + zk to the parameters u and v. Then F * N = 1 : 

F ■ N = (cos u i 4- sin u j + v k) * (cos u i + sin w j) = co$ 2 u + sin 2 u. 

For the flux, integrate F - N = 1 and apply the limits on u = 0 and v = z: 


flux 



1 dudv = 2 nh — surface area of the cylinder. 


Note that the top and bottom were not included! We can find those fluxes too. The 
outward direction is n = k at the top and n = — k down through the bottom. Then 
F * n is at the top and — z = 0 at the bottom. The bottom flux is zero, the top 

flux is b times the area (or nbj. The total flux is 2nb + nb = Hold that answer 
for the next section. 

Apology: I made u the angle and v the height. Then N goes outward not inward. 


EXAMPLE 12 Find the flux of F = k out the top half of the sphere .r 2 + y 2 + z 2 = a 2 . 

Solution Use spherical coordinates. Example 7 had u = (j> and i- = 0. We found 

N = A x B — a 2 sin 2 (j> cos 0 i + a 2 sin 2 ^> sin 0 j + a 2 sin <j) cos <f> k. 

The dot product with F = k is F * N = a 2 sin <f> cos <fi. The integral goes from the pole 
to the equator, (j> = 0 to 0 = tc/ 2, and around from 0 = 0 to 8 = 2n: 

C 2n f re;2 s in 2 <f\* 2 

flux = a 2 sin 0 cos <f> d<f> dO = Ina 2 — — = na 2 . 

Jo Jo 2 Jo 

The next section will show that the flux remains at na 1 through any surface (!) that 
is bounded by the equator A special case is a flat surface — the disk of radius a at 
the equator. Figure 1 5. 1 8 shows n = k pointing directly up, so F ■ n = k -k = 1 . The 
flux is || 1 dS = area of disk = na 2 . AH fluid goes past the equator and out through the 
sphere , 



Fig, 15.19 Mow through cylinder. Fig. 15,20 Mobius strip (no way to choose n). 
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I have to mention one more problem. It might not occur to a reasonable person, but 
sometimes a surface has only one side. The famous example is the Mobius strip, for 
which you take a strip of paper, twist it once, and tape the ends together. Its special 
property appears when you run a pen along the “inside.” The pen in Figure 15.20 
suddenly goes “outside.” After another round trip it goes back “inside.” Those words 
are in quotation marks, because on a Mobius strip they have no meaning. 

Suppose the pen represents the normal vector. On a sphere n points outward. 
Alternatively n could point inward; we are free to choose. But the Mobius strip makes 
the choice impossible. After moving the pen continuously, it comes back in the 
opposite direction. This surface is not orientable. We cannot integrate F • n to compute 
the flux, because we cannot decide the direction of n. 

A surface is oriented when we can and do choose n. This uses the final property of 
cross products, that they have length and direction and also a right-hand rule. We 
can tell A x B from B x A. Those give the two orientations of n. For an open surface 
(like a wastebasket) you can select either one. For a closed surface (like a sphere) it 
is conventional for n to be outward. By making that decision once and for all, the 
sign of the flux is established: outward flux is positive . 


FORMULAS 

FOR 

SURFACE 

INTEGRALS 


Method 1 : Parameters x, y 

Method 2: Parameters u, v 

Coordinates x, y, z(x, y) 

x(u, v) 9 y(u, v) y z(u,v) on surface 

A = i + dz/dx k N = A x B 

A = dx/du i + 8y/du} + dz/du k 

B = j + dz/dy k n = N/|N| 

B = dx/dv i + dy/dv j + dz/dv k 

dS = |N|dx*/>’ = y/\ + z\ + z* dx dy 

dS = \N\dudv 

ndS = N dx dy = (— dz/dx i - dz/dy \ + k)dx dy 

ndS = N du dv 


15.4 EXERCISES 


Read-through questions 

A small piece of the surface z = /(x, y) is nearly q When 
we go across by dx % we go up by b . That movement is 
A dx y where the vector A is i + c The other side of the 
piece is B</v\ where B = j 4 d . The cross product A x B 
is N = e . The area of the piece is dS = |N| dxdy. For the 
surface z = xy, the vectors are A = f and B = 9 and 

N = h . The area integral is J \dS = » dxdv. 

With parameters u and v\ a typical point on a 45° cone is 
x = u cos v, y = 1 z = k A change in u moves that 

point by A du = (cos v i + I )du. A change in v moves the 
point by B dv = m . The normal vector is N = A x B = 

n . The area is dS = o dudv. In this example A • B = 

P so the small piece is a q and dS = |A| \B\dudv. 

For flux we need o dS. The r vector n is N = A x B 

divided by For a surface z = /(x, y), the product ndS 

is the vector t (to memorize from table). The particular 


surface z = xy has n dS = u dx dy. For F = xi -I- yj 4- rk the 

flux through z = xy is F • n dS = v dxdv. 

On a 30 c cone the points are x = 2 u cos v y y = 2u sin v % z = 
u. The tangent vectors are A = w and B ==_*_. This 
cone has ndS = A x B du dv = _v_. For F = xi + yj + zk, 
the flux element through the cone is F - ndS = * The 

reason for this answer is A . The reason we don’t compute 
flux through a Mobius strip is B 

In 1-14 find N and dS=*\N\dxdy and the surface area j $dS. 
Integrate over the xy shadow which ends where the z’s are equal 
(x 2 + y 2 = 4 in Problem 1). 

1 Paraboloid z = x 2 4- y 2 below the plane z = 4. 

2 Paraboloid z = x 2 4- y 2 between z = 4 and z = 8. 

3 Plane z = x — y inside the cylinder x 2 4- y 2 = 1. 

4 Plane z = 3x 4- 4y above the square x < 1, 0 < jr < 1. 
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5 Spherical cap x 2 + y 2 + z 2 = 1 above z = 1/^/2. 

6 Spherical band x 2 + y 2 + z 2 = I between z = 0 and 1 / v /2. 

7 Plane z = 7y above a triangle of area A. 

8 Cone z 2 = x 2 -\- y 1 between planes z = a and z = b. 

9 The monkey saddle z = jx 3 — xy 2 inside x 2 + y 2 = 1. 

10 z = x + y above triangle with vertices (0, 0), (2, 2), (0, 2), 

1 1 Plane z = l — 2x — 2y inside x ^ 0, y > 0, z ^ 0. 

12 Cylinder x 2 + z 2 = a 1 inside x 2 -yy 2 = a 2 . Only set up 

ffrfs- 

13 Right circular cone of radius a and height k Choose 
z = /( x, y) or parameters u and v. 

14 Gutter z = x 2 below z = 9 and between y= ±2* 

In 15-18 compute the surface integrals JJ g(x, y, z)dS. 

15 g = xy over the triangle x + y + z = 1, x f y, z > 0. 

16 g = x 2 + y 2 over the top half of x 2 + y 1 + z 2 = 1 (use 0, 0). 

17 g = xyz on x 2 4- y 1 + z 2 = 1 above z 2 = x 2 + y 2 (use 0 T 8), 

18 g = x on the cylinder x 2 + y 2 = 4 between z = 0 and z = 3. 

In 19-22 calculate A, B, N, and dS. 

19 x = u, y = v + u t z = v + 2u + L 

20 x — ui\ y = u + v, z — u — v. 

21 x = (3 + cos u) cos u, y = (3 + cos u) sin u, z = sin u. 

22 x = u cos v, y = u sin t>, z = v (not z = u)- 

23-26 In Problems 1—4 respectively find the flux JjF'iuiS 
for F = xi + yj-f zk. 


27-28 In Problems 19-20 respectively compute jj F • ndS for 
F = yi — xj through the region u 2 + v 2 < L 



29 A unit circle is rotated around the z axis to give a torus 
(see figure). The center of the circle stays a distance 3 from 
the z axis. Show that Problem 21 gives a typical point (x, y, z) 
on the torus and find the surface area jj dS = jj |N| du dv< 

30 The surface x = r cos 0, y = r sin 8, z = a 2 — r 2 is bounded 
by the equator (r — a). Find N and the flux jj k * ndS, and 
compare with Example 12. 

31 Make a “double Mobius strip” from a strip of paper by 
twisting it twice and taping the ends. Does a normal vector 
(use a pen) have the same direction after a round trip? 

32 Make a “triple Mobius strip” with three twists. Is it 
orientable — does the normal vector come back in the same 
or opposite direction? 

33 If a very wavy surface stays close to a smooth surface, are 
their areas close? 

34 Give the equation of a plane with roof area dS = 3 times 
base area dxdy. 

35 The points (x, f(x) cos 0, /(x) sin 0) are on the surface of 
revolution: y=/(x) revolved around the x axis, parameters 
u = x and v = 8. Find N and compare dS ~\N\dxd8 with 
Example 8 and Section 8.3. 



15.5 The Divergence Theorem 


This section returns to the fundamental law {flow out) — {flow in) = (source). In two 
dimensions, the flow was in and out through a closed curve C. The plane region 
inside was R , In three dimensions, the flow enters and leaves through a closed surface 
5. The solid region inside is K Green’s Theorem in its normal form (for the flux of a 
smooth vector field) now becomes the great three-dimensional balance equation — 
the Divergence Theorem : 
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15 K The flux of F = Mi + Nj + Pk through the boundary surface S equals the 
integral of the divergence of F inside V. The Divergence Theorem is 

JJ F '“' s= IfJ^v Fdr -lH( ? £ + ™ + f) dxd >" b - <*> 


In Green's Theorem the divergence was dM/dx + dNjdy. The new term dPjdz 
accounts for upward flow* Notice that a constant upward component P adds nothing 
to the divergence (its derivative is zero)* It also adds nothing to the flux (flow up 
through the top equals flow up through the bottom)* When the whole field F is 
constant, the theorem becomes 0 = 0* 

There are other vector fields with div F = 0* They are of the greatest importance* 
The Divergence Theorem for tbose fields is again 0 = 0, and there is conservation of 
fluid. When div F = 0, flow in equals flow out , We begin with examples of these 
“divergence-free” fields* 

EXAMPLE 1 The spin fields — yi + jcj + 0k and Oi — zj + yk have zero divergence* 

The first is an old friend, spinning around the z axis* The second is new, spinning 
around the jc axis* Three-dimensional flow has a great variety of spin fields* The 
separate terms dMjdx, dNjdy, dPjdz are all zero, so div F = 0* The flow goes around 
in circles, and whatever goes out through S comes back in. (We might have put a 
circle on as we did on | c , to emphasize that S is closed*) 

EXAMPLE 2 The position field R = xi + yj + zk has div R = 1 + 1 + 1 = 3* 

This is radial flow, straight out from the origin* Mass has to be added at every point 
to keep the flow going* On the right side of the divergence theorem is JJj3dF* 
Therefore the flux is three times the volume* 

Example 11 in Section 15*4 found the flux of R through a cylinder* The answer 
was 3 nb. Now we also get 3nb from the Divergence Theorem, since the volume is n A* 
This is one of many cases in which the triple integral is easier than the double integral* 

EXAMPLE 3 An electrostatic field R /p 3 or gravity field — R/p 3 almost has div F = 0* 

The vector R = xi + yj + zk has length ^x 2 4- y 2 + z 2 — p* Then F has length p/p 3 
(inverse square law). Gravity from a point mass pulls inward (minus sign). The electric 
field from a point charge repels outward. The three steps almost show that div F = 0: 

Step 1* dpjdx = x/p, dpjdy = yjp, dpjdz = z/p — but do not add those three. F is not 
p or 1/p 2 (these are scalars). The vector field is R/p 3 * We need dMjdx t dN/dy, dPjdz * 

Step 2, dM/dx = djdx(xjp 3 ) is equal to 1/p 3 — (3x 5p/5x)jp 4 = 1/p 3 — 3x 2 /p 5 . For 
8N/8y and dPjdz , replace 3x 2 by 3 y 2 and 3z 2 . Now add those three. 

Step 3* div F = 3/p 3 — 3 (jc 2 + y 2 + z 2 )/p 5 = 3/p 3 — 3/p 3 = 0. 

The calculation div F — 0 leaves a puzzle* One side of the Divergence Theorem seems 
to give JjjOdF=0* Then the other side should be jjF • ndS = 0. But the flux is not 
zero when all flow is outward: 

The unit normal vector to the sphere p = constant is n = R/p* 

The outward flow F • n = (R/p 3 ) • (R/p) = p 2 /p 4 is always positive. 

Then jj F * ndS = jj dSjp 1 = 4 np 2 jp 2 = 4n * We have reached 4n = 0* 
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This paradox in three dimensions is the same as for R jr l in two dimensions. 
Section 15,3 reached 2 tt- 0, and the explanation was a point source at the origin. 
Same explanation here: M t /V, P are infinite when p = 0. The divergence is a “delta 
function” times 4 tt, from the point source. The Divergence Theorem does not apply 
(unless we allow delta functions). That single point makes all the difference. 

Every surface enclosing the origin has flux = 4 ti. Our calculation was for a sphere. 
The surface integral is much harder when S is twisted (Figure 15.21a). But the Diver- 
gence Theorem takes care of everything, because div F = 0 in the volume V between 
these surfaces. Therefore JjF-miS — 0 for the two surfaces together. The flux 
|| F ■ ndS = — 471 into the sphere must be balanced by || F - ndS — 4n out of the twisted 
surface. 


t 




(P 2 -P^)dS 

(P r PJdS 

(P r P 0 )dS 


dS 


(dP/dz) dV , 

{dP/dz) dV 0 
SUM INTEGRAL 


Fig. 15.21 


Point source: flux 4n through all enclosing surfaces. Net flux upward 
= ||| (BPtdz)dv. 


Instead of a paradox An — 0, this example leads to Gauss's Law. A mass M at the 
origin produces a gravity field F = “GMR/p 3 . A charge q at the origin produces an 
electric field E = (^/47t£ 0 )R/p 3 , The physical constants are G and e 0 , the mathematical 
constant is the relation between divergence and flux. Equation (1) yields equation (2), 
in which the mass densities M(x, y ; z) and charge densities q(x , y, z) need not be 
concentrated at the origin: 


15L Gauss's law in differentia) form: div F — — AnGM and div E = 
Gauss's law in integral form: Flux is proportional to total mass or charge: 


| F • nrfS = - ^4nGMd v and j| E • ndS = JJ | q dVjz 0 . (2) 


THE REASONING BEHIND THE DIVERGENCE THEOREM 

The general principle is clear: Flow out minus flow in equals source. Our goal is to 
see why the divergence of F measures the source. In a small box around each point, 
we show that div F dV balances F *ndS through the six sides. 

So consider a small box. Its center is at (x, y, z). Its edges have length Ax, Ay, A z* 
Out of the top and bottom, the normal vectors are k and — k. The dot product with 
F = Mi + N\ + Pk is + P or - P . The area AS is AxAy, So the two fluxes are close 
to P(x, y, z + jAz)AxAy and - P(x> y, z =- jAz)AxAy. When the top is combined with 
the bottom, the difference of those P's is AP: 

net flux upward ^ APAxAy — (AP/Az)AxAyAz ^ (dP/Bz) AV t 


(3) 
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Similarly, the combined flux on two side faces is approximately (8Nfdy)& K On the 
front and back it is (3M/3x)A K Adding the six faces, we reach the key point: 

flux out of the box ^ (dM/dx + dNfdy + dPjdz)&V. (4) 

This is (div F)AK For a constant field both sides are zero — the flow goes straight 
through* For F = x\ 4- yj + zk, a little more goes out than comes in* The divergence 
is 3, so 3AK is created inside the box. By the balance equation the flux is also 3AK 

The approximation symbol as means that the leading term is correct (probably 
not the next term)* The ratio AF/Az is not exactly dPjdz, The difference is of order 
Az, so the error in (3) is of higher order AKAz. Added over many boxes (about 1/AK 
boxes), this error disappears as Az -► 0* 

The sum of (div F)AV over all the boxes approaches JJJ(div F)dV. On the other 
side of the equation is a sum of fluxes* There is F 1 nA S out of the top of one box, 
plus F • nA S out of the bottom of the box above* The first has n = k and the second 
has n = — k, They cancel each other — the flow goes from box to box . This happens 
every time two boxes meet The only fluxes that survive (because nothing cancels 
them) are at the outer surface 5. The final step, as Ax, Ay, Az -► 0, is that those outside 
terms approach JJ F ■ n dS. Then the local divergence theorem (4) becomes the global 
Divergence Theorem (1)* 

Remark on the proof That “final step” is not easy, because the box surfaces don't 
line up with the outer surface 5* A formal proof of the Divergence Theorem would 
imitate the proof of Green’s Theorem* On a very simple region ^\\{dPjdz)dxdydz 
equals JJ Pdxdy over the top minus jjPdxdy over the bottom. After checking the 
orientation this is JJ Pk * ndS. Similarly the volume integrals of dMjdx and dNjdy are 
the surface integrals JJ and JJ Nj*ndS* Adding the three integrals gives the 

Divergence Theorem* Since Green’s Theorem was already proved in this way, the 
reasoning behind (4) is more helpful than repeating a detailed proof. 

The discoverer of the Divergence Theorem was probably Gauss. His notebooks 
only contain the outline of a proof — but after all, this is Gauss. Green and Ostrograd- 
sky both published proofs in 1828, one in England and the other in St. Petersburg 
(now Leningrad). As the theorem was studied, the requirements came to light (smooth- 
ness of F and 5, avoidance of one-sided Mobius strips)* 

New applications are discovered all the time — when a scientist writes down a bal- 
ance equation in a small box * The source is known. The equation is div F = source* 
After Example 5 we explain F, 


EXAMPLE 4 If the temperature inside the sun is T = In 1 /p, find the heat flow F = 
- grad T and the source div F and the flux JJ F ■ ndS. The sun is a ball of radius p 0 * 

Solution F is — grad In 1/p = + grad In p, Derivatives of In p bring division by p: 

F = (dp/dx i + dpjdy j + dp\dz k)/p = {xi + jj + zk)/p 2 . 

This flow is radially outward, of magnitude 1/p. The normal vector n is also radially 
outward, of magnitude 1. The dot product on the sun’s surface is l/p 0 : 


II 


F*nd5 = 


<JS/p 0 = (surface area)/p 0 = 47rpg/p 0 = 4rcp 0 * 


( 5 ) 


Check that answer by the Divergence Theorem. Examples will find div F = 1/p 2 . 
Integrate over the sun* In spherical coordinates we integrate dp , sin 4>d<t > , and d6: 

*2x 


JJJdi vFdV = 


rpt 

0 Jo 


p 2 sin <p dpdfydBjp 2 = (p 0 )(2)(2ti) as in (5). 
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This example illustrates the basic framework of equilibrium . The pattern appears 
everywhere in applied mathematics —electromagnetism, heat flow, elasticity, even 
relativity. There is usually a constant c that depends on the material (the example 
has c = 1). The names change, but we always take the divergence of the gradient : 


potential f -► force field — c grad / 
temperature T -*■ flow field —c grad T. 
displacement u — ► stress field + c grad u. 


Then div(-c grad/) = electric charge 
Then div(— c grad T) = heat source 
Then div(- c grad u)- outside force. 


You are studying calculus, not physics or thermodynamics or elasticity. But please 
notice the main point. The equation to solve is div(— c grad/) — known source. The 
divergence and gradient are exactly what the applications need. Calculus teaches the 
right things. 

This framework is developed in many books, including my own text Introduction 
to Applied Mathematics (Wellesley-Cambridge Press). It governs equilibrium, in mat- 
rix equations and differential equations. 

PRODUCT RULE FOR VECTORS: INTEGRATION BY PARTS 

May I go back to basic facts about the divergence? First the definition: 

F(x, y, z) = M\ + N) + Pk has div F = V - F = dMjdx + dNjdy + dP/6z r 

The divergence is a scalar (not a vector). At each point div F is a number. In fluid 
flow, it is the rate at which mass leaves—the “flux per unit volume” or “flux density.” 
The symbol V stands for a vector whose components are operations not numbers: 

V = “del” = i djdx + j djdy + k djdz. (6) 

This vector is illegal but very useful. First, apply it to an ordinary function f{x , y, z): 

V/ = “del /” = i dfjdx + j df jdy + k df jcz = gradient of / (7) 

Second, take the dot product V * F with a vector function F(:x, y, z) = Mi + Nj + Pk: 

V ■ F = “del dot F” = dMjdx + dNjdy + dPjdz = divergence of F. (8) 

Third, take the cross product V x F. This produces the vector curl F (next section): 

V x F = “del cross F" = . . . (to be defined) . . . = curl of F. (9) 

The gradient and divergence and curl are V and V - and V x . The three great opera- 
tions of vector calculus use a single notation! You are free to write V or not — to 
make equations shorter or to help the memory. Notice that Laplace's equation 
shrinks to 



Equation (10) gives the potential when the source is zero (very common). F = grad / 
combines with div F = 0 into Laplace's equation div grad /— 0. This equation is so 
important that it shrinks further to V 2 /=0 and even to A/=0. Of course A f— 
fxx+fy y + fzz has nothing to do with A f— f(x + Ax) —/(*). Above all, remember that 
/ is a scalar and F is a vector: gradient of scalar is vector and divergence of vector is 
scalar. 




Underlying this chapter is the idea of extending calculus to vectors. So far we have 
emphasized the Fundamental Theorem. The integral of df/dx is now the integral of 
div F. Instead of endpoints a and b , we have a curve C or surface 5, But it is the rules 
for derivatives and integrals that make calculus work, and we need them now for 
vectors. Remember the derivative of u times v and the integral (by parts) of u dv/dx : 


15M Scalar functions u(x, y> z) and vector fields V (jc, y t z) obey the product rule: 

div(uV) - a div V + V • (grad u). (11) 

The reverse of the product rule is integration by parts (i Gauss’s Formula ): 


jjju div V dxdydz — — JJJv *(grad u) dxdydz + |Ju V 1 ndS. (12) 
For a plane field tbis is Green’s Formula (and u = 1 gives Green's Theorem): 


Those look like heavy formulas. They are too much to memorize, unless you use 
them often. The important point is to connect vector calculus with “scalar calculus,” 
which is not heavy. Every product rule yields two terms: 

(uM) x = udMj6x + Mdul6x (uN) y = udNldy + N du/6y ( uP) z = udPldz + Pdu/dz . 

Add those ordinary rules and you have the vector rule (11) for the divergence of u\. 

Integrating the two parts of div(uV) gives J j u\ * n dS by the Divergence Theorem. 
Then one part moves to the other side, producing the minus signs in (12) and (13), 
Integration by parts leaves a boundary term f in three and two dimensions as it did in 
one dimension: { uv'dx = — J u'vdx + [uu]*. 

EXAMPLE 5 Find the divergence of F = R/p 2 , starting from grad p = R/p. 

Solution Take V = R and u = 1/p 2 in the product rule (1 1). Then div F = (div R)/ 
p 2 + R • (grad 1/p 2 ). The divergence of R = xi + yj + zk is 3. For grad 1/p 2 apply the 
chain rule: 


R • (grad 1 Ip 2 ) = — 2R - (grad p)/p 3 = — 2R • R/p 4 = - 2/p 2 , 

The two parts of div F combine into 3/p 2 — 2/p 2 = 1/p 3 — as claimed in Example 4. 

EXAMPLE 6 Find the balance equation for flow with velocity V and fluid density p. 

V is the rate of movement of fluid, while pV is the rate of movement of mass. 
Comparing the ocean to the atmosphere shows the difference. Air has a greater 
velocity than water, but a much lower density. So normally F = pV is larger for the 
ocean. (Don’t confuse the density p with the radial distance p. The Greeks only used 
24 letters,) 

There is another difference between water and air. Water is virtually incompressible 
(meaning p = constant). Air is certainly compressible (its density varies). The balance 
equation is a fundamental law — the conservation of mass or the “ continuity equation” 
for fluids. This is a mathematical statement about a physical flow without sources or 
sinks: 


Continuity Equation: div{pV) + cpjdt = 0. 


(14) 
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Explanation; The mass in a region is jjj p dV. Its rate of decrease is — jjj vpjvt dV \ 
The decrease comes from flow out through the surface (normal vector n)> The dot 
product F*n = pV*n is the rate of mass flow through the surface. So the integral 
jjF-n dS is the total rate that mass goes out. By the Divergence Theorem this is 
Ifjdiv F dV. 

To balance — cpjvidV in every region, div F must equal —dpi fit at every point. 
The figure shows this continuity equation (14) for flow in the x direction. 


mass in 

pVdSdt 


mass pdSdx 


extra mass out 

d(p\)dSdt 


mass loss 
— dp dS dx 


Fig. 15.22 Conservation of mass during time di \ d{p\)idx + dpidt = 0. 


15,5 EXERCISES 


Read -through questions 

In words, the basic balance law is o The flux of F 
through a closed surface S is the double integral b The 
divergence of Mi + ;Yj + Pk is c and it measures d 
The total source is the triple integral e That equals the 
flux by the f Theorem. 

For F = 5ck the divergence is g If V is a cube of side 
a then the triple integral equals h The top surface where 
z = i i has n = i and F • n - J The bottom and sides 
have F* n = k . The integral |J F ■ ndS equals i 

The field F = R ip* has div F = 0 except m . J j F * n dS 
equals n over any surface around the origin. This 
illustrates Gauss's Law o The field F = xi yj — 2zk has 
div F = P and jj F • ndS = q For this F, the flux out 
through a pyramid and in through its base are r 

The symbol V stands for s In this notation div F is 

1 The gradient of/ is u The divergence of grad / 
is v The equation div grad /= 0 is w *s equation. 

The divergence of a product is div(uV) — * Integration 

by parts is J jj it div V dx dydz = v + z In tw r o 
.dimensions this becomes a In one dimension it becomes 

b For steady fluid flow the continuity equation is 
div pV = C 

In 1—10 compute the flux J j F • n dS by the Divergence Theorem. 

1 F = xi + xj h- vk, S. unit sphere x 2 + y 2 + L 

2 F = - yi + xj, V: unit cube 0 ^ x ^ L 0 ^ y ^ L 0 ^ r ^ L 

3 F — x 2 i + y 2 j + z 2 k. S: unit sphere 

4 F = \ 2 i + 8y 2 j + z 2 k, V: unit cube. 

5 F - xi - 2yj, S: sides x = 0, y = 0, j = 0, x ■+■ y + z = 1 . 

6 F = u r = (xi + yj 4- zk)/p, S: sphere p = a. 

7 F = p(xi — yj + zk), S: sphere p — a 


8 F = x 2 i + v' 3 j + z 3 k, S. sphere p = a. 

9 F = z 2 k, V\ upper half of ball p < t\. 

10 F = grad ( x ^ sin z>, S; sphere p = a. 

1 1 Find |fj div(x 2 i - yj + 2k)JF in the cube 0 ^ x, y, z ^ a. 

Also compute n and jj F • n dS for all six faces and add. 

12 When a is small in problem 1 L the answer is close to t£i\ 
Find the number c. At what point does div F = c? 

13 (a) Integrate the divergence of F = pi in the ball p ^ a. 

(b) Compute J j F • i viS over the spherical surface p — a. 

14 Integrate jfR*ndS over the faces of the box 

0 ^ y ^ 2 h 0 ^ z ^ 3 and check by the Divergence Theorem, 

15 Evaluate |'J F * ndS when F = xi + z 2 j + y 1 k and: 

(a) S is the cone z 2 = x 2 + y 1 bounded above by the plane 

Z- 1 

(b) S is the pyramid with corners (0, 0, 0), (1,0, 0), (0, LO). 
10, 0,1). 

16 Compute all integrals in the Divergence Theorem when 
F — r v(i + j — k) and V is the unit cube 0 ^ x, y. z ^ I . 

17 Following Example 5, compute the divergence of 
(xi - yj - zk)yC. 

18 (grad/) • n is the derivative of/ in the direction 

. It is also written Pf-Pn. If j xx +j yy + f == = 0 in K 

derive 1 j PfiPti dS - 0 from the Divergence Theorem 

19 Describe the closed surface S and outward normal n: 

(a) V = hollow' ball I ^ .v : + y J + z 2 ^ 9. 

f b ) V = solid cylinder x 2 + y 2 ^ L |e| ^ 7. 

(c) V = pyramid x ^ (\ y ^ 0, z ^ {\ x + 2y F < 1. 

(d) V = solid cone x 2 + y 2 ^ z 2 ^ L 

20 Give an example where ff F * ndS is easier than 
J{] div F dV. 
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21 Suppose F = M(x> y)i + N(x, y)j, R is a region in the xy 
plane, and (x, y , z) is in V if (x, 3?) is in R and \z\ < 1. 

(a) Describe V and reduce jjJdivFdK to a double 
integral. 

(b) Reduce jj F • ndS to a line integral (check top, bottom, 
side). 

(c) Whose theorem says that the double integral equals 
the line integral? 

22 Is it possible to have F • □ = 0 at all points of S and also 
div F = 0 at all points in V? F = 0 is not allowed, 

23 Inside a solid batl (radius a, density 1, mass M = 4 tm 3 /3) 
the gravity field is F ;= — GMR/a 3 , 

(a) Check div F = — 4?iG in Gauss’s Law. 

(b) The force at the surface is the same as if the whole 

mass M were . 

(c) Find a gradient field with div F = 6 in the ball p < a 
and div F = 0 outside. 

24 The outward field F = R/p 3 has magnitude |F| = 1 jp 1 . 
Through an area f on a sphere of radius p, the flux is 

. A spherical box has faces at p x and p 2 with A = 

pj sin <j>d<j>d6 and A — p\ sin <f>d(j)dd. Deduce that the flux 
out of the box is zero, which confirms div F = 0. 


25 In Gauss’s Law, what charge distribution <j(x, y, z) gives 
the unit field E = u r ? What is the flux through the unit sphere? 

26 If a fluid with velocity V is incompressible (constant den- 
sity p), then its continuity equation reduces to * If it 

is irrotational then F = grad /. If it is both then / satisfies 
equation. 

27 True or false , with a good reason. 

(a) If jj F * ndS = 0 for every closed surface, F is constant. 

(b) If F = grad / then divF = 0. 

(c) If |F| < 1 at all points then JJf div F dV < area of the 
surface S, 

(d) If |F| < 1 at all points then [div F| ^ 1 at all points, 

28 Write down statements E-F-G-H for source-free fields 
F(x, y, z) in three dimensions. In statement F, paths sharing 
the same endpoint become surfaces sharing the same bound- 
ary curve. In G, the stream function becomes a vector field 
such that F = curl g, 

29 Describe two different surfaces bounded by the circle 

x 2 + y 2 = 1, z = 0. The field F automatically has the same flux 
through both if . 

30 The boundary of a bounded region R has no boundary. 
Draw a plane region and explain what that means. What does 
it mean for a solid ball? 


15.6 Stokes’ Theorem and the Curl of F 


For the Divergence Theorem, the surface was closed. S was the boundary of K 
Now the surface is not closed and S has its own boundary — a curve called C. We 
are back near the original setting for Green’s Theorem (region bounded by curve, 
double integral equal to work integral). But Stokes’ Theorem, also called Stokes’s 
Theorem, is in three-dimensional space. There is a curved surface S bounded by a 
space curve C. This is our first integral around a space curve. 

The move to three dimensions brings a change in the vector field. The plane field 
F(x, y) = Mi + Nj becomes a space field F(x, y, z) = Mi + N] + Pk. The work 
Mdx + Ndy now includes Pdz. The critical quantity in the double integral (it was 
cN/dx — cMjdy) must change too. We called this scalar quantity “curIF” but in 
reality it is only the third component of a vector. Stokes’ Theorem needs all three 
components of that vector — which is curl F. 

DEFINITION The curl of a vector field F(x, y, z) = Afi + N\ 4 - Pk is the vector field 



(1) 
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The symbol V x F stands for a determinant that yields those six derivatives: 


curl F — V x F = 


i j k 

djdx djdy cjdz 

MNP 


(2) 


The three products i djdy P and j djdz M and k djdx N have plus signs. The three 
products like k djdy down to the left, have minus signs. There is a cyclic symmetry. 
This determinant helps the memory, even if it looks and is illegal, A determinant is 
not supposed to have a row of vectors, a row of operators, and a row of functions. 


EXAMPLE A The plane field M(x y y)i + N(jc,y)j has P = 0 and SMjdz = 0 and 
dNjdz - 0, Only two terms survive: curl F — (dNjdx — dM/dy) k. Back to Green, 


EXAMPLE 2 The cross product a x R is a spin field S, Its axis is the fixed vector a — 
aj + a 2 j + a 3 k. The flow in Figure 15.23 turns around a, and its components are 


S — a x R — 


i j k 

O i 02 ^3 

x y z 


= (<* 2 z - a 3 y)i + fas* - Gi*)} + (tfi y - G 2 x)k. (3) 


Our favorite spin field — yi + xj has (a u a 2 , o 3 ) = (0, 0, 1) and its axis is a = k. 

The divergence of a spin field is M x + N y -f P z = 0 + 0 -f 0, Note how the divergence 
uses M x while the curl uses N x and P x , The curl of S is the vector 2a: 


(5P__8N\ ( 5M_ 

\6y dz J \dz 




2 afi + 2a 2 j + 2 a 3 k — 2a. 


This example begins to reveal the meaning of the curl. It measures the spin! The direc- 
tion of curl F is the axis of rotation — in this case along a. The magnitude of curl F is 
twice the speed of rotation. In this case |curl F| = 2|a| and the angular velocity is |a|. 



EXAMPLE 3 (!!) Every gradient field F = dfjdx i + cfjdy j + cfjdz k has curl F = 0: 


, ^ ( d df d 3f\ (d 3f d df\ { d df d df\ _ 

CUr \dydz 3z3y) + \6z3x cx 3z / + ( 3 * dy dy 3x ) 

Always f yz equals /., . They cancel. Also /« =/„ and f >x So curl grad /- 0. 


(4) 
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EXAMPLE 4 (twin of Example 3) The divergence of curl F is also automatically 


zero: 


T „ d fdP dN\ d (m 

d,v a ,r lF. s ^--j+-^ 17 


ap\ a /sn dM\ 

dx) + dz\dx dy ) 


(5) 


Again the mixed derivatives give P Ky = P yx and N xz — N 2X and M zy = M yz . The terms 
cancel in pairs. In “curl grad” and “div curl”, everything is arranged to give zero. 


15N The curl of the gradient of every /(x,y, z) is curl grad/= V xV/-0. 
The divergence of the curl of every F(x, y , z) is div curl F = V*VxF = 0. 


The spin field S has no divergence. The position field R has no curl R is the gradient 
of f—^x 1 + y 2 + z 1 ). S is the curl of a suitable F. Then div S = div curl F and 
curl R = curl grad / are automatically zero. 

You correctly believe that curl F measures the “spin” of the field. You may expect 
that curl (F + G) is curl F + curl G. Also correct. Finally you may think that a field 
of parallel vectors has no spin. That is wrong. Example 5 has parallel vectors, but 
their different lengths produce spin. 

EXAMPLE 5 The field V = zi in the x direction has curl V = j in the y direction. 

If you put a wheel in the xz plane, this field will turn it . The velocity zi at the top of 
the wheel is greater than zi at the bottom (Figure 15.23c). So the top goes faster and 
the wheel rotates. The axis of rotation is curl V = j. The turning speed is j, because 
this curl has magnitude 1. 

Another velocity field v — — xk produces the same spin: curl v = j. The flow is in 
the z direction, it varies in the x direction, and the spin is in the y direction. Also 
interesting is V + v. The two “shear fields” add to a perfect spin field S = zi — xk, 
whose curl is 2j. 


THE MEANING OF CURL F 

Example 5 put a paddlewheel into the flow. This is possible for any vector field F, 
and it gives insight into curl F. The turning of the wheel (if it turns) depends on its 
location (x, y, z). The turning also depends on the orientation of the wheel. We could 
put it into a spin field, and if the wheel axis n is perpendicular to the spin axis a, the 
wheel won't turn! The general rule for turning speed is this: the angular velocity of 
the wheel is ^{curl F) * n. This is the “directional spin,” just as (grad/) * u was the 
“directional derivative ” — and n is a unit vector like u. 

There is no spin anywhere in a gradient field. It is irrotational : curl grad/— 0. 
The pure spin field a x R has curl F = 2a. The angular velocity is a -n (note that 
i cancels 2). This turning is everywhere, not just at the origin . If you put a penny on 
a compact disk, it turns once when the disk rotates once. That spin is “ around itself” 
and it is the same whether the penny is at the center or not 
The turning speed is greatest when the wheel axis n lines up with the spin axis a. 
Then a • n Is the full length |a|. The gradient gives the direction of fastest growth, and 
the curl gives the direction of fastest turning: 

maximum growth rate of /is |grad /[ in the direction of grad/ 
maximum rotation rate of F is ^|curl F| in the direction of curl F. 
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STOKES' THEOREM 

Finally we come to the big theorem. It will be like Green’s Theorem — a line integral 
equals a surface integral. The line integral ts still the work §F*dR around a curve. 
The surface integral in Green’s Theorem is Jj(iV* — M y )dxdy. The surface is flat (in 
the xy plane). Its normal direction is k, and we now recognize N x — M y as the k 
component of the curl. Green’s Theorem uses only this component because the nor- 
mal direction is always k. For Stokes Theorem on a curved surface, we need all three 
components of curl F. 

Figure 15,24 shows a hat-shaped surface S and its boundary C (a closed curve). 
Walking in the positive direction around C, with your head pointing in the direction 
of n, the surface is on your left You may be standing straight up (n = k in Green’s 
Theorem). You may even be upside down (n = - k is allowed). In that case you must 
go the other way around C to keep the two sides of equation (6) equal. The surface is 
still on the left, A Mobius strip is not allowed, because its normal direction cannot be 
established. The unit vector n determines the “counterclockwise direction’ 1 along C. 


150 (Stokes* Theorem ) 

(£ F • (/ R = f 

(curl F) ■ adS. 

(6) 


Jc J. 

s 



The right side adds up small spins in the surface. The left side is the total circulation 
(or work) around C. That is not easy to visualize — this may be the hardest theorem 
in the book — but notice one simple conclusion, (fcurl F = 0 then §F-dR - 0. This 
applies above alt to gradient fields— as we know, 

A gradient field has no curl, by (4), A gradient field does no work, by (6). In three 
dimensions as in two dimensions, gradient fields are conservative fields . They will be 
the focus of this section, after we outline a proof (or two proofs) of Stokes’ Theorem. 

The first proof shows why the theorem is true. The second proof shows that it 
really is true (and how to compute). You may prefer the first. 

First proof Figure 15.24 has a triangle ABC attached to a triangle A CD, Later there 
can be more triangles. S will be piecewise fiat , dose to a curved surface. Two triangles 
are enough to make the point. In the plane of each triangle (they have different n’s) 
Green’s Theorem is known: 

§ F-dR — {{ curl F*n dS | jj curl F-ndS, 

AB + BC+CA ABC AC+CD + DA ACD 

Now add. The right sides give Jj curl F -miS over the two triangles. On the left, the 
integral over CA cancels the integral over AC. The “crosscut 11 disappears. That leaves 
AB + BC 4- CD + DA. This line integral goes around the outer boundary C— which 
is the left side of Stokes 1 Theorem. 



Fig. 15.24 Surfaces S and boundary curves C. Change in B -►cud E — ► current in C. 
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Second proof Now the surface can be curved. A new proof may seem excessive, but 
it brings formulas you could compute with. From z =/{x, y) we have 

dz - dfjdx dx + dfjdy dy and ndS = (— dfjdx i - dfjdy j + k) dx dy. 

For ndS, see equation 15.4.1L With this dz 7 the line integral in Stokes’ Theorem is 

§Fdtt= j Mdx + N dy + P(dfl8xdx + 8f/8ydy). (7) 

C shadow of C 

The dot product of curl F and ndS gives the surface integral JJ curl F • n dS: 

s 

JJ [(P, - JV,K— dfjdx) + ( Af, - P x )(- dfjdy) + (N x - AfJ] dx dy. (8) 

shadow of S 

To prove (7) = (8), change M in Green’s Theorem to M + Pdfjdx , Also change N to 
N + Pdfjdy~ Then (7) = (8) is Green’s Theorem down on the shadow (Problem 47). 
This proves Stokes’ Theorem up on 5, Notice how Green’s Theorem (flat surface) 
was the key to both proofs of Stokes’ Theorem (curved surface). 

EXAMPLE 6 Stokes’ Theorem in electricity and magnetism yields Faraday’s Law. 

Stokes’ Theorem is not heavily used for calculations — equation (8) shows why. But 
the spin or curl or vorticity of a flow is absolutely basic in fluid mechanics. The other 
important application, coming now, is to electric fields. Faraday’s Law is to Gauss’s 
Law as Stokes’ Theorem is to the Divergence Theorem. 

Suppose the curve C is an actual wire. We can produce current along C by varying 
the magnetic field B(f). The flux = JJ B • ndS, passing within C and changing in time, 
creates an electric field E that does work: 

Faraday's Law (integral form): work = (j) E • dR = — d<pjdL 

That is physics. It may be true, it may be an approximation. Now comes mathematics 
(surely true), which turns this integral form into a differential equation. Information 
at points is more convenient than information around curves. Stokes converts the 
line integral of E into the surface integral of curl E: 

$E*dR = JJ curl E • ndS and also - dcp/dt = JJ - (dBjdt) • n dS. 

These are equal for any curve C, however small. So the right sides are equal for any 
surface S. We squeeze to a point. The right hand sides give one of Maxwell’s equations: 

Faraday's Law (differential form): curl E = — dBjdt. 

CONSERVATIVE FIELDS AND POTENTIAL FUNCTIONS 

The chapter ends with our constant and important question: Which fields do no 
work around closed curves? Remember test D for plane curves and plane vector 
fields: 

if oMjdy = dNjdx then F is conservative and F — grad / and J F * rfR = 0. 

Now allow a three-dimensional field like F = 2 xy i + (x 2 + z)\ + yk. Does it do work 
around a space curve? Or is it a gradient Jieldl That will require dfjdx = 2xy and 
dfjdy = x 2 + r and dfjdx — y. We have three equations for one function /{jc,y, z). 
Normally they can’t be solved. When test D is passed (now it is the three-dimensional 
test: curl F = 0) they can be solved. This example passes test D, and /is x 2 y + yz. 
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15P F(jc* y, z) — M\ + N} + Pk is a conservative field if it has these properties: 

A. The work IF • dR around every closed path in space is zero. 

B. The work JpF * dR depends only on P and Q, not on the path in space. 

C. F is a gradient field : Af = df/dx and N = dfjdy and P = dfjdz m 

D. The components satisfy M y = M z = P x , and N z = P y (curl F is zero). 
A field with one of these properties has them all D is the quick test. 


A detailed proof ofA=*B=*C^D^A is not needed. Only notice how C => D: 
curl grad F is always zero. The newest part is D => A. (/'curl F = 0 then | F • dR = 0. 
But that is not news. It is Stokes’ Theorem. 

The interesting problem is to solve the three equations for / when test D is passed. 
The example above had 

dfjd x — 2xy =>f = J 2 xydx = x 2 y plus any function C(y, z) 
dfjdy = x 2 + z = x 2 + dCjdy => C = yz plus any function c{z) 
df/dz = y = y + dcjdz => c(z) can be zero. 

The first step leaves an arbitrary C(y, z) to fix the second step. The second step leaves 
an arbitrary c{z) to fix the third step (not needed here). Assembling the three steps * 
/ = x 2 y + C = x 2 y + yz + c = x 2 y + yz , Please recognize that the “fix-up” is only pos- 
sible when curl F = 0. Test D must be passed. 


EXAMPLE 7 Is F = (z-y)i+(x- z) j + (y — x)k the gradient of any /? 

Test D says no. This F is a spin field a x R t Its curl is 2a - (2, 2, 2), which is not zero. 
A search for /is bound to fail, but we can try. To match df/dx = z — y, we must have 
/= zx — yx + C(y, z). The y derivative is — x + dCjdy . That never matches N = x — z, 
so / can’t exist. 


EXAMPLE 8 What choice of P makes F = yz 2 i + xz 2 j + Pk conservative? Find / 
Solution We need curl F = 0, by test D. First check dMjdy — z 2 = ONjdx. Also 
dPjdx = dMjdz = 2 yz and dPjcy = cNjdz = 2xz. 

P = 2 xyz passes all tests. To find / we can solve the three equations, or notice that 
/ = xyz 2 is successful. Its gradient is F. 

A third method defines f(x, y , z) as the work to reach (jc, y, z)from (0, 0, 0). The path 
doesn’t matter. For practice we integrate F *dR = Mdx + N dy + Pdz along the 
straight line (xf, yf, zt): 


f(x , y, 2 ) = 


(yt)(zt) 2 (x dt) + {xt)(zt) 2 (y dt) + 2(xt)(yt)(zt){z dt) = xyz 2 . 
o 


EXAMPLE 9 Why is div curl grad / automatically zero (in two ways)? 

Solution First, curl grad /is zero (always). Second, div curl F is zero (always). Those 
are the key identities of vector calculus. We end with a review. 


Green's Theorem: 


FdR 


{dNjcx — dMjdy)dxdy 


F • n ds 


(d\fjdx + 6Njdy)dx dy 
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Divergence Theorem : 


Fn dS 


(cM/cx H- dN/cy + dPjdz)dx dydz 


Stokes* Theorem: 




curl F * n dS. 


The first form of Green's Theorem leads to Stokes' Theorem, The second form 
becomes the Divergence Theorem. You may ask, why not go to three dimensions in 
the first place ? The last two theorems contain the first two (take P = 0 and a flat 
surface). We could have reduced this chapter to two theorems, not four. I admit that, 
but a fundamental principle is involved: “It is easier to generalize than to specialize/’ 
For the same reason dfjdx came before partial derivatives and the gradient. 


15.6 EXERCISES 


Read-through questions 

The curl of Mi + /Vj + Pk is the vector q . It equals the 
3 by 3 determinant b . The curl of x 2 i + z 2 k is c . 
For S — yi — (x + z)j + yk the curl is d This S is a e 
field a x R = £{curl F) x R, with axis vector a — f For 
any gradient field f x i + f y j + f £ k the curl is 9 . That is the 

important identity curl grad / = h It is based on f xy = j yx 
and ’ and 1 The twin identity is . 

The curl measures the 1 of a vector field. A pad- 
dlewheel in the field with its axis along n has turning speed 
m . The spin is greatest when n is in the direction of 
ft Then the angular velocity is o , 

Stokes' Theorem is p = 9 The curve C is the 

r of the * & This is f Theorem extended to 

u dimensions. Both sides are zero when F is a gradient 

field because v 

The four properties of a conservative field are A = 

B = * C — y D = 1 The field v 2 z 2 i + lxy 2 zk 

(passes) (falls) test D. This field is the gradient of /= 

The work J F * dR from (0, 0, 0) to (1, 1, 1) is b (on which 
path?). For every field F, JJ curl F - ndS is the same out 
through a pyramid and up through its base because C , 

In Problems 1-6 find curl F. 

1 F = zi -J- x] + yk 2 F = gradate* sin z) 

3 F = (x + v + z)(i+j + k) 4 F = (X + >')i - (.* + y)k 

5 F = pTvi + yj + zk) 6 F = (i + j) x R 

7 Find a potential / for the field in Problem 3. 

8 Find a potential / for the field in Problem 5. 

9 When do the fields x m i and x"j have zero curl? 

10 When does (a ]r x + a 2 y + a 3 z)k have zero curl? 


In 11-14, compute curl F and find R by Stokes’ 

Theorem. 

11 F = x 2 \ + y 2 k, C - circle x 2 + z 2 = l, y = 0. 

12 F = i x R, C - circle x 2 + z 2 = l, y = 0. 

13 F = (i + j) x R, C = circle v 2 + z 2 = 1, x = 0. 

14 F = (yi — xj) x (xi + yj), C = circle x 2 + y 1 — I, z - 0. 

15 (important) Suppose two surfaces S and T have the same 
boundary C, and the direction around C is the same, 

(a) Prove Jjs curl F * n^S = T curl F ■ n dS> 

(b) Second proof: The difference between those integrals is 
jjjdiv(curl F)dV, By what Theorem? What region is F7 
Why is this integral zero? 

16 In 15, suppose S is the top half of the earth (n goes out) 
and T is the bottom half (n comes in). What are C and F? 
Show by example that F • ndS = JjY F * ndS is not generally 
true, 

17 Explain why jj curl F*n dS = 0 over the closed boundary 
of any solid K 

18 Suppose curl F = 0 and div F = 0, (a) Why is F the gradi- 
ent of a potential? (b) Why does the potential satisfy Laplace's 
equation/,, +/>■>■ +/„ = 0? 

In 19-22, find a potential /if it exists. 

J 9 F = zi + j + xk, 20 F = 2xyzi + x 2 zj -f- x 2 yk 

21 F = e^ z \-e'~ z k 22 F = yzi + xzj + (xy + z 2 )k 

23 Find a field with curl F = (1, 0, 0). 

24 Find all fields with curf F = (I, 0, 0). 

25 S = a x R is a spin field. Compute F = bxS (constant 
vector b) and find its curl. 
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26 How fast is a paddlewheel turned by the field F = yi — xk 
(a) if its axis direction is n = j? (b) if its axis is lined up with 
curl F? Ic) if its axis is perpendicular to curl F? 

27 How is curl F related to the angular velocity a) in the spin 
Held F = w(— yi + xj)? How r fast does a wheel spin, if it is in 
the plane x f y l z = 1? 

28 Find a vector held F whose curl is S =yi — xy 

29 Find a vector field F whose curl is S = a x R. 

30 True or false: when two vector fields have the same cud 
at all points: (a) their difference is a constant field (b) their 
difference is a gradient field (c) they have the same divergence. 

In 31-34, compute fj curl F * ndS over the top half of the sphere 
v 2 + y 2 + z 1 = 1 and (separately) f F - dR around the equator 

31 F = yi - xj 32 F - R/p 2 

33 F = a x R 34 F = (a x R) x R 

35 The circle C in the plane x + y + z — 6 has radius r and 

center at (1,2, 3). The held F is 3zj + 2yk, Compute §F • dR 

around C 

36 S is the top half of the unit sphere and F = zi + xj + xyzk. 
Find j j curl F * ndS. 

37 Find y(.x, y) so that curl gk = yi + x 2 j. What is the name 
for g in Section 15.3? It exists because yi + x 2 \ has zero 


38 Construct V so that curl F = 2xi 3 vj — 5zk (which has 
zero divergence), 

39 Split the field F = xyi into V>W with curl V = 0 and 
divW-0. 

40 Ampere's luw r for a steady magnetic field B is curl B = pJ 
(J = current density, p = constant). Find the work done by B 
around a space curve C from the current passing through it. 


Maxwell allows varying currents which brings in the electric 
field. 

41 For F = ( x 2 + y J )i, compute curl (curl F) and grad (div F) 
and F xx + F^ + F s= . 

42 F or F = v(x t \\ z)i f prove these useful identities: 

(a) curl(curl F} = grad (div F) ~(F XX + F yy + F, = ), 

(b) curl(/F) — / curl F + (grad /} x F. 

43 If B = a cos t (constant direction a), find curl E from Fara- 
day's Law. Then find the alternating spin field E. 

44 With G{x, y, z) = m\ 4- /ij l- pk, write out FxG and take 
its divergence. Match the answer with G • curl F — F * curl G. 

45 Write down Green's Theorem in the xz plane from Stokes' 
Theorem. 

46 True or false: V x F is perpendicular to F. 

47 (a) The second proof of Stokes’ Theorem took M* = 
jVT(x, y,/(x, y)) + P(x* >\/(x\ y))dfjdx as the M in Green's 
Theorem, Compute cM*jdy from the chain rule and pro- 
duct rule (there are five terms). 

(b) Similarly = .\ r (x, y,/) + P(x, }\f)df/cy has the x 
derivative N x + N„f x + P x j\. + P z f x f y + Pf y *, Check that 
N* — M* matches the right side of equation (8), as needed 
in the proof. 

48 "The shadow of the boundary is the boundary of the 
shadow,” This fact was used in the second proof of Stokes’ 
Theorem, going to Green’s Theorem on the shadow. Give 
two examples of S and C and their shadows. 

49 Which integrals are equal when C = boundary of S or S = 
boundary of F? 

£ F * dR $ (curl F) • dR £ (curl F) - nds jj F * n dS 
[jdivFdS ] j(curl F)* ndS j} (grad div F)« ndS JJJ div F dV 

50 Draw the field V = — xk spinning a wheel in the xz plane. 
What wheels would not spin? 
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CHAPTER 16 


Mathematics after Calculus 


I would like this book to do more than help you pass calculus, (I hope it does that 
too,) After calculus you will have choices — Which mathematics course to take next ! — 
and these pages aim to serve as a guide. Part of the answer depends on where you 
are going— toward engineering or management or teaching or science or another 
career where mathematics plays a part. The rest of the answer depends on where the 
courses are going. This chapter can be a useful reference, to give a clearer idea than 
course titles can do: 

Linear Algebra Differential Equations Discrete Mathematics 
Advanced Calculus (with Fourier Series) Numerical Methods Statistics 

Pure mathematics is often divided into analysis and algebra and geometry. Those 
parts come together in the “mathematical way of thinking” — a mixture of logic 
and ideas. It is a deep and creative subject — here we make a start. 

Two main courses after calculus are Unear algebra and differential equations, 
I hope you can take both. To help you later. Sections 16,1 and 16.2 organize them 
by examples. First a few words to compare and contrast those two suhjects. 

Linear algebra is about systems of equations. There are n variables to solve for. A 
change in one affects the others. They can be prices or velocities or currents or 
concentrations — outputs from any model with interconnected parts. 

Linear algebra makes only one assumption — the model must be linear , A change 
in one variable produces proportional changes in all variables. Practically every 
suhject begins that way, (When it becomes nonlinear, we solve by a sequence of linear 
equations. Linear programming is nonlinear because we require x ^ 0,) Elsewhere J 
wrote that “Linear algebra has become as basic and as applicable as calculus, and 
fortunately it is easier,” I recommend taking it, 

A differential equation is continuous (from calculus), where a matrix equation is 
discrete (from algebra). The rate dyjdt is determined by the present state y — which 
changes by following that rule. Section 16.2 solves y = cy + s(£) for economics and 
life sciences, and y* + by + cy =J[t) for physics and engineering. Please keep it and 
refer to it. 
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A third key direction is discrete mathematics . Matrices are a part, networks and 
algorithms are a bigger part. Derivatives are not a part — this is closer to algebra. It 
is needed in computer science. Some people have a knack for counting the ways a 
computer can send ten messages in parallel — and for finding the fastest way. 

Typical question. Can 25 states be matched with 25 neighbors ; so one state in each 
pair has an even number of letters ? New York can pair with New Jersey, Texas with 
Oklahoma, California with Arizona. We need rules for Hawaii and Alaska. This 
matching question doesn’t sound mathematical, but it is. 

Section 163 selects four topics from discrete mathematics, so you can decide if you 
want more. 

Go back for a moment to calculus and differential equations. A completely realistic 
problem is seldom easy, but we can solve models. (Developing a good model is a skill 
in itself.) One method of solution involves complex numbers: 

any function w(x + iy) solves u xx + u yy = 0 (Laplace equation) 

any function e tkix + ct) solves u n — c 2 u xx = 0 (wave equation). 

From those building blocks we assemble solutions. For the wave equation, a signal 
starts at t — 0. It is a combination of pure oscillations e ikx . The coefficients in that 
combination make up the Fourier transform — to tell how much of each frequency is 
in the signal. A lot of engineers and scientists would rather know those Fourier 
coefficients than/(x). 

A Fourier series breaks the signal into lo^cosfcjc or I b k sin kx or £ c k e?* x . 
These sums can be infinite (like power series). Instead of values o(j\x\ or derivatives 
at the basepoint, the function is described by a kt b k ^c k , Everything is computed by 
the '"Fast Fourier Transform” This is the greatest algorithm since Newtons method. 

A radio signal is near one frequency. A step function has many frequencies. A delta 
function has every frequency in the same amount: 8{x) = £ cos kx, Channel 4 can’t 
broadcast a perfect step function. You wouldn’t want to hear a delta function. 

We mentioned computing . For nonlinear equations this means Newton’s method. 
For Ax = b it means elimination — algorithms take the place of formulas . Exact solu- 
tions are gone — speed and accuracy and stability become essential. It seems right to 
make scientific computing a part of applied mathematics, and teach the algorithms 
with the theory. My text Introduction to Applied Mathematics is one step in this 
direction, trying to present advanced calculus as it is actually used. 

We cannot discuss applications and forget statistics. Our society produces oceans 
of data — somebody has to draw conclusions. To decide if a new drug works, and if 
oil spills are common or rare, and how often to have a checkup, we can’t just guess. 
I am astounded that the connection between smoking and health was hidden for 
centuries. It was in the data! Eventually the statisticians uncovered it. Professionals 
can find patterns, and the rest of us can understand (with a little mathematics) what 
has been found. 

One purpose in studying mathematics is to know more about your own life. 
Calculus lights up a key idea: Functions . Shapes and populations and heart signals 
and profits and growth rates, all are given by functions. They change in time. They 
have integrals and derivatives. To understand and use them is a challenge- 
mathematics takes effort. A lot of people have contributed, in whatever way they 
could — as you and I are doing. We may not be Newton or Leibniz or Gauss or 
Einstein, but we can share some part of what they created. 
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16.1 Vector Spaces and Linear Algebra 9HI 

You have met the idea of a matrix . An m by n matrix A has m rows and n columns 
(it is square if m = n ), It multiplies a vector x that has n components. The result is a 
vector Ax with m components. The central problem of linear algebra is to go back- 
ward: From Ax = bftndx , That is possible when A is square and invertible. Otherwise 
there is no solution x — or there are infinitely many. 

The crucial property of matrix multiplication is linearity. If Ax ~ b and AX — B 
then A times x + X is b + B. Also A times 2x is 2 ft. In general A times cx is cb. 
In particular A times 0 is 0 (one vector has n zeros, the other vector has m zeros). The 
whole subject develops from linearity. Derivatives and integrals obey linearity too. 


Question 1 What are the solutions to Ax— 0? One solution is x = 0. There 
may be other solutions and they fill up the “ nuUspace 



When there are more unknowns than equations — when A has more columns than 
rows — the system Ax~0 has many solutions . They are not scattered randomly 
around! Another solution is X = 4, Y— “ 2, Z = 6. This lies on the same line as 
(2, —1, 3) and (0, 0, 0). Always the solutions to Ax — 0 form a “space" of vectors — 
which brings us to a central idea of linear algebra. 

Note These pages are not concentrating on the mechanics of multiplying or invert- 
ing matrices. Those are explained in all courses. My own teaching emphasizes that 
Ax is a combination of the columns of A. The solution x = A~ 1 b is computed by 
elimination. Here we explain the deeper idea of a vector space — and especially the 
particular spaces that control Ax — bA cannot go into the same detail as in my book 
on Linear Algebra and Its Applications, where examples and exercises develop the 
new ideas. Still these pages can be a useful support. 

All vectors with n components lie in n-dimensional space , You can add them and 
subtract them and multiply them by any c. (Don't multiply two vectors and never 
write 1 jx or 1//4), The results x + X and x — X and cx are still vectors in the space. 
Here is the important point: 

The line of solutions to Ax = 0 is a “subspace"— a vector space in its own right. 
The sum x + X has components 6, - 3, 9 — which is another solution. The difference 
x - X is a solution, and so is 4x , These operations leave us in the subspace. 

The nullspace consists of all solutions to Ax — 0. It may contain only the zero 
vector (as in the first example). It may contain a line of vectors (as in the second 
example). It may contain a whole plane of vectors (Problem 5), In every case x + X 
and x - X and cx are also in the nullspace. We are assigning a new word to an old 
idea — the equation x - 2y = 0 has always been represented by a line (its nullspace). 
Now we have 6-dimensional subspaces of an 8-dimensional vector space. 

Notice that x 1 — y = 0 does not produce a subspace (a parabola instead). Even the 
x and y axes together, from xy = 0, do not form a subspace. We go off the axes when 
we add (1, 0) to (0, 1). You might expect the straight line x — 2y — 1 to be a subspace, 
but again it is not so. When x and y are doubled, we have X - 2Y = 2. Then {X, Y) 
is on a different line. Only Ax = 0 is guaranteed to produce a subspace. 
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Figure 16.1 shows the nullspace and “row space/* Check dot products (both zero). 



Question 2 When A multiplies a vector x, what subspace does Ay lie in! The 
product Ax is a combination of the columns of A — hence the name ’ column space". 


~2 r 


~3~ 

3(column 1) 

~8~ 

0 1 



= + = 

2 

0 0 


2 

2(column 2) 

0 


This is in the column space; 


0 

0 

1 


is not. 


No choice of x can produce Ax = (0, 0, 1 ). For this A, all combinations of the columns 

end in a . The column space is like the xy plane within xyz space. It is a 

subspace of m-dimensional space, containing every vector b that is a combination of 
the columns: 

The system Ax = b has a solution exactly when b is in the column space . 

When A has an inverse, the column space is the whole n-dimcnsional space. The 
nullspace contains only x = 0. There is exactly one solution x = A 1 h. This is the 
good case — and we outline four more key topics in linear algebra. 

1. Basis and dimension of a subspace . A one-dimensional subspace is a line, A plane 
has dimension two. The nullspace above contained all multiples of (2, —1, 3) — -by 
knowing that "basis vector*" we know the whole line. The column space was a plane 
containing column 1 and column 2. Again those vectors are a "basis " — by knowing 
the columns we know the whole column space. 

Our 2 by 3 matrix has three columns: (1,0) and (2, 3) and (0, 1). Those are not a 
basis for the column space! This space is only a plane, and three vectors are too 
many. The dimension is two. By combining (1, 0) and (0, 1} we can produce the other 
vector (2, 3). There are only two independent columns, and they form a basis for this 
column space. 

In general: When a subspace contains r independent vectors, and no more, those 
vectors are a basis and the dimension is r. "‘Independent” means that no vector is a 
combination of the others. In the example, (T 0) and {2, 3) are also a basis. A subspace 
has many bases, just as a plane has many axes. 

2. Least squares , If Ax = b has no solution, we look for the x that comes closest. 
Section 11,4 found the straight line nearest to a set of points. We make the length 
of Ax — b as small as possible, when zero length is not possible. No vector solves 
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Ax - b , when b is not in the column space. So b is projected onto that space, This 
leads to the “normal equations” that produce the best x : 

A T Ax=A T b. (1) 

When a rectangular matrix appears in applications, its transpose generally comes 
too. The columns of A are the rows of A T . The rows of A are the columns of ,4 T . 
Then A 7 A is square and symmetric — equal to its transpose and vital for applied 
mathematics. 

3. Eigenvalues (for square matrices only). Normally Ax points in a direction 
different from x. For certain special eigenvectors , Ax is parallel to x. Here is a 2 by 
2 matrix with two eigenvectors — in one case ,4x ~ 5x and in the other Ax = 2x; 


" 3 m 

"5“ 

= 5 

T 

and 

i 

N> 

1 

2 

— 

4“ 

= 2 

2 

1 4 1 

5 


l 


L 1 4J 

-1 


-2_ 


-1 


The multipliers 5 and 2 are the eigenvalues of A. An 8 by 8 matrix has eight eigen- 
values, which tell what the matrix is doing (to the eigenvectors). The eigenvectors are 
uncoupled, and they go their own way. A system of equations dyjdt - Ay acts like 
one equation— when y is an eigenvector : 

dyjdt = 3y 1 + 2y 2 yi = e 5t [l 

has the solution which is e 

dyjdt ~ y x + 4 y 2 y 2 = e 5 ' |_l_ 

The eigenvector is (1, 1). The eigenvalue X ~ 5 is in the exponent. When you substitute 
y t and y 2 the differential equations become 5e St = 5e s \ The fundamental principle 
for dyjdt = cy still works for the system dyjdt = Ay: Look for pure exponential solu- 
tions. The eigenvalue “lambda” is the growth rate in the exponent, 

I have to add: Find the eigenvectors also. The second eigenvector (2, -*1) has 
eigenvalue A “ 2. A second solution is y 1 = 2e 2 *, y 2 = e zt . Substitute those into the 
equation — they are even better at displaying the general rule: 

If Ax — Xx then d/dt(e Xt x) = A(e Xt x). The pure exponentials are y — e Xt x. 

The four entries of A pull together for the eigenvector. So do the 64 entries of an 
8 by 8 matrix — again e h x solves the equation. Growth or decay is decided by X> 0 
or X < 0. When X = k + ice is a complex number, growth and oscillation combine in 
e Xt = e*V wr = e*'(cos cot + i sin cot). 

Subspaces govern static problems Ax = b. Eigenvalues and eigenvectors 
govern dynamic problems dyjdt = Ay> Look for exponentials y=e**x. 

4. Determinants and inverse matrices . A 2 by 2 matrix has determinant D — ad — be. 
This matrix has no inverse if D = 0. Reason: A~ l divides by D : 


1 

-Cl 

1 

1 

~l< 

ii 

1 

a b 


"l 0 

times A = 

equals / = 


t^l~c aj 

|_c d 

_° 1 


This pattern extends to n by n matrices, but D and X” 1 become more complicated. 

For 3 by 3 matrices D has six terms. Section 11.5 identified D as a triple product 
a • (b x c) of the columns. Three events come together in the singular case: D is zero 
and A has no inverse and the columns lie in a plane. The opposite events produce the 
“nonsingular” case: D is nonzero and A ~ 1 exists . Then Ax = b is solved by x = A~ Y b. 
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D is also the product of the pivots and the product of the eigenvalues. The pivots 
arise in elimination — the practical way to solve Ax = b without A' 1 . To find eigen- 
values we turn Ax = Xx into (A - XI) x = 0. By a nice twist of fate, this matrix A - XI 
has D = 0. Go back to the example: 


3 2 


1 

O 
1 


1 

■X 

1 

m 


-X 




1 4 


0 1 


1 

-P*- 

1 


has D = (3-X)(4-X)-2 = X 2 -1X+ 10. 


The equation X 2 -7/1+10 = 0 gives X = 5 and X — 2. The eigenvalues come first, to 
make £> = 0, Then (A - 5/)x = 0 and (A - 2 1)x = 0 yield the eigenvectors. These jc’s 
go into y - e M x to solve differential equations — which come next. 


i6A EXERCISES 


Read-through questions 

If Ax = b and AX = B y then A times 2x 4- 3X equals a 
If Ax = 0 and AX = 0 then A times 2x + 3X equals b . 
In this case x and X are in the c of A t and so is the 
combination d , The nullspace contains all solutions to 
e . It is a subspace, which means t , If x = (1, 1, 1) is 
in the nullspace then the columns add to q . so they are 
(independent)(dependent). 

Another subspace is the h space of A t containing ail 
combinations of the columns. The system Ax = b can be 
solved when b is \ Otherwise the best solution comes 
from A 7 Ax = i Here A x is the k matrix, whose 

rows are 1 The nullspace of A 7 contains all solutions to 
m . The n space of A 7 (row space of A) is the fourth 
fundamental subspace. Each subspace has a basis containing 
as many o vectors as possible. The number of vectors in 
the basis is the p of the subspace. 

When Ax = Xx , the number X is an q and x is an 
f The equation dyjdt = Ay has the exponential solution 
y — a A 7 by 7 matrix has t eigenvalues, whose 
product is the u D . If D is nonzero the matrix A has an 
v Then Ax-b is solved by x= w . The formula 
for D contains 7! = 5040 terms, so x is better computed by 
x On the other hand Ax = Xx means that A— XI has 
determinant y The eigenvalue is computed before the 

_ i 


Find the nullspace In 1-6. Along with x go all cx. 


I A = 


3 C = 


1 2 
2 4 

1 o’ 
0 1 

L 1 2 . 


(solve Ax — 0) 


(solve Cx = 0) 


2 B = 


4 = 


12 -6 
-6 3 

1 0 1 

0 l 2 


5 E = 


1 1 
1 1 
1 1 


1 

1 

1 


6 F = 


2 1 
1 2 


7 Change Problem 1 to Ax = 


(a) Find any particular 


solution x P . (b) Add any x 0 from the nullspace and show that 
x P + x 0 is also a solution. 

r I_ l 

8 Change Problem 1 to Ax = and find all solutions. 

0 

Graph the lines x ^ -h2x 2 = 1 and 2x x +4x 2 = 0 in a plane. 


9 Suppose Ax P = b and /4x o = 0. Then by linearity 
AiXp + = Conclusion: The sum of a particular 

solution Xp and any nullvector x 0 is . 


10 Suppose Ax = b and Ax P = b. Then by linearity 

A(x - x P ) = . The difference between solutions is a 

vector in . Conclusion: Every solution has the form 

x — x F + x 0> one particular solution plus a vector in the 
nullspace > 

11 Find three vectors b in the column space of E. Find all 
vectors b for which Ex = b can be solved. 


12 If Ax = 0 then the rows of A are perpendicular to x. Draw 
the row space and nullspace (lines in a plane) for A above. 

13 Compute CC T and C T C. Why not C 2 1 

14 Show that Cx = b has no solution, if b = (—1, 1, 1). Find 
the best solution from C T Cx = C T b. 

15 C 7 has three columns. How many are independent? 
Which ones? 


16 Find two independent vectors that are in the column space 
of C but are not columns of C. 

17 For which of the matrices ABCEF are the columns a 
basis for the column space? 
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18 Explain the reasoning: If the columns of a matrix A are 
independent, the only solution to Ax = 0 is x = 0* 

19 Which of the matrices ABCEF have nonzero deter- 
minants? 


27 Compute the determinant of E — XI. Find all X y s that make 
this determinant zero* Which eigenvalue is repeated? 

28 Which previous problem found eigenvectors for Ex = 0x? 
Find an eigenvector for Ex = 3x* 


20 Find a basis for the full three-dimensional space using 
only vectors with positive components. 

21 Find the matrix F ~ 1 for which FF ~ 1 = / = 

22 Verify that (determinant of F) 2 = (determinant of F 2 ). 

23 (Important) Write down F — XI and compute its determi- 
nant. Find the two numbers X that make this determinant 
zero. For those two numbers find eigenvectors x such that 
Fx = kx, 

24 Compute G = F 2 , Find the determinant of G — kl and the 
two k * s that make it zero* For those k ' s find eigenvectors x 
such that Gx = kx. Conclusion: if Fx = kx then F z x = k 2 x * 

25 From Problem 23 find two exponential solutions to the 
equation dyjdt = Fy. Then find a combination of those 
solutions that starts from y 0 = (1, 0) at t = 0* 

26 From Problem 24 find two solutions to dyjdt — Gy. Then 
find the solution that starts from y 0 = (2, 1)^ 



29 Find the eigenvalues and eigenvectors of A * 

30 Explain the reasoning: A matrix has a zero eigenvalue if 
and only if its determinant is zero. 

31 Find the matrix H whose eigenvalues are 0 and 4 with 
eigenvectors (1, 1) and(l, —1). 

32 If Fx = kx then multiplying both sides by ____ gives 

F _1 x = A _1 x. If F has eigenvalues 1 and 3 then F“* has 
eigenvalues . The determinants of F and F _1 are 


33 


True or false, with a reason or an example* 
(a) The solutions to Ax = b form a subspace. 


(b) 


0 2 
0 0 


has 


r 

0 


in its nulls pace and column space* 


(c) A 7 A has the same entry in its upper right and lower 
left comers. 

(d) If Ax = kx then y = e 2t solves dyjdt — Ay. 

(e) If the columns of A are not independent, their combi- 
nations still form a subspace* 



16.2 Differential Equations 



We just solved differential equations by linear algebra. Those were special systems 
dyjdt — Ay, linear with constant coefficients* The solutions were exponentials, 
involving e* 1 * The eigenvalues of A were the “growth factors” k, This section solves 
other equations — by no means all* We concentrate on a few that have important 
applications* 

Return for a moment to the beginning — when direct integration was king: 

1. dyjdt = s(t) 2* dyjdt = cy 3* dyjdt = u(y)c(t). 

In 1, y(t) is the integral of s(t). In 2, y(t) is the integral of cy{ r)* That sounds circular — 
it only made sense after the discovery of y — e cf * This exponential has the correct 
derivative cy . To find it by integration instead of inventing it, separate y from t: 


Separation and integration also solve 3: J dyju[y) = j c(f)dt* The model logistic equation 
has u = y — y 2 = quadratic. Equation 2 has u = y = linear. Equation 1 is also a special 
case with u — 1 — constant * But 2 and 1 are very different, for the following reason. 

The compound interest equation y* = cy is growing from inside * The equation 
/ = s(t) is growing from outside. Where c is a “growth rate,” s is a “source,” They 
don’t have the same meaning, and they don't have the same units* The combination 
/ = cy + s was solved in Chapter 6, provided c and s are constant — but applications 
force us to go further* 
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In three examples we introduce non-constant source terms. 

EXAMPLE 1 Solve dy/dt = cy + s with the new source term s = e' K '. 

Method Substitute y = Be 1 ', with an “undetermined coefficient” B to make it right: 
kBe* = cBe*’ + e kt yields B = l/(fc - c). 

The source e*' is the driving term. The solution Be* 1 is the response. The exponent is 
the same! The key idea is to expect e kt in the response. 

Initial condition To match y 0 at t = 0, the solution needs another exponential. 
It is the free response Ae", which satisfies dyjdt = cy with no source. To make 
y = Ae" + Be 13 agree with y 0 , choose A = y 0 - B: 

Final solution y = (y 0 - B)e cl + Be*' = y 0 e“ + (e k ‘ - e“)/(k - c). (1 ) 

Exceptional case B= 1 j(k-c) grows larger as k approaches c. When k=c the 
method breaks down — the response Be*' is no longer correct. The solution (1) 
approaches 0/0, and in the limit we get a derivative. It has an extra factor t: 

e** — e ct change in e^' d 

— - = “ S -f (e cl ) = te a . (2) 

k - c change in c dc 

The correct response is te c! when k — c. This is the form to substitute, when the driving 
rate k equals the natural rate c (called resonance). 

Add the free response y 0 e cl to match the initial condition. 


EXAMPLE 2 Solve dyjdi = cy + s with the new source term s = cos kt. 

Substitute y = B sin kt + D cos kt. This has two undetermined coefficients B and D: 

kB cos kt — kD sin kt = c(B sin kt + D cos kt) + cos kt. (3) 

Matching cosines gives kB = cD+ 1. The sines give - kD = cB. Algebra gives B, D, y: 


B = 


k 2 + c 2 


Z> = 


k 2 + c 2 


y = 


c sin kt + k cos kt 
k 2 + c 2 ' 


(4) 


Question Why do we need both B sin kt and D cos kt in the response to cos kt? 

First Answer Equation (3) is impossible if we leave out B or D. 

Second Answer cos kt is %e iki + \e “ |l ‘'. So e lk< and e “ :k ‘ are both in the response. 


EXAMPLE 3 Solve dyjdt = cy + s with the new source term s = te kI . 

Method Look for y= Be + Dte M . Problem 13 determines B and D. Add Atf 1 as 
needed, to match the initial value y 0 ■ 


SECOND-ORDER EQUATIONS 


The equation dyjdt = cy is first-order. The equation d 2 y/dt 2 = - cy is second-order. 
The first is typical of problems in life sciences and economics — the rate dyjdt depends 
on the present situation y. The second is typical of engineering and physical sciences — 
the acceleration d z yjdt 2 enters the equation. 

If you put money in a bank, it starts growing immediately. If you turn the wheels 
of a car, it changes direction gradually. The path is a curve, not a sharp corner. 
Newton’s law is F = ma , not F ~ mv. 
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A mathematician compares a straight line to a parabola. The straight line crosses 
the x axis no more than once. The parabola can cross twice. The equation 
ax 2 + bx + c = 0 has two solutions , provided we allow them to be complex or equal. 
These are exactly the possibilities we face below: two real solutions, two complex 
solutions, or one solution that counts twice. The quadratic could be x 2 — 1 or x 2 + 1 
or x 2 . The roots are 1 and — 1, i and - i, 0 and 0. 

In solving differential equations the roots appear in the exponent, and are coiled X . 


EXAMPLE 4 

y" = +y: 

solutions 

y = d 

and 

y = e~ x 

A=l, -1 

EXAMPLE 5 

>> 

O 

II 

solutions 

y=l 

and 

y = t 

II 

p 

O 

EXAMPLE 6 

y" ~ —y: 

solutions 

y = cos t 

and 

y = sin t 

A - i, - i 


Where are the complex solutions ? They are hidden in Example 6, which could be 
written y = e [t and y = e~ u . These satisfy y" = — y since i 2 — — 1. The use of sines and 
cosines avoids the imaginary number i, but it breaks the pattern of e Xi . 

Example 5 also seems to break the pattern — again e M is hidden. The solution y — 1 
is e 0t . The other solution y - r is te°\ The zero exponent is repeated — another excep- 
tional case that needs an extra factor r. 

Exponentials solve every equation with constant coefficients and zero right hand side. 

To solve ay" + fry' + cy = 0 substitute y ~ e Xt and find X. 

This method has three steps, leading to the right exponents X = r and X = s: 

1 . With y =• e u the equation is aX 2 e Xl + bXe ki + ce h — 0. Cancel e 2 *. 

2. Solve aX 2 + bX + c — 0. Factor or use the formula X — (— b ± y/b 2 — 4ac)f2a. 

3. Call those roots X—r and A = s. The complete solution is y = Ae rt + Be 

The pure exponentials are y — e" 1 and y — e* r . Depending on r and s, they grow or 
decay or oscillate. They are combined with constants A and B to match the two 
conditions at t = 0. The initial state y 0 equals A + B. The initial velocity y' 0 equals 
rA + sB (the derivative at t = 0). 

EXAMPLE 7 Solve / - 3/ + 2y = 0 with y 0 — 5 and y' 0 = 4. 

Step 1 substitutes y = e Xx . The equation becomes X 2 e Xt — ZXe* + 2e * = 0. Cancel e lt . 
Step 2 solves )}- 3 A + 2 = 0. Factor into (X - l)(A - 2)- 0. The exponents r, s are 1,2. 
Step 3 produces y = Ae* + Be 2t . The initial conditions give A A- B = 5 and l A + 2B = 4. 
The constants are A — 6 and B = — 1. The solution is y— 6e x - e 2t . 

This solution grows because there is a positive X. The equation is “unstable.’' It 
becomes stable when the middle term - 3y' is changed to + 3y'. When the damping 
is positive the solution decays. The A’s are negative : 

EXAMPLE ft (A 2 + 3 A + 2) factors into (A + 1)(A + 2). The exponents are - 1 and - 2. 
The solution is y = Ae~ % + Be~ 2 \ It decays to zero for any initial condition. 

EXAMPLES 9-10 Solve y" + 2y + 2y = 0 and y" + 2/ + y = 0. How do they differ? 

Key difference A 2 + 2A + 2 has complex roots. A 2 4- 2A + 1 has a repeated root: 

A 2 + 2A + 2 = 0 gives A - - 1 ± / (A+1) 2 = 0 gives — 1, -I. 

The —1 in all these A’s means decay. The i means oscillation. The first exponential 
is e { ~ 1+l)t , which splits into e~ x (decay) times e tf ( oscillation ). Even better, change e lt 



606 


16 Mathematic* after Calculus 


and e u into cosines and sines: 

y = Ae { ~ l +,)r + = e~\a cos t + b sin t). (5) 

At t ~ 0 this produces y Q = a. Then matching y* 0 leads to b * 

Example 10 has r = $= - 1 (repeated root)* One solution is e~* as usual. The 
second solution cannot be another e~ l . Problem 21 shows that it is te~ l — again the 
exceptional case multiplies by t! The general solution is y = Ae ^ + Bte~\ 

Without the damping term 2/, these examples are /' + 2y - 0 or y" + y - 0 — pure 
oscillation* A small amount of damping mixes oscillation and decay* Large damping 
gives pure decay. The borderline is when k is repeated ( r~s )* That occurs when 
b 2 - 4ac in the square root is zero* The borderline between two real roots and two 
complex roots is two repeated roots . 

The method of solution comes down to one idea: Substitute y — e Xt . The equations 
apply to mechanical vibrations and electrical circuits (also other things, but those 
two are of prime importance). While describing these applications I will collect the 
information that comes from X. 

SPRINGS AND CIRCUITS: MECHANICAL AND ELECTRICAL ENGINEERING 

A mass is hanging from a spring* We pull it down an extra distance y Q and give it a 
starting velocity y r 0 > The mass moves up or down, obeying Newton’s law: mass times 
acceleration equals spring force plus damping force: 

my" — — ky — dy* or my" + dy* + ky = 0* (6) 

This is free oscillation * The spring force -ky is proportional to the stretching y 
(Hooke’s law)* The damping acts like a shock absorber or air resistance — it takes 
out energy. Whether the system goes directly toward zero or swings back and forth 
is decided by the three numbers m T d, fc* They were previously called a , b , c* 


16A The solutions to my" + dy + ky = 0 are controlled by the roots of 
mk 1 + dk + k — 0. With d > 0 there is damping and decay* From N /d 2 — 4 mk 
there may be oscillation: 

overdamping: d 2 > 4 mk gives real roots and pure decay (Example 8) 
underdamping: d 2 < Amk gives complex roots and oscillation (Example 9) 
critical damping: d 2 = 4mifc gives a real repeated root -d/2m (Example 10) 


We are using letters when the examples had numbers, but the results are the same: 

mk 2 + dk + k - 0 has roots r, s = - + J— Jd 1 - 4mk. 

2m ~ 2m v 

Overdamping has no imaginary pans or oscillations: y = Atf 1 + B&\ Critical damping 
has r = s and an exceptional solution with an extra t: y = Ae rt + Bte r K ( This is only a 
solution when r = s.) Underdamping has decay from -dj2m and oscillation from the 
imaginary part. An undamped spring (d = 0) has pure oscillation at the natural fre- 
quency CtJq = yfkjm. 

Alt these possibilities are in Figure 16.2, created by Alar Toomre. At the top is pure 
oscillation (d = 0 and y = cos 2t). The equation is y" + dy 1 + 4y = 0 and d starts to 
grow. When d reaches 4, the quadratic is A 2 + 4A + 4 or (A + 2) 2 . The repeated root 
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Fig. 16.2 


yields e~ 2 ' and te~ 2 '. After that the oscillation is gone. There is a smooth transition 
from one case to the next — as complex roots join in the repeated root and split into 
real roots. 

At the bottom right, the final value y(2n) increases with large damping. This was 
a surprise. At d = 5 the roots are - 1 and -4. At d = 8.5 the roots are - i and - 8. 
The small root gives slow decay (like molasses). As d -* co the solution approaches 
y = 1. 

If we are serious about using mathematics, we should take advantage of anything 
that helps. For second-order equations, the formulas look clumsy but the examples 
are quite neat. The idea of e 2 ' is absolutely basic. The good thing is that electrical 
circuits satisfy the same eqution. There is a beautiful analogy between springs and 
circuits: 

mass m *-* inductance L 


damping constant d «-► resistance R 
elastic constant k *-* 1 /(capacitance C) 

The resistor takes out energy as the shock absorber did— converting into heat by 
friction. Without resistance we have pure oscillation. Electric charge is stored in 
the capacitor (like potential energy). It is released as current (like kinetic energy). 
It is stored^ up again (like a stretched spring). This continues at a frequency 
co 0 =ll s / r LC (like the spring’s natural frequency yfkiin). These analogies turn 
mechanical engineers into electrical engineers and vice versa. 

The equation for the current y(t) now includes a driving term on the right: 


dy 

L i +Ry+ 


-cl 


y dt = applied voltage = V sin cot. 


(7) 


To match networks with springs, differentiate both sides of (7): 

Ly" + Ry' + yjC = Vco cos cot. (8) 

The oscillations are free when V = 0 and forced when V * 0. The free oscillations 
e x ‘ are controlled by L/. 2 + R/.+ 1/C = 0. Notice the undamped case R = 0 when 
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A = + ij^/Lc. This shows the natural frequency o> 0 = l/^/LC. Damped free oscilla- 
tions are in the exercises — what is new and important is the forcing from the right 
hand side. Our last step is to solve equation (8). 

PARTICULAR SOLUTIONS— THE METHOD OF UNDETERMINED COEFFICIENTS 

The forcing term is a multiple of cos on. The “ particular solution ” is a multiple 
of cos cut phis a multiple of sin on. To discover the undetermined coefficients in 
y = a cos on + b sin cot, substitute into the differential equation (8): 

- Lco 2 {a cos cot+b sin cot) + Rcu(- a sin oit + b cos cot) 

+ ( a cos cot + b sin cot)jC — Vco cos on. 

The terms in cos on and the terms in sin on give two equations for a and b: 

~aLa> 2 + bRco+ a/C= Vco and - bLco 2 - aRco + bjC = 0. (9) 

EXAMPLE 1 1 Solve /' + y = cos on. The oscillations are forced at frequency co. The 
oscillations are free (_>•" = 0) at frequency L The solution contains both. 

Particular solution Set y — a cos an + b sin cot at the driving frequency and (9) 
becomes 

- aa) 2 + 0 + a - 1 and -boo 1 - 0+ £> - 0. 

The second equation gives b = 0. No sines are needed because the problem has no 
dyjdt. The first equation gives a — 1/(1 - o> 2 ), which multiplies the cosine: 

y = (cos on)j{ 1 — (D 1 ) solves y" + y — cos cot. (10) 

General solution Add to this particular solution any solution to y" + y = 0: 

> ~ JWicular + homogeneous = (COS 0>t)/(l “ CO 1 ) + C COS t + D Sin t. (11) 

Problem of resonance When the driving frequency is gj=I, the solution (11) 
becomes meaningless— its denominator is zero. Reason: The natural frequency in 
cos t and sin t is also 1. A new particular solution comes from t cos t and t sin t. 

The key to success is to know the form for y . The table displays four right hand 
sides and the correct /s for any constant-coefficient equation: 

Right hand side Particular solution 

^ y= B ^ (same exponent) 

cos on or sin on y = a cos on + b sin on (include both) 

polynomial in t y — polynomial of the same degree 

e kt cos o)t or e* 1 sin on y = ae kl cos on + b^ 1 sin oA 

Exception If one of the roots k for free oscillation equals k or m or 0 or k + ro>, the 
corresponding y in the table is wrong. The proposed solution would give zero on the 
right hand side. The correct form for y includes an extra r. All particular solutions 
are computed by substituting into the differential equation. 

Apology Constant-coefficient equations hardly use calculus (only e Xt ). They reduce 
directly to algebra (substitute y, solve for k and a and b). I find the S-curve from the 
logistic equation much more remarkable. The nonlinearity of epidemics or heartbeats 
or earthquakes demands all the calculus we know. The solution is not so predictable. 
The extreme of unpredictability came when Lorenz studied weather prediction and 
discovered chaos. 
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NUMERICAL METHODS 

Those four pages explained how to solve linear equations with constant coefficients: 
Substitute y = e The list of special solutions becomes longer in a course on 
differential equations. But for most nonlinear problems we enter another world — 
where solutions are numerical and approximate, not exact. 

In actual practice, numerical methods for dyjdt =/(t, y) divide in two groups: 

L Single-step methods like Euler and Runge-Kutta 
2. Multistep methods like Adams- Bashforth 

The unknown y and the right side / can be vectors with n components. The notation 
stays the same: the step is At = h, the time r„ is nh 9 and y n is the approximation to 
the true y at that time. We test the first step, to find y x from y 0 = 1. The equation is 
dyjdt = y, so the right side is /= y and the true solution is y = e\ 

Notice how the first value of f (in this case 1) is used inside the second /: 

Improved Euler y„ + 1 = y„ + i /»[/(£„, y,,) +/{«„+ 1 , y„ + hf(y K , £„))] 

TEST y t =l+i/i[l+(l + A)]=lT£i + |/i 2 

At time h the true solution equals e*. Its infinite series is correct through h 2 
for Improved Euler (a second-order method). The ordinary Euler method 
y„ + 1 = y» + y n ) is first-order. TEST: y x = 1 + h. Now try Runge-Kutta 
(a fourth-order method): 

Runge-Kutta y n + 1 = y„ + s[fc, + 2fc 2 + 2fc 3 + k + ] with y n ) 

k 2 = + + ky = hf(t n +\K y n + ^i) k 4 = hf(t n + h, y K + k 3 ) 

Now the first value of / is used in the second (for fc 2 )> the second is used in the third, 
and then is used in k 4 . The programming is easy. Check the accuracy with another 
test on dyjdt = y: 

TESTy, = 1 + 2*(l + 1) +2Ji(l + 1( t + 1)) + *(l + /,(l + |(l + 1))) 

/i 2 h* /i 4 

= l + — + — + — . This answer agrees with e* through h\ 

These formulas are included in the book so that you can apply them directly — 
for example to see the S-shape from the logistic equation with j—cy- by 2 , 
Multistep formulas are simpler and quicker, but they need a single-step method to 
get started. Here is y 4 in a fourth-order formula that needs yo> yi> y 2 > y 3 - Just shift 
all indices for y 5J y 6 , and y ft + 1 : 

Muhistep y 4 = y 3 + -^ [55y' 3 - 59 / 2 + 37^ - 9/ 0 ]. 

The advantage is that each step needs only one new evaluation of y' K y„). 

Runge-Kutta needs four evaluations for the same accuracy. 

Stability is the key requirement for any method. Now the good test is / = - y. The 
solution should decay and not blow up. Section 6.6 showed how a large time step 
makes Euler’s method unstable — the same will happen for more accurate formulas, 
The price of total stability is an “implicit method” like y, = y 0 + |/i(y 0 + y\ ), 
where the unknown y 4 appears also in y\ . There is an equation to be solved at every 
step. Calculus is ending as it started — with the methods of Isaac Newton. 
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16,2 EXERCISES 


Read-through questions 

The solution to y* — 5y=lQ is y = Ae**-\-B+ The homo- 
geneous part Ae* J satisfies y' — 5y= o The particu- 
lar solution B equals b The initial condition is 

matched by A — c . For y* — 5y = the right form 
is v = Ae 5 * + d . For y — 5y = cos t the form is 
y = Ae $t + e + I 

The equation y' + 4y + 5y = 0 is second-order because 
fl . The pure exponential solutions come from the roots 
of h which are r = i and s = I The general 
solution is v = fc . Changing 4 y to I yields pure 
oscillation. Changing to 2 y' yields X= — l±2i t when the 
solutions become v = m . This oscillation is 
(over) (under) (critically) damped, A spring with m = 1, d — 2, 
k = 5 goes (back and forth) (directly to zero). An electrical 
network with L= 1, R = 2> C = \ also n 

One particular solution of y" + 4y = e* is e* times 
o , If the right side is cos f, the form of y p is p . If the 
right side is 1 then y p = q , If the right side is r we 
have resonance and y p contains an extra factor i 

Problems 1-14 are about first-order linear equations, 

1 Substitute y = Be 3 * into y' — y = Be 2 * to find a particular 
solution. 

2 Substitute y = a cos 2t + b sin 2r into y + y = 4sin2f to 
find a particular solution. 

3 Substitute y = a + bt + ct 2 into y + y = I + r 2 to find a 
particular solution. 

4 Substitute y = ae * cos f + Vsin r into / = 2e r cos t to find 
a particular solution. 

5 In Problem 1 we can add Ae* because this solves the equa- 
tion . Choose A so that y(Q) = 7. 

6 In Problem 2 we can add Ae~* t which solves . 

Choose A to match y(Q) = 0. 

7 In Problem 3 we add to match y{ 0) = 2. 

8 In Problem 4 we can add y = A, Why? 

9 Starting from y Q = 0 solve y = and also solve y = 1. 
Show that the first solution approaches the second as k -* 0. 

10 Solve y — y = ^* starting from y 0 — 0. What happens to 
your formula as k -+ I? By THopitaTs rule show that y 
approaches te* as k 1, 

11 Solve y — y = e* + cos t. What form do you assume for y 
with two terms on the right side? 

12 Solve y + y = e* + f. What form to assume for y? 

13 Solve / = cy + te * following Example 3 (c / 1). 


14 Solve y = y A- 1 following Example 3 (c = 1 and k — 0). 


Problems 15-28 are about second-order linear equations. 

15 Substitute y = e Xt into f + 6y' + 5y — 0. (a) Find all A’s. 

(b) The solution decays because . (c) The general 

solution with constants A and B is . 

16 Substitute y = into y' + 9y = 0. (a) Find all A’s. (b) The 

solution oscillates because . (c) The general solution 

with constants a and b is , 

17 Substitute y = e Xt into y* + 2y + 3y = 0. Find both A*s. 

The solution oscillates as it decays because . The 

general solution with A and B and e 1 * is . The 

general solution with e~ * times sine and cosine is . 

18 Substitute y = e 2 * into y" + 6y + 9y = 0. (a) Find all X's. 

(b) The general solution with e 2 * and te 2 * is . 

19 For y" + dy ( + y = 0 find the type of damping at 
4 = 0, 1, 2, 3. 

20 For y' + 2y + J!ty = 0 find l he type of damping at 
= 0,1, 2. 

21 If 2 2 + i>A + c = 0 has a repeated root prove it is X = 
— bjl . In this case compute y" + by 1 A cy when y = te x *. 

22 X 2 + 32. + 2 = 0 has roots —1 and —2 (not repeated). Show 
that te~ * does not solve y " + 3y + 2y = 0. 

23 Find y = a cos t + b sin t to solve y" + y + y = cos r. 

24 Find y — a cos cot + b sin cot to solve y" H- y* + y = sin cot, 

25 Solve y' + 9y = cos St with y 0 = 0 and y’ Q = 0, The solution 
contains cos 3r and cos 5f. 

26 The difference cos St — cos 3t equals 2 sin 4f sin t. Graph 
it to see fast oscillations inside slow oscillations (beats). 

27 The solution to y n -t-coly = cos cot with y Q = 0 and 

yo = 0 is what multiple of cos cot — cos tu 0 r? The formula 
breaks down when to = . 

28 Substitute y = Ae l<i>i into the circuit equation 
Ly* + Ry + j y dtjC = Ve i<0 *. Cancel e imt to find A. Its denomi- 
nator is the impedance , 

Problems 29-32 have the four right sides in the table (end of 
section). Find ^particular by using the correct form. 

29 / + 3y = e st 30 f + 3y = sin t 

31 /' -h 2y = 1 + r 32 /' + 2 y = e* cos t. 

33 Find the coefficients of y in Problems 29-31 for which the 
forms in the table are wrong. Why are they wrong? What new 
forms are correct? 
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34 The magic factor f entered equation (2). The series for 
e** — e* 1 starts with 1 + kt +\k 2 t 2 minus 1 + ct + \c 2 t 2 - Divide 
by it — c and set Jfc = c to start the series for te*K 

35 Find four exponentials y = e lt for d 4 yfdt 4 — y = 0. 

36 Find a particular solution to d 4 yjdt 4 + y^eK 

37 The solution is y = Ae~ 2t + Bte~ 2t when 4 = 4 in 
Figure 16.2. Choose A and B to match j 0 = l and / 0 = 0. 
How large is y(2n)? 

38 When d reaches 5 the quadratic for Figure 16*2 is 
X 2 + $X + 4 = (X + l)(X + 4). Match y = Ae~ l + Be M to 
y 0 = 1 and y' 0 = 0. How large is y(2it)l 

39 When the quadratic for Figure 16.2 has roots — r and 
— 4/i\ the solution is y — Ae~ n + Be -4f/r . 

(a) Match the initial conditions y 0 = 1 and y' G = 0. 

(b) Show that y approaches 1 as r -* 0, 


40 In one sentence tell why f = 6y has exponential solutions 
but y"-6y 2 does not. What power y = x n solves this 
equation? 

41 The solution to dy/dt =/(t), with no y on the right side, 
is y = j f(t) dt. Show that the Runge-Kutta method becomes 
Simpson’s Rule. 

42 Test all methods on the logistic equation y r = y — y 2 to 
see which gives = 1 most accurately* Start at the inflection 
point ya = i with h=^ j* Begin the multistep method with 
exact values of y = (1 +e“ f ) _1 , 

43 Extend the tests of Improved Euler and Rungp-Kutta to 
y = — y with y 0 = L They are stable if |yi |< L How large 
can h be? 

44 Apply Runge-Kutta to / = — lOOy + 100 sin t with 
y o = 0 and h = . 02* Increase h to .03 to see that instability 
is no joke* 
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Discrete mathematics is not like calculus. Everything isjinite. I can start with the SO 
states of the U.S. I ask if Maine is connected to California, by a path through 
neighboring states. You say yes. I ask for the shortest path (fewest states on the way). 
You get a map and try all possibilities (not really all — but your answer is right). Then 
I close all boundaries between states like Illinois and Indiana, because one has an 
even number of letters and the other has an odd number. Is New York s till connected 
to Washington ? You ask what kind of game this is — but I hope you will read on. 

Far from being dumb, or easy, or useless, discrete mathematics asks good questions. 
It is important to know the fastest way across the country. It is more important to 
know the fastest way through a phone network. When you call long distance, a quick 
connection has to be found. Some lines are tied up, like Illinois to Indiana, and there 
is no way to try every route. 

The example connects New York to New Jersey (7 letters and 9). Washington is 
connected to Oregon (10 letters and 6). As you read those words, your mind jumps 
to this fact — there is no path from New York with 7 letters to Washington with 10. 
Somewhere you must get stuck. There might be a path between all states with an 
odd number of letters — I doubt it. Graph theory gives a way to find out. 

GRAPHS 

A model for a large part of finite mathematics is a graph. It is not the graph of 
y = ji x )- The word “graph” is used in a totally different way, for a collection of nodes 
and edges. The nodes are like the 50 states. The edges go between two nodes — the 
neighboring states. A network of computers fits this model. So do the airline connec- 
tions between cities. A pair of cities may or may not have an edge between them — 
depending on flight schedules. The model is determined by V and £. 
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DEFINITION A graph is a set V of nodes {or vertices) and a set £ of edges. 

EXAMPLE 1 How many edges are possible with n nodes, in a complete graphl 

The first node has edges to the n — 1 other nodes. (An edge to itself is not allowed.) 
The second node has n-2 new edges. The third node has another n — 3. The total 
count of edges, when none are missing, is the sum from Section 5.3: 

1 + 2 H h (n - 1) - \n(n — 1) edges in a complete graph. 

Fifty states have 25*49= 1225 possible edges. The “neighboring states graph” has 
less than 200. A line of 6 nodes has 5 edges, out of * • 6 • 5 “ 15 possible. 

EXAMPLE 2 Which states with an odd number of letters are reachable from New 
York? Boundaries to states like Pennsylvania (12 letters) are closed. 

Method of solution Start from New York (7). There is an edge to Connecticut (1 1). 
That touches Massachusetts (13), which is a neighbor of Vermont (7). But we missed 
Rhode Island, and how do we get back? The order depends on our search method — 
and two methods are specially important. 

Depth first search (DFS) “From the current state, go to one new state if possible.” 
But what do we do from Vermont, when New Hampshire (12) is not allowed? The 
answer is: backtrack to Massachusetts. That becomes the next current state. 

We label every state as we reach it, to show which state we came from. Then VT 
has the label MA, and we easily cross back. From M A we go to Rl. Then backtrack 
to MA and CT and NY. At every step I searched for a new state with no success. 
From NY we see NJ (9). Finally we are in a comer. 

The depth first search is ended, by a barrier of even states. Unless we allow Ontario 
and keep going to Minnesota. 

Breadth first search (BFS) “From the current state, add all possible new states to the 
bottom of the list. But take the next current state from the top of the list.” There is 
no need to backtrack. 

From NY we reach VT and MA and CT and NJ. What comes next? 

Where DFS moves from the last possible state, breadth first search moves from 
the first possible state. No move from VT is possible — so we “scan” from Massachu- 
setts. We see Rhode Island (barely). That ends BFS. 

The same six states are reached both ways. Only the order is different. DFS is last 
in -first out . BFS is first in- first out . You have the same choice in drawing a family 
tree — follow a path as far as it goes and backtrack, or list all brothers and sisters 
before their children. The BFS graph in Figure 16.3 is a tree. So is the DFS graph, 
using forward edges only. 



Fig. 16.3 Search trees from New York. The minimum spanning tree. 
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DEFINITION A tree is a connected graph with no bops . Its N nodes are connected 
by N — I edges. If N = n, so every node is in the tree, it is a spanning tree . 

The path from VA to KY to TN to NC to VA is a loop (or cycle). If one of those four 
edges is removed, we have a tree. If two edges are removed, we have two trees (a 
small forest). 

EXAMPLE 3 Allow an edge between neighboring states only when one state is even 
and the other is odd . Are the lower 48 states connected? 

Start anywhere — say California, Apply either type of search — maybe DFS. Go to 
Arizona (7) tben Utah (4) then WY (9) then CO (8) then NM then OK then TX. 
(I am writing this on an airplane, looking at the map.) We will never get to Florida! 
It is blocked by Alabama and Georgia. 

The search creates a tree, but not a spanning tree. This graph is not connected. 
An odd-to-even graph is special and important. It is called “ bipartite ” meaning 
two parts . The odd states are in one part, the even states are in the other. AH edges 
go between parts. No edges are within a part.t 

EXAMPLE 4 Is there a “complete matching” between 25 even and 25 odd states? 
This requires neighboring states to be paired off (with no repetition). 

Method 1 Start pairing them off: CA-AZ, UT-WY, NV-ID, NE-SD, WA-MT. 
What about Oregon? Maybe it should have been paired with Idaho. Then Nevada 
could pair with Arizona. Trial and error goes nowhere fast. 

Method 2 Think first. The four states CA-OR-WA-NV are even. This whole group 
is only connected to three odd states (AZ, ID, MT). The matching is impossible. 


168 A complete matching is impossible if a group of nodes in one part is 
connected to fewer nodes m the other part. If every group has enough connec- 
tions, the matching can be achieved 


This is Hall’s Theorem. In a course on graphs, it would be proved. Our purpose here 
is to see the ideas and questions in discrete mathematics, more than the proofs. 

THE GREEDY ALGORITHM 

Put back all edges between neighboring states. The nodes could be provinces of 
Canada or states of Australia. Lf they are countries of Europe- Asia- Africa (or the 
Americas), we need a new map. The essential thing is the new problem. 

In a network each edge has a “length” A positive number c i} is assigned to the edge 
from node i to node j. In an economics problem, c i} is the cost . In a flow problem it 
is the capacity, in an electrical circuit it is the conductance , We look for paths that 
minimize these “lengths.” 

PROBLEM Find the minimum spanning tree , Connect all nodes by a tree with the 
smallest possible total length. 

The six cheapest highways connecting seven cities form a minimum spanning tree. It 
is cheapest to build, not cheapest to drive — you have to follow the tree. Where there 


fExactly half the states have an even number of letters (a real trivia question). This is the little- 
known reason for admitting Alaska and Hawaii. 
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is no edge we set c i} — oo (or an extremely large value, in an actual code). Then the 
algorithm works with a complete network — all n(n- l)/2 edges are allowed. How 
does it find the minimum spanning tree in Figure 16.3c? 

Method 1 Always add the shortest edge that goes out from the current tree . 

Starting from node s , this rule chooses edges of length 1, 2, 7, 4, 3. Now it skips 5, 
which would close a loop. It chooses 6, for total length 23. 

Method 2 Add edges in order , from shortest to longest . Reject an edge that closes a 
loop. Several trees grow together (a forest). At the end we have a minimum spanning 
tree. 

This variation chooses edge lengths in the order 1, 2, 3, 4, 6 (rejecting 5), 7. In our 
network both methods produce the same tree. When many edges have equal length, 
there can be many shortest trees. 

These methods are examples of the Greedy Algorithm: Do the best thing at every 
step . Don’t look ahead. Stick to a decision once it is made. In most network problems 
the Greedy Algorithm is not optimal — in this spanning tree problem it is. 

Method 2 looks faster than Method L Sort the edges by length, and go down the 
list. Just avoid loops. But sorting takes time! It is a fascinating problem in itself — 
bubble sort or insertion sort or heapsort. We go on to a final example of discrete 
mathematics and its algorithms. 

PROBLEM Find the shortest path from the source node s to each other node . 

The shortest path may not go along the minimum spanning tree. In the figure, the 
best path going east has length 1 + 8. There is a new shortest path tree , in which the 
source plays a special role as the “root.” 

How do we find shortest paths? Listing all possibilities is more or less insane. A 
good algorithm builds out from the source, selecting one new edge at every step. 
After k steps we know the distances d u d k to the k nearest nodes. 

Algorithm: Minimize d, + c u over all settled nodes i and all remaining nodes j. 

The best new node j is a distance c u from a settled node, which is a distance d k from 
the source. In the example network, the first edges are 1, 2, 7. Next is 8. The northeast 
node is closest to the source at this step. The final tree does not use edges 3, 5, 6— 
even though they are short. 

These pages were written to show you the algorithmic part of discrete mathematics. 
The other part is algebra — permutations, partitions, groups, counting problems, 
generating functions. There is no calculus, but that’s fair. The rest of the book was 
written to show what calculus can do’ — / hope very much that you enjoyed it. 

Thank you for reading, and thinking, and working. 


16,3 EXERCISES 


Read-through questions 

A graph is a set V of q and a set E of & . With 6 

nodes, a complete graph has c edges, A spanning tree 
has only d . A tree is defined as b , and it is spanning 
if t It has q path between each pair of nodes. 


To find a path from node i to node j, two search methods 
are h As nodes are reached, DFS looks out from the 

t node for a new one. BFS looks out from i DFS 

must be prepared to k to earlier nodes. In case of fire, 
BFS locates all doors from the room you are in before i 
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In a bipartite graph, all edges go from one part to w , 
A matching is impossible if k nodes in one pan are connected 
to n nodes in the other part. The edges in a network have 
o Cij> A minimum spanning tree is p . It can be found 
by the q algorithm, which accepts the shortest edge to a 
new node without worrying about r 

1 Stan from one node of a hexagon (six nodes, six edges)* 
Number the other nodes by (a) breadth first search (b) depth 
first search. 

2 Draw two squares with one node in common (7-node 
graph). From that node number all others by DFS and BFS. 
Indicate backtracks. 

3 How many spanning trees in the hexagon graph? 

4 Draw a spanning tree in the two-square graph. How many 
spanning trees does it have? 

5 Define a connected graph. If a graph has 7 edges and 9 
nodes, prove that it is not connected. 

6 Define a loop. If a connected graph has 8 edges and 9 
nodes, prove that it has no loops. 

7 Find the shortest path (minimum number of edges) from 
Maine to California. 

8 Which state is farthest (how many edges are needed) from 
the state you are in? Why would it come last in BFS? 

9 List the steps of BFS from your state to Georgia or 
Colorado or New Jersey. (There are edges Hawaii-Califorma 
and Ala ska- Washing ton.) 

10 With edges between odd neighboring states and between 
even neighbors, what is the largest connected set of states? 
Map required. 

11 With edges only from odd to even neighbors, how many 
states can be matched? (Answer unknown to author — please 
advise.) 

12 A matching is a forest of two-node trees. Give another 
description. 

13 Find the minimum spanning tree for network A. 

14 Find the shortest path tree from the center of network A. 

15 Is there a complete matching between left and right nodes 
in graph B? If not, which group of nodes has too few 
connections? 




16 Find the loop in network B. Thea find a minimum span- 
ning tree by Method 1 and Method 2. 

17 How many spanning trees in graph B? It has one loop, 

18 Show that a graph cannot have 0, 1, 2, 3, and 4 edges 
going into its five nodes. 

19 If the only edges into a node have lengths 6 and 8, can 
they both be in a minimum spanning tree? 

20 In Problem 19, prove that a minimum spanning tree con- 
tains edge (6) if it contains edge (8). 

21 True or false , with reason or example. 

(a) In a complete network, the minimum spanning tree 
contains the n — 1 shortest edges. 

(b) If a graph has 9 nodes and 9 edges, it has a loop. 

(c) A graph with a complete matching must be connected. 

22 Draw a tree that is perfect for (a) DFS; (b) BFS. 

23 The adjacency matrix has a y = 1 if there is an edge from 
node i to node j. Write down this matrix for graphs A and B. 

24 In a complete network start with = c if . Show that the 
d t j at the end of this program are shortest distances: 

for i = 1 to n do 
for ; — 1 to n do 
for k = 1 to n do 
d t j = max(d 0 , d, k + d kJ ) 

25 How many spanning trees in graph A? 

26 A maximum spanning tree has greatest possible length. 
Give an algorithm to find it. 

27 Write a code that will find a spanning tree (or stop), given 
a list of edges like (1, 2), (1, 3), (4, 7), .... 
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Answers to Odd-Numbered Problems 


CHAPTER 1 INTRODUCTION TO CALCULUS 
Section 1.1 Velocity and Distance (page 6) 


1 v — 30,0, — 30; v = — 10, 20 3 u(t) = 


2 for 0 < t < 10 
1 for 10 < t < 20 v(£) = 
—3 for 20 < t < 30 


5 25; 22; t + 10 7 6; -30 9 o(t) = { \ J \ f(t) = j 


0 for 0 < t < T 

± for T < t < 2 T 

0 for 2T < t < 3 T 

7 £ Hi 11 

X 

13 f(t) = 0, 30(f - 1), 30; f(t) = -30 1, -60, 30(t - 6) 15 Average 8, 20 17 40 1 - 80 for 1 < t < 2.5 

21 0 < t < 3,-40 </< 20; 0 < t < 3T,0 </< 60T 23 3 -It 25 6t - 2 27 3* + 7 

8 for 0 < t < T v _ ( 8 1 for 0 < t < T 

-2 for T < t < 5T 1 1 " \ 10T - 2t for T < t < 5T 

iooo > — wu?\/i oivi& luv/WA 37 1 ^ t ^ 5j 0 ^ y ^ 2 

39 0 < t < 5,0 < / < 4 41 0 < t < 5,1 < t < 32 43 \t + 4; \t + \\ 2t + 12; 2t + 3 

45 Domains — 1 < t < 1 : ranges 0 < 2t + 2 < 4, — 3 < t — 2 < — 1, -2 < -f(t) < 0, 0 < f(-t) < 2 


29 Slope -2; 1 < / < 9 


31 


*'!■)={ —2 


33 |(7-f 32; slope | 35 f(w) = slope = conversion factor 


47 IV;§V 


49 input * input — > A 
input + A — ► output 


input * input — ♦ A B * B —> C input + 1 — ► A 

input +A — ► £ B + C — ► output A * A — ► £ 

A + B — ► output 

51 3£ + 5, 3£ + 1, 6£ - 2, 6£ - 1, -3£ — 1, 9t — 4; slopes 3, 3, 6, 6, -3, 9 

! - 2*) 3 < £ < 4.5 

4.5 < £ < 6 


Borp , , „ , , . f 2 ( 2 *) 0 < £ < 1.5 12-2(12- 

53 The graph goes up and down twxce. /(/(<)) = | 12- 4* 1.5 < t < 3 2(12 - 2 1) 


Section 1.2 Calculus Without Limits (page 14) 

1 2 + 5 + 3 = 10; / = 1, 3, 8, 11; 10 3 / = 3, 4, 6, 7, 7, 6; max / at v = 0 or at break from v = 1 to — 1 

5 1.1, -2, 5; /(6) = 6.6, -11, 4; /(7) = 7.7, -13,9 7 f(t) = 2t for t < 5,10 + 3(f- 5) for t > 5;/(10) = 25 

9 7, 28, 8£ + 4; multiply slopes 11 /(8) = 8.8, -15, 14; = 1.1, -2, 5 

13 f(x) = 3052.50 + .28(z — 20, 350); then 11,158.50 is /(49, 300) 15 19|% 

17 Credit subtracts 1,000, deduction only subtracts 15% of 1000 19 All Vj = 2;i>y = (— l) J_1 ;vy = (\) 3 

21 L’s have area 1,3, 5, 7 23 fj = j\ sum j 2 -f j; sum § 25 (101 2 — 99 2 )/2 = ^ 27 Vj = 2 j 29 / 31 = 5 

31 a 3 - = - fj 33 0; 1; .1 35 t; = 2, 6, 18, 54; 2 • 37 AZ = 1, .7177, .6956, .6934 -► In 2 = .6931 in Chapter 6 

39 Vj — ~{\) 3 41 Vj = 2(— l) J , sum is fj — 1 45 t; = 1000, £ = \0/V 

47 M, N 51 v/9 < 2 • 9 < 9 2 < 2 9 ; (|) 2 < 2(|) < < 2 1 / 9 

Section 1.3 The Velocity at an Instant (page 21) 


16,6,^0,-12,0,13 3 4, 3.1, 3 -f h, 2.9 5 Velocity at t— 1 is 3 7 Area / = £ + £ 2 , slope of / is 1 -f 2£ 

9 F; F; F; T 11 2; 2 £ 13 12 -f 10£ 2 ; 2 + 10£ 2 15 Time 2, height 1, stays above | from £ = | to | 

17 /(6) = 18 21 u(£) = — 2£ then 2 £ 23 Average to £ = 5 is 2; v(5) = 7 25 4u(4£) 27 v ave = £, u(£) = 2£ 

Section 1.4 Circular Motion (page 28) 


1 107T, (0, —1), (—1, 0) 3 (4 cos £, 4 sin £); 4 and 4£; 4 cos £ and —4 sin £ 

5 3£; (cos 3£, sin 3£); —3 sin 3£ and 3 cos 3£ 7 x = cos£; y/2/2 ; —y/2/2 

11 Clockwise starting at (1,0) 13 Speed ^ 15 Area 2 17 Area 0 


9 2tt/3; 1; 2tt 
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19 4 from speed, 4 from angle 21 \ from radius times 4 from angle gives 1 in velocity 
23 Slope average (1 — ^)/(tt/ 6) = 3 ( 2 ~^ = .256 25 Clockwise with radius 1 from (1,0), speed 3 

27 Clockwise with radius 5 from (0,5), speed 10 29 Counterclockwise with radius 1 from (cos 1, sin 1), speed 1 
31 Left and right from (1,0) to (-1,0), v = — sin t 33 Up and down between 2 and —2; start 2 sin 0, v = 2 cos(t-|-0) 
35 Up and down from (0, -2) to (0,2); t; = sin \t 37 x = cos y = sin §-£, speed v up = cos §g£ 

Section 1.5 A Review of Trigonometry (page 33) 

1 Connect corner to midpoint of opposite side, producing 30° angle 3 n 7 A area \r 2 0 
9 d = 1, distance around hexagon < distance around circle 11 T; T; F; F 
13 cos(2t + t) = cos 2t cos t — sin 2£ sin t = 4 cos 3 t — 3 cos £ 

15 | cos(s — t) + | cos(s + £); | cos(s — t) — | cos(s + t) 17 cos 0 = sec 5 = ±1 at 0 = n^r 

19 Use cos(| — s - t) = cos(| - s) cos t + sin(§ - s) sin t 23 0 — multiple of 2n 
25 6 = multiple of ir 27 No 0 29 <f> = \ 31 \OP\ = a, |OQ| = 6 


CHAPTER 2 DERIVATIVES 

Section 2.1 The Derivative of a Function (page 49) 


1 (b) and (c) 3 12 + 3fc; 13 + 3h; 3; 3 5 f{x) + 1 7 -6 9 2x + Ax + 1;2* + 1 

11 tTZt - I = tliTZty -» W 13 7; 9; corner 15,4 = 1, B = -1 17 F; F; T; F 

19 6 = B] m and M ; m or undefined 21 Average xz + xi — ► 2x\ 

25 §; no limit (one-sided limits 1, —1); 1; 1 if t ^ 0, — 1 if t = 0 27 /'( 3 ); /( 4) — /( 3 ) 


29 2x 4 (4x 3 ) = 8a; 7 


O-g dtl 1 1 

dx "" 2u “ 2^/x 


33 


M = -§; /'(2) doesn’t exist 35 2 /£ = 4u 3 ^ 


Section 2.2 Powers and Polynomials (page 56) 


—n— 1 


7 nz n 1 — nx 


19 Decreases for — 1 < x < | 


1 6x 6 ; 30x 4 ; /""" = 720 = 6! 3 2x + 7 5 1 + 2s + 3x 2 + 4x 3 

9H-z+§x 2 + §z 3 ll-i,(-i)+5 13 «-8/3.,-4/3 ; _l x -4/3 

15 3x 2 — l = 0atx=^j and 17 8 ft/sec; — 8 ft/sec; 0 

21 V? 281 5 10 10 5 1 adds to (l+l) 6 (z = A = 1) 

25 3a: 2 ; 2h is difference of z’s 27 ££ = 2z + Ai+ 3 i 2 + 3iAx+ (A i) 2 — ♦ Ix + Sx 2 — sum of separate derivatives 
29 7z 6 ; 7(z + l) 6 31 ±x 4 plus any cubic 33 x + §z 2 + §z 3 + ±z 4 + C 35 ^z 4 , j^z 5 

41 Alt — JLf £ _ da __c_ 

A- A-l. 


37 F; F; F; T; T 


31 jjX 4 plus any cubic 
39 ^ = .12 so ^ = §(.12); six cents 


45 t to </2t 47 Az 10 ; 7 rrJi n+1 ; divide by n + 1 = 0 


* ) ’ dx *' 


49 . 7913 , - 3 . 7913 , 1 . 618 , -. 618 ; 0 , 1 . 266 ,- 2.766 


Section 2.3 The Slope and the Tangent Line (page 63) 

1 = £riy- 6 = -3(z - 2); y - 6 = §(x - 2); y - 6 = -§(x - 2) 3 y + 1 = 3(z - 1); y = 3z - 4 

5 y = x; (3,3) 7 y - a 2 = (c + a)(z - a); y - a 2 = 2a(z - a) 9 y = §z 2 + 2;y - 7 = — §(z - 5) 

11 y = 1; z = | 13 y — § = — ^-(z — a);y = ^,z = 2o; 2 15 c = 4, tangent at z = 2 
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17 (—3, 19) and (|, ||) 19 c = 4, y = 3 — x tangent at x = 1 

21 (1 + ft) 3 ; 3ft + 3/i 2 4- ft 3 ; 3 4- 3ft 4- A 2 ; 3 23 Tangents parallel, same normal 

25 y — 2 ax — a 2 , Q = (0, —a 2 ); distance a 2 + angle of incidence = angle of reflection 
27 x = 2 p; focus hasy=f£=p 29y-^ = x+^;x = -^ = -\/2 

31 y — o 2 = — 2 a(x ~ o); y = o 2 + 2 ; a = ^ 33 (p-)(1000) = 10 at x = 10 hours 35 a = 2 

37 1.01004512; 1 + 10(.001) = 1.01 39 (2 + Ax) 3 - (8 + 6Ax) = 6(Ax) 2 + (Ax) 3 41 Xl = §; x 2 = 

43 T — 8 sec; f(T) = 96 meters 45 a > | meters/sec 2 

Section 2.4 The Derivative of the Sine and Cosine (page 70) 

1 (a) and (b) 3 0; 1; 5; | 5 sin(x + 2 tt); (sinft)/ft — ► 1; 2 tt 7 cos 2 0 « 1 — 0 2 + |0 4 ; |0 4 is small 

9 sin \6 » |0 11 |;4 13 PS = sin ft; area OP/2 = | sin ft < curved area |ft 

15 cosx = 1 — 4- 4 3 — * * 17 ^(cos(x 4- ft) — cos(x — ft)) = jr(— sin x sin ft) —► — sinx 

19 xf = cosx — sinx = 0 at x = j 4- nir 21 (tanft)/ft = sin ft/ft cos ft < — > 1 

23 Slope | cos \x = |,0, — no 25 y = 2 cos x + sinx; y" — -y 27 y — cos3x; y = | sin3x 
29 In degrees (sin ft)/ft — ► 27r/360 = .01745 31 2 sin x cos x + 2 cos x(— sin x) = 0 


Section 2.5 The Product and Quotient and Power Rules (page 77) 

1 2x 3 5 (x — 2)(x — 3) + (x — l)(x — 3) + (x — l)(x — 2) 

7 — x 2 sinx + 4xcos x 4 - 2 sinx 9 2x — 1 - - : n \ - 11 2>/x sin x cos x + |x -1 / 2 sin 2 x-f |(sin x)” 1 / 2 cos x 
13 4x 3 cosx— x 4 sinx-fcos 4 x— 4xcos 3 x sinx 15 |x 2 cos x+2xsin x 17 0 19 — |(x— 5)~ 5 / 3 4-§(5— x)” 5 / 3 (= 0?) 
21 3(sin x cos x) 2 (cos 2 x - sin 2 x) + 2 cos 2x 23 u'vwz 4 - v'uwz + w'uvz + z'uvw 25 — esc 2 x — sec 2 x 
27 V = = cos Bint A = 2(^ I +tcost+^) A # = 2(cos t — tsint 4 - — t+f ) 

29 lOt for t < 10, for t > 10 31 

33 u"v 4- 2uV 4- uv"; u'"v 4- 3u'V 4- 3uV' 4- v"‘ 35 § sin 2 t; § tan 2 1; |[(1 4- t) 3 / 2 - l] 

39 T; F; F; T; F 41 degree 2 n — 1/ degree 2n 43 u(t) = cost — tsint(t < j);v(t) = — |(t > |) 

45 y — 4- has ^ = 0 at x = 0 (no crash) and at x = — L (no dive). Then ^ 4- f) and 

d 7 y __ 6V 7 h( 2x i ■, \ 


Section 2.6 Limits 


(page 84) 


1 = 0, after N — 10; |f ,oo, no A; 1,0, after 5; 1.1111, ~, all n;\/2, 1, after 38; y/20 — 4, |, all n; 

|f|, e = 2.718 • • • , after N = 12. 3 (c) and (d) 

5 Outside any interval around zero there are only a finite number of a’s 7 | 9 /M~/t Q ) 11 1 


15 sin 1 


17 No limit 


21 Zero if /(x) is continuous at a 


25 .001, .0001, .005, .1 27 |/(x) - L\\ ^ 29 0; X = 100 33 4; oo; 7; 7 35 3; no limit; 0; 1 

37 if |r| < 1; no limit if |r| > 1 39 .0001; after N = 7 (or 8?) 41 | 

43 9;8i;a„ -8 = §(<!„_! -8) ->0 

45 a n — L < b n — L < c n — L so |6 n — L\ < e if \a n — L\ < e and \c n — L\ < e 
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Section 2.7 Continuous Functions (page 89) 

1 c = sin 1; no c 3 Any c; c = 0 5 c = 0 or 1; no c 7 c = 1; no c 9 no c; no c 

** c= 64> c= 64 13c = — l;c = — 1 15 c = 1; c = 1 17 c = -l^c = — 1 

19 c = 2, 1,0, —1, • • • ; same c 21 f[x) = 0 except at x = 1 23 y/x — 1 25 — 27 

29 One; two; two 31 No; yes; no S3 xf(x), (/(a:)) 2 , x, f{x), 2 (/(as) — re), /(a;) + 2x 35 F; F; F; T 
37 Step; f(x) = sin j with /(0) = 0 39 Yes; no; no; yes (/t(0) = 1) 

41 <7(|) = /(l) — /(l) = /( 0) — /(|) = —<7(0); zero is an intermediate value between <7(0) and j(|) 

43 f[x) — iis>0atx = 0 and < 0 at x = 1 

CHAPTER 3 APPLICATIONS OF THE DERIVATIVE 
Section 3.1 Linear Approximation (page 95) 

1 Y = * 3 Y = l + 2(x-|) 5 Y = 2n{x - 2 jt) 7 2 6 + 6 • 2 5 • .001 9 1 

11 1 - l(— .02) = 1.02 13 Error .000301 vs. § (.0001)6 15 .0001 - flO" 8 vs. |(.0001)(2) 

17 Error .59 vs. |(.01) (90) 19 £y/l - x = = -§ at x = 0 

21 = £ at u = 0, c + £ = e + £ 23 dV — 3(10) 2 (.l) 

25 A = 4?rr 2 , dA — Snr dr 27 V — nr 2 h y dV = 2nrh dr (plus nr 2 dh) 29 1 + \x 31 32nd root 

Section 3.2 Maximum and Minimum Problems (page 103) 

1 x = — 2 : abs min 3 x = — 1 : rel max, x = 0: abs min, x = 4 : abs max 

5 i = — 1 : abs max, x = 0, 1 : abs min, x = \ : rel max 7 x = — 3 : abs min, x = 0 : rel max, x — 1 : rel min 
9 x = 1, 9 : abs min, x = 5 : abs max 11 x = | : rel max, x = 1 : rel min, x = 0 : stationary (not min or max) 
13 x = 0, 1, 2, • • • : abs min, x = |, §,§,••• : abs max 15|x| < 1 : all min, x = — 3 abs max, x = 2 rel max 
17 x = 0 : rel min, x = | : abs max, x — 4 : abs min 

19 x = 0 : abs min, x = 7r : stationary (not min or max), x = 2 tt : abs max 

21 6 = 0 : rel min, tan 6 = — | (sin0 = | and cos 0 = — | abs max, sin 6 = — | and cos 6 = | abs min), 

6 = 2n : rel max 

23 h = 1(62" or 158 cm); cube 25 2\/fl6 gallons/mile, miles/gallon at v = 

27 (b) 6 = — = 67.5° 29 x = compare Example 7; | = \/3 

31 R(a;) - Cfo); P(*)- C ( a > ; M - profit 88 x =2^)! zero 35 x = 2 
37 V = x(6 - )(12 - 2x); x « 1.6 39 A = 7rr 2 + x 2 ,x = |(4 - 2x r); r min = 

41 max area 2500 vs 10 ^° Q = 3185 43 x = 2, y = 3 45 P(x) = 12 — x; thin rectangle up y axis 

47 h= y , r = | of cone volume 

49 r = 2 (h—r) > cylinder has no height, area 2i tR 2 from top and bottom (?) 

51 r = 2, h = 4 53 25 and 0 55 8 and —00 

57 y/r 2 + x 2 + y/q 2 + (a - a:) 2 ; £ = ^=p - = 0 when sin <* = sine 

59 y — x 2 = | 61 (1, —1), (^, — J) 63 m = 1 gives nearest line 65 m = | 67 equal; x = | 

69 ^x 2 71 TVue (use sign change of /") 

73 Radius R y swim 2Rcos0, run 2 R0 y time 2iZ ^ osg + ^;max when sin0 = j^ y min all run 
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Section 3,3 Second Derivatives: Bending and Acceleration (page 110) 


3 y = — 1 — x 2 ; no • • • 5 False 7 True 9 True (/' has 8 zeros, /" has 7) 

11 x = 3 is min: /"(3) = 2 13 x = 0 not max or min; x = | is min: /"( |) = 81 

15 x = 2f is max: /"(^f) = -yft\x = ^ is min: /"(^f) = V5 
17 Concave down for x > | (inflection point) 

19 x = 3 is max: /"(3) = —4; x = 2, 4 are min but f n = 0 21 /(Ax) = /(—Ax) 23 1 + x — “ 

25 1 — — 27 1 — ~x — ~x 2 29 Error ±/" (x) Ax 31 Error OAx + §/"'(x)( Ax) 2 

37 = 1.010101; yj = .90909 39 Inflection 41 18 vs. 17 43 Concave up; below 

Section 3.4 Graphs (page 119) 

1 120; 150; ~ 3 Odd; x = 0, y = x 5 Even; x = l,x=— l,y = 0 7 Even; y = 1 9 Even 

11 Even; x = 1, x = —1, y = 0 13 x = 0, x = — 1, y = 0 15 x = 1, y — 1 17 Odd 19 

21 x + yzj 23 \/x 2 + 1 25 Of the same degree 27 Have degree P < degree Q\ none 

29 x = 1 and y = 3x + C if / is a polynomial; but /(x) = (x — l) 1 / 3 + 3x has no asymptote x = 1 
31 (x — 3) 2 39 x = \/2, x = — \/2, y = x 41 Y — 100 sin 45 c = 3, d = 10; c = 4, d — 20 

47 x* = \/5 = 2.236 49 y — x — 2\Y = X;y = 2x 51 x max = .281,x m i n = 6.339; Xi n fj = 4.724 

53 x m i n = .393, x ma x ^ 1*53, x m i n = 3.33;xj n fl — .896,2.604 
55 x m j n — *7398, x max — - .8135;xj n fl — .04738; Xbi OWU p — i2.38 57 8 digits 


Section 3.5 Parabolas, Ellipses, and Hyperbolas (page 128) 


1 dy/dx = 0 at £ 3 V = (1, -4), F = (1, -3.75) 5 V = (0,0), F = (0,-1) 7 F = ( 1,1) 

9 V = (0,±3);F = (0,±\/8) 11 V = (0,±l);F = (0,±^/|) 13 Two lines, a = b = c = 0; V" = F = (0,0) 

15 y = 5x 2 — 4x 17 y + p = v/x 2 + {y - p) 2 — 4 py = x 2 ; F = (0, ^),y = 

19 x — ay 2 with a > 0; y = ; y = —ax 2 + ax with a > 0 

21 ^ + j / 2 = l;i^ + (y-l) 2 = l 23 g + £ = 1; ^ ^ = l;x 2 + j, 2 = 25 

25 Circle, hyperbola, ellipse, parabola 27 ^ = - f ; y = -§x + 5 29 f ; ^ = |(| - f ) 

31 Circle; J3,l); 2; X = *=*, y = *=± 33 3x' 2 + y' 2 = 2 35 y 2 - |x 2 = 1; £ - *£ = 1; y 2 - x 2 = 5 

37 fg ~ 3 ? = 1 39 y 2 — 4y + 4, 2x 2 + 12x + 18; -14, (-3, 2), right-left 

41 F= (±^,0);y = ±| 43(x + y+l) 2 =0 

45 (a 2 — l)x 2 -h 2 abxy + (6 2 — l)y 2 + 2acx + 2bcy + c 2 = 0; 4(a 2 H- 6 2 — 1); if a 2 + b 2 < 1 then H 2 — 4 AC < 0 


Section 3.6 Iterations = F(x n ) (page 136) 

1 -.366; oo 3 1; 1 5§;±oo 7 -2; -2 

9 1 ~ 2 v/3 attracts, re pels; | attracts, 0 repels; 1 attracts, 0 repels; 1 attracts; | attracts, 0 repels; 

rh\/2 repel 

11 Negative 13 .900 15 .679 17 | a | < 1 19 Unstable \F f \ >1 21 x* = yf^; |a| < 1 
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23 $2000; $2000 25 x 0 , b/x 0 , x 0 , b/x 0 , ■ ■ • 27 F' = - 

29 F‘ = 1 — 2 cx = 1 — 4c at x* = 2; 0 < c < | succeeds 
31 F' = 1 — 9c(x — 2) 8 = 1 — 9c at a:* = 3;0 < c < § succeed, 

33 x n ^.i = x n 3^3 ! *n+i — x n cos x ft 2 35 x* = 4 if x 0 > 2. 

37 m = 1 + c at x* = 0, m = 1 — c at x* = 1 (converges if 0 < c < 2) 


27 F‘ = -^x -3 / 2 = -§ 
ds 

succeeds 


if so > 2.5; x* = 1 if xo < 2.5 

39 0 43 F' = 1 at x* = 0 


Section 3.7 Newton’s Method and Chaos (page 145) 

1 x n+1 = x n — + 3^r 5 x x = x 0 ; xi is not defined (oo) 7 x* = 1 or 5 from x 0 < 3, x 0 > 3 

11 x 0 < \ to x* = 0; x 0 > | to x* = 1 21 x n+ x = x n - 23 x 4 = cotn — oo; x 3 = cot ^ = cot j 

25 jt is not a fraction 27 = \x 2 n + § + ^ = 4(tf ”»_ x ) 29 16x - 80z 2 + 128s 3 - 64* 4 ; 4; 2 

31 |xo| <1 33 Ax = 1, one-step convergence for quadratics 35 X2 = 1.86 

37 1.75 < x* < 2.5; 1.75 < x* < 2.125 39 8; 3 < x* < 4 41 Increases by 1; doubles for Newton 

45 xx = xq + cot xo = xo + 7T gives X2 = xx + cot xj = xx + ir 49 a = 2, Y's approach | 


Section 3.8 The Mean Value Theorem and l’Hopital’s Rule (page 152) 


1 c = \/| 3Noc 5 c = 1 7 Corner at | 9 Cusp at 0 


11 sec 2 x — tan 2 x = constant 

l-sina 
1 -He 08 a 


13 6 


15 -2 


19 n 


21-1 


23 Not g 


17-1 

25 —1 27 1; has no limit 29 /'(c) = c = y/l 

31 0 = x* - x n+1 + ^§^(x* - x n ) 2 gives Af « 33 /'(0); singularity 35 | 371 


CHAPTER 4 DERIVATIVES BY THE CHAIN RULE 
Section 4.1 The Chain Rule (page 158) 

1 z = y 3 , y = x 2 — 3, z' = 6x(x 2 — 3) 2 3 z = cos y,y = x 3 ,z' = — 3x 2 sin x 3 

5 z = y/y,y = sin x,z' = cos x/2\/sin x 7z = tanj/ + (1/ tan x), y = 1 /x,x' = (^r)sec 2 (j) — (tan x) -2 sec 2 x 

9 x = cos y, y = x 2 + x + 1, z' = -(2x + 1) sin(x 2 + x + 1) 11 17 cos 17x 13 sin(cos x) sin x 

15 x 2 cos x + 2x sin x 17 (cos \fx + l)|(x + l) -1 / 2 19 |(l + sinx) _1 / 2 (cosx) 21cos( 

23 8x 7 = 2(x 2 ) 2 (2x 2 )(2x) 25 2(x + 1) + cos(x + it) = 2x + 2 — cos x 

27 (x 2 + l) 2 + 1; sin U from 0 to sin 1; U (sin x) is 1 and 0 with period 2ir; R from 0 to x; 72 (sin x) is half-waves. 
29 g[x) = x + 2, A(x) = x 2 + 2; k(x) = 3 31 /'(/(x))/'(x); no; (-l/(l/x) 2 )(-l/x 2 ) = 1 and /(/(x)) = x 

33 §(§x + 8) + 8; \x + 14; £ 35 f{g(x)) = x, flr(/(y)) = y 

37 /(ff(*)) = ih> 9{f{x)) = 1 - j » /(/(*)) = x = ff(fif(x)), g{f{g{x))) = ^ = /(</(/(x))) 

39 f{y) = V — 1| g{x) = 1 43 2cos(x 2 + 1) — 4x 2 sin(x 2 + 1); — (x 2 — 1) -3 ^ 2 ; — (cos -^/x)/4x + (sin v/x)/4 x 3 / 2 

45 /'(u(t))u'(t) 47 (cos 2 u(x) — sin 2 u(x))^j 49 2xu(x) + x 2 ^ 51 l/4\/l — y/1 — x y/l — x 

53 df/dt 55f'(g[x))g'{x) = 4{x 3 ) 3 3x 2 = 12x n 57 3600; §; 18 59 3; | 
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Section 4.2 Implicit Differentiation and Related Rates (page 163) 

1 -x n - 1 /y n ~ 1 #& = 1 = 7 (y 2 — 2xy)/(x 2 — 2xy) or 1 9 ^ or ^ 

11 First second ^ = - 13 Faster, faster 15 2zz' = 2yy ' —+ z' — Jy # = y' sin0 

17 sec 2 0 = ^ 19 500£;500^/T+l£ja 21 g = -f ; f - -2>/3; oo then 0 

23 ^ — 7rr 2 /i; ^ ~ ^ in/sec 25 = |a6sin0, —■ = 7 27 1.6 m/sec; 9 m/sec; 12.8 m/sec 

29 ~5 31 d« = Y a - To cos = Tory -so cos Ssm %) 

Section 4.3 Inverse Functions and Their Derivatives (page 170) 

1 x = 3 x = -y/y + 1 (x unrestricted — ► no inverse) 5 x = 7 x = (1 + y) 1 / 3 

9 (x unrestricted — > no inverse) 11 y = 13 2 < / -1 (x) <3 15 / goes up and down 

17 f(x)g(x) and jfa 19 m ^ 0; m > 0; |m| > 1 21 ^ = 5x 4 , ^ = |y _4/5 

23 f x = 3x 2 ; % = |(1 + y)~ 2 / 3 25 g = g = ftrfjr 27 y; |y 2 + C 

29 /(y(x)) = -l/3x 3 ;y _1 (y) = yl;y(y -1 (x)) = x 39 2/\/3 41 1/6 cos 9 

43 Decreasing; J = 3^ < 0 45 F; T; F 47 g(x) = x m , f(y) = y n ,x = (z 1 ' n ) 1 / m 

49 y(x) = x 3 , /(y) = y + 6, x = (z - 6) 1 / 3 51 y(x) = 10 s , /(y) = log y, x = log(10 ! ') = y 

53 y = x 3 , y" = 6x, (Px/dy 2 = — | y~ 5 / 3 ; m/ sec 2 , sec /m 2 55 p = ^ — 1;0 < y < 1 

57 max = G = §y 4 / 3 , G' = Ly 1 / 3 59 y 2 / 100 


Section 4.4 Inverses of Trigonometric Functions (page 175) 


10, |,0 3 |,0,| 5 7T is outside [— f, f] 7 y = — \/3/2 and \/3/2 

9 sinx = \A - y 2 ; \A - y 2 and 1 11 id cosx = 1 -*■ = — yr : 


13 y = 0: 1,-1, l;y= 1:0,0, | 15 F; F; T; T; F; F 17 £ = 19^=3 


ai d; 2 sin 1 a; 

CiX yj i — 


25 — 1 

dy |y+!| \fy*+2y 


27 u = 1 so % 


31 secs = \/y 2 + 1 


33 A,i ,k 35 -y/\/l-JI 2 3Tisec|tanf 




47 u = 4 sin 1 y 49 jt 51 — x/4 




41 41L — .1 

d* l+z 3 


CHAPTER 5 INTEGRALS 

Section 5.1 The Idea of the Integral (page 181) 

11,3,7,15,127 S-i-i-f = !-l 5 }j - f 0 = yfy 7 3x for x < 7, 7x - 4 for x > 1 

9 S2^II> A> 52\/^ 11 Lower by 2 13 Up, down; rectangle 15 \fx + Ax - y/x\ Ax; -y/5 

17 6; 18; triangle 19 18 rectangles 21 6x — |x 2 — 10; 6 — x 23 || 25 x 2 ;x 2 ; |x 3 
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Section 5.2 Antiderivatives (page 186) 

lx & + §x 6 ;| 32-y/z;2 5 |x 4 / 3 (l + 2 x / 3 ); Kl + 2 1 / 3 ) 7 — 2cosx — | cos2i; | — 2cos 1— |cos2 

9 xsinx + cosx;sinl + cosl — 1 11 lsin 2 x; |sin 2 1 13/ = C;0 15 f(b) — f(a); fy — / 2 

17 8+^ 19f(l + V5);f(3 + >/3);2 2l|;^;oo 23 /(x) = 2y/i 25 §, below -1; 1, | 

27 Increase - decrease; increase - decrease - increase 

29 Area under B — area under D\ time when B = D\ time when B — D is largest 33 T; F; F; T; F 
Section 5.3 Summation Versus Integration (page 194) 

50 100 4 7 

1§§;16 3 127; 2 n+1 - 1 5 ^2j = 2550; £(2 j- 1) = 10,000; £(-l)*+ 7* = - 

j= 1 t=l k=l 

n n - 

7 2a fc ® fc ;^sin^- 9 5.18738; 7.48547 11 2(a? + 6?) 13 2” - 1; £ - { 15 F; T 

k = 0 ]=1 ^ 

17 ^ + C-, fg — f 8 — fi + f 0 19 /i = 1; n 2 + (2n + l) = (n + l) 2 

21 a + 6 + c = 1, 2a + 46 + 8c = 5, 3a + 96 + 27c = 14; sum of squares 23 5400 = 80200; #400 = .0025 = - 

25 5 ioo,i /3 « 350,^100,1/3 « .00587; S 10 o , 3 = 25502500, 25ioo , 3 = .0201 27 t>i and v 2 have the same sign 

29 Vi = 9, v 2 = 12, EE = 21 31 At N = 1 9 2 N ~ 2 is not 1 33 0; £(t>i + • • • + v n ) 

35 Axf>(j Ax) 37 /( 1) - /(0) = / 0 X f x dx 
i= 1 

Section 5.4 Indefinite Integrals and Substitutions (page 200) 

1 |(2 + x) 3 / 2 + C 3 (x + l) n+1 /(n + 1) + C[n ^ — 1) S^+lf + C 7 -A cos *x + C 

9 — | cos 4 2x + C 11 sin -1 1 + C 13 |(l + 1 2 ) 3 / 2 - (1 + i 2 ) 1 / 2 + C 15 2 y/x + x + C 

17 sec x + C 19 — cos x + C 21§x 3 + fx 3 / 2 23 -|(l-2x) 3 / 2 25 y = v^x 

27 |x 2 29 a sin x + 6 cos x 31 £x 5 / 2 33 F; F; F; F 35 /(x — l);2/(f) 

37 x — tan -1 x 39 / £du 41 4.9t 2 + <7,* + C 2 43 /(« + 3); f(t) + 3t;3/(t); 1/(3*) 

Section 5.5 The Definite Integral (page 205) 

1 C = — /(2) 3C = /(3) 5 /(t) is wrong 7 C = 0 9 C = /(-a) - /(— 6) 

11 u = x 2 + 1;/“ u 10 ^ = 13 u = tanx jJ u du = § 

15 u = sec x;f/*udu = § (same as 13) 17 u = A,x = A, dx = =$■•, ft 12 =£*■ 

19 5 = 1(1 + l) 4 + |(1 + l) 4 ;s = |(0) + 1(J + l) 4 

21 5 = |((|) 3 + l 3 + (|) 3 + 2 3 ]; s = |[ 0 3 + (|) 3 + l 3 + (|) 3 ] 

23 S = 4 [( ig) 4 + (f) 4 + (if) 4 + 2 4 ] 25 Last rectangle minus first rectangle 

27 S = .07 since 7 intervals have points where W — 1. The integral of W(x) exists and equals zero. 

29 M is increasing so Problem 25 gives S — s = Ax(l — 0); area from graph up to y = 1 is + - = 

+ ) = area un <ier graph is 

31 f{x) = 3 + /* v(x)dx; f{x) = /* v{x )dx 33 T;F;T;F;T;F;T 
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Section 5.6 Properties of the Integral and Average Value (page 212) 

1 v = | x 4 dx = 1 equals c 4 at c = i(l) 1 / 4 8 t) = 1 cos 2 a; da; = | equals cos 2 c at c = J and ~ 

5 0 = /i 2 ^ 2 e< l ua l s afc c = \/2 7 J 3 5 v(x)dx 9 False, take t>(x) < 0 

11 True; | J^ 1 v(x)dx + § * § fi v [x)dx = | /q v(x)c!x 13 False; when v(x) = x 2 the function x 2 — - is even 
15 False; take v(x) = 1; factor | is missing 17 v = J* v(x)dx 19 0 and J 
21 v(x) = Cx 2 ; v(x) = (7. This is “constant elasticity” in economics (Section 2.2) 23 V — ► 0; V — ► 1 

25 - / q 2 ( a — x)dx = a + 1 if a > 2; | / 2 \a — x\dx = | area = — a-|-lifa<2; distance = absolute value 

27 Small interval where y = sinfl has probability ~; the average y is f* 8in ^ dB — J 

29 Area under cos 5 is 1. Rectangle O<0<^,O<y<l has area Chance of falling across a crack is ^ \ • 

31 gT) * * * * i 10-5 33 y f Q 220 cos ^j^dt = j • 220 • sin = V^ e 

35 Any v(x) = t> e ven(s) + t>odd(z); {x + l) 3 = (3x 2 + 1) 4- (re 3 + 3x); ^ 

37 16 per class; £; E(x) = && = 39 F; F; T; T 

41 'M - { %-lf « 1 2 } + Ci « 5 > - «°» - s + i- $ 

Section 5.7 The Fundamental Theorem and Its Applications (page 219) 

1 cos 2 x 3 0 5 (x 2 ) 3 ( 2 x) = 2x 7 7 v(x -hi) — u(x) 9 81 " a x — ^ J* sin 2 1 dt 

11 f* v(u)du 13 0 15 2sinx 2 17 u(x)v(x) 19 sin” 1 (sin x) cos x = x cos x 

21 F; F; F; T 23 Taking derivatives u(x) = (xcos x)' = cos x — xsin x 

25 Taking derivatives — v(— x)(— 1) = v(x) so v is even 27 F; T; T; F 
29 Si v{t)dt = J* v(t)dt - / 0 X v(t)dt = - 1+2 ( in revised printing) 

31 V = s 3 ; A = 3s 2 ; half of hollow cube; AF « 3s 2 dS; 3s 2 (which is A) 

33 dH/dr = 27r 2 r 3 35 Wedge has length r « height of triangle; /J^ 2 |r 2 d0 = 

07 r _ 1 . dfl - f"/ 4 d6 _ tanfl lW 4 _ 1 

01 r “ cos0> 2cos 3 0> Jo 2 cos 3 0 2 JO “2 

39 x = y 2 ; J 2 y 2 dy = ^-]§ = |; vertical strips have length 2 — y/x 

41 Length \/2a; width ^j|; f* ada = | 43 The differences of the sums /y = ui + u 2 H 1 -vy are fy — fy-x = vy 

45 No, /* o(i)dt = £(x) - £( 0) and /„(/“ o(t)dt)dx = /(l) - /( 0) - £(0) 

Section 5.8 Numerical Integration (page 226) 

1 §Az(t> 0 - «u) 3 1, .5625, .3025; 0, .0625, .2025 5 L s « .1427, T 8 » .2052, S 8 « .2000 

7 p = 2 : for y = x 2 , \ 0 2 + \ • (|) 2 + \ ■ l 2 ^ § 9 For y = z 2 , error |(Az) 2 from | - §, y[ = 2Az 

13 8 intervals give + £] = 1^4 < -001 15 f"(c ) is y'{c) 17 00; .683, .749, .772 - f 

19 A + B + C = 1, + <7=1, lJ5 + (7 = 1; Simpson 

21 y = 1 and z on [0, l] : L n = 1 and | - R n — 1 and | + so only + ~R n gives 1 and | 

23 T 10 « 500, 000, 000; T100 « 50,000,000;25,000jt 

25 a = 4, 6 = 2, c = 1; / 0 X (4z 2 + 2z + l)dz = ^p; Simpson fits parabola 27 c = 
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CHAPTER 6 EXPONENTIALS AND LOGARITHMS 
Section 6.1 An Overview (page 234) 


1 5;-5;-l; |; 2 5 1; - 10; 80; 1; 4; - 1 7nlog 6 s 9 f ; ^ 13 10 5 

15 0; Isf = 10 7 /o; 8.3 + log 10 4 17 4 = 7,6 = 2.5 19 4 = 4, *=1.5 

21 i;^: lo g2 23 y — 1 = cx; y — 10 = c[x — l) 25 - l)/(-A) = (10^ - l)/(-A) 

27 y" = c 2 b x \ x n = — l/cy 2 29 Logarithm 


Section 6.2 The Exponential e x 


(page 241) 


1 49e 7 * 3 Se 8x 

17 e ain x cos x + e x cos e x 


5 3* In 3 7 (|) x In 


9 


“* 3 - ( 1 -f-e*) 3 

19 .1246, .0135, .0014 are close to — 


11 2 


13 xe x 


15 


21 


l . l 


( e * +c -x)2 


23 Y(h) = 1 + ^; Y{ 1) = (1 + ^) 10 = 2.59 25 (1 + J)* < e < e x < e 3x / 2 < e 2x < 10 x < 


3 1 7 1 

27 V + V 


10’ x — v A 1 io 
2Q x + -2L + -31 

X ^ in 2 ' In 3 


o-j (2e) J | cy x 
31 ln(2e) + U 


33 V 


e~ x 

2 


35 2e*/ 2 +^2“ 37 e~ x drops faster at x = 0 (slope —1); meet at x = 1; /e~ x < e~ 9 /e~ 3 < for x > 3 

39 y — e a = e a (x — a); need —e a — —ae a or a = 1 
41 %/ = x x (In x + 1) = 0 at x min = j;y" = x*[(lnx + l) 2 + £] > 0 

43 -jz {e~ x y) — e~ x ^ — e“ x y = 0 so e~ x y — Constant or y = Ce x 


dx ’ 

ir e 3x |1 „ - 3 
45 2 JO — 


e — 1 
2 


47 


H - 


2-* 


In 2 J~1 “ In 2 ~ 2 In 2 49 e *]o° ~ 1 c JO ~ ^ v In 2 

55 f M**dx = + C; f(e u ) 2 £dx= i e 2 “ + C 57 yy' = 1 gives ±y 2 = x + C or y = >/2x + 2C 


51 e 1 " 1 "*], 1 


= e — e 


53 


K = o 


59 ^ = (n — x)x n 1 /e x <0 for x > n; F(2x) < 


constant 


0 


61 


6! 


s/\2n 


« 117; (-) 6 « 116; 7 digits 


Section 6.3 Growth and Decay in Science and Economics 


(page 250) 


1 1 2 + J/o 
13 rr^ln(i) 


3 J/0 e 




23 c = 


Tt t ** t 

15c=ta2;t= Jln(|) 19 e c = 3 so y 0 = e - 3c 1000 = ^ 

c = k Mifis); p( 10 ) = 1013 * 10c = i° 13 \/S = V(ioi3)(so) 

= t/o — at reaches yi at t = ; then t/ = Ae~ at / yi 27 F; F; T; 


21 p = 1013 e ch ; 50 = 1013 e 20c , , 20 - V1013 „ r , 

’ = (|) 3 = | 25 y = yo - at reaches yi at t = ^~ J / L ; then y 


294=|,5 = -i 31e*-l 33 1 - e _t 35 6; 6 + 4e~ 2t ;6 - 6e~ 2t , 6 + 4e _2t ; 6 

37 4;4-i;4 39 ye"*; y(t) = te* 41 4 = 1, B = -1,C = -1 43 e 0725 > .075 45s(e-l); 




47 (1.02)(1.03) — ► 5.06%; 5% by Problem 27 49 20,000 e^ 20 T H 05 ) = 34,400 (it grows for 20 — T years) 

51 a = — cy 0 e ct /(e ct - 1) = -(.Ol)(lOOO)e' 60 /(e- 60 - l) 53 y 0 = ^(1 - e - 005 < 48 )) 

55 e 4c = 1.20 so c = - n ^ 20 5 7 24e 36 5 =? 59 To — oo; constant; to + oo 

61 *£ = 60 C y ; % = 60(-y + 5); still Yoo = 5 

63 y = 60e c< + 20, 60 = 60e 12c + 20, c = ^ ln(fg); 100 = 60e ct + 20 at t = ^ ln(|§) 65 0 
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Section 6.4 Logarithms (page 258) 

1 X 3 SflEfe* 51n * 7 ^f =cota: 9 l 11 3 lnt + C 13 In | 

15 | In 5 17 — ln(ln2) 19 ln(sin x) + C 21 ln(cos 3a:) + C 23 ±(ln a:) 3 + C 

27 1ny=iln( 2 2 +l);^ = ^ r 29 g = e «‘“ « cos x 

31 dt. = 33 lny = eMnx; ^ = ye^lnxd- ±) 35 In y = -1 so y = ^ = 0 37 0 

39-1 41 sec x 47 .1; .095; .095310179 49 -.01; -.01005; -.010050335 

51 l’Hopital: 1 53 y-L 55 3 — 2 In 2 57 Rectangular area | + ' ' ' + k < fi ir = ^ nrl 

59 Maximum at e 61 0 63 log 10 e or 65 1 — x; 1 + xln2 

67 fraction is y = 1 when ln(T + 2) — In 2 = 1 or T = 2e — 2 69 y' = y = 1 ~ never equals 1 

71 lnp= xln2;LD 2 a: ln2;ED p = e xln2 ,p' = ln2 e xln2 

75 2 4 = 4 2 ;ylnx = xlny — + decreases after x = e, and the only integers before e are 1 and 2. 


Section 6.5 Separable Equations Including the Logistic Equation (page 266) 


17e‘-5 3 (|x 2 + l) 1 / 3 5 x 

15 z = 1 + e -t , y is in 13 17 ct = In 3, ct = In 9 


a l — C 08 t 


9 (f + v/5o) 2 


11 yoo =0;t = £ 


19 6 = 10 9 , c = 13 • 10 3 ; = 13 • 10 6 ; at y = ^ (10) gives In - = ct + In — so t = 1900 + = 2091 


I r = £ 

c> 7 6 


21 y 2 dips down and up (a valley) 23 sc — 1 = s6r so s 
25 y = ’ T = 0 27 Dividing cy by y + if > 1 slows down y' 

. > o -£S - 

7 ^ U > y+JC 


on di? _ 

dy (y+tf) 


31 % = 


dT 


y+ 1 


; multiply e y ! K = e ^/^^/^(Ste.) by if and take the Ath power to reach (19) 


33 y 7 = (3 - y) 2 ; 3 ^j=t+|;y = 2att = 


35 Ae* H- D — Ae t + £? + D£ + £ — i > D — — 1,B — — 1; yo = A + H gives A = 1 

37 y — ► 1 from yo > 0, y — ► — oo from yo < 0; y — * 1 from yo > 0, y — ► — 1 from yo < 0 

39 f = / dt — ► ln(sin y) = £ + C = £ + In Then sin y = stops at 1 when £ = In 2 


Section 6.6 Powers Instead of Exponentials (page 276) 

ll-x+^-^-b - 31 ±x+^±4 + “* 5 1050.62; 1050.95; 1051.25 

7 1 + n(“0 + ( ~~ ) 2 —►1—1+1 9 square of (1 + £) n ; set N — 2n 

11 Increases; ln(l + ~) — >0 13 y(3) = 8 15 y (t) = 4(3 t ) 17 y(£) = £ 

19 y(t) = |(3 t - 1) 21 s(^ff) if l;s£ if a= 1 23 y 0 = 6 25 y 0 = 3 

27 -2, -10, -26 — -oo;-5,-^,-^ -> -12 29 P = ^ 31 10.38% 33 lOO(l.l) 20 = $673 

35 = 965 37 ^P(l.l 20 - 1) = 57,275 39 Voo = 1500 41 2;(||) 52 = 2.69;e 

43 1.0142 12 = 1.184 -> Visa charges 18.4% 

Section 6.7 Hyperbolic Functions (page 280) 

1 e x , e ~ x , - — | sinh 2x 7 sinh nx 9 3 sinh(3x + 1) 11 = — x sec ^ x 

13 4 cosh xsinh x 15 -j =~=^ (sech\/ x 2 + l) 2 17 6 sinh 6 x cosh x 

19 cosh(lnx) = |(^+ ^) = 1 at x = 1 21 i3> “ if’ 23 0, 0, 1, oo, oo 

25 lsinh(2x+l) 27 | cosh 3 x 29 ln(l + cosh x) 31 e x 
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33 J y dx = /sinh t( sinh t dt)\A = -sinh tcosht — f y dx\A! = A = 0 at t = 0 so A = 

41 e y = re + Vs 2 + 1, y — ln[x + V x 2 + l] 47 |ln||^| 49 sinh” 1 x (see 41) 51 — sech^x 

53 | In 3; oo 55 y(rc) = I cosh err; - cosh cL — ~ 

57 y 7 ' = y — 3V 2 ; §(y') 2 = \y 2 ~ y 3 is satisfied by y = §sech 2 § 

CHAPTER 7 TECHNIQUES OF INTEGRATION 
Section 7.1 Integration by Parts (page 287) 


1 — x cos x + sin x H- C 3 - xe~ x — e~ x + C 5 x 2 sin x + 2rr cos rr — 2 sin x + C 
7 |(2x -j- 1) ln(2rr 4- 1) — x + C 9 Ie*(sinx — cos x) -j- C 11 a i ^ b 7 (a sin bx — b cos fcx) + C 
13 |(sin(lnx) — cos(lnx)) + C 15 x(ln x) 2 — 2xlnx + 2x -J -C 17 xsin"* 1 i + Vl-x 2 + C 

19 |(x 2 + 1) tan -1 x — | H- (7 21 x 3 sin x -b 3x 2 cosx — 6xsin x — 6 cos x + (7 

23 e x (x 3 — 3x 2 + 6x — 6) + C 25 x tan x + ln(cos x) + C 27—1 29— |e“ 2 -|-^ 31—2 

33 3 In 10 — 6 + 2 tan -1 3 35 u = x n , v = e x 37 u = x n , u = sin x 39 u = (lnx) n , v = x 

41 u = x sin x, v = e x — ► / e 35 sin x dx in 9 and — J x cos x e*dx. Then u = — x cos x, i> = e* — ► / cos x dx 

in 10 and — / x sin x e x dx (move to left side) : y (x sin x — x cos x + cos x) . Also try u = xe* , v = — cos x. 

43 j* |usin u du — |(sin u — ucos u) = |(sin x 2 — x 2 cos x 2 ); odd 

45 3- step function; 3e** step function 49 0;x6(x)] — / 8(x)dx — — 1; v(x)<?(x)] - J v(x)6(x)dx 
51 «(*) = /* f{x)dx 

53 u(z) = ± /* v(x)dx; £(§ - f for x < §, ±(2x - x 2 - \) for * > §; § for x < §, ^ for x > 

55 u = x 2 , v = — cos x — ► —x 2 cos x + (2x) sin x — / 2 sin x dx 57 Compare 23 

59 uu/]o — Jq u'w* — u'w]q H- fo u'w' = \uw' — u'w]q 

61 No mistake: e* cosh x — e x sin/ix = 1 is part of the constant C 


Section 7.2 Trigonometric Integrals (page 293) 


1 J(1 — cos 2 x) sin x dx — — cos x + | cos 3 x + C 
5 /(I — u 2 ) 2 u 2 (— du) = — | cos 3 x 4- | cos 5 x — j cos 7 x -f C 
9 | / sin 3 2x dx = — (— cos 2x + | cos 3 2x) + C 


3 | sin 2 x + C 


11 2 


7 | (sin x) 3 / 2 + C 
13 I(^ + M) + C 


15 x + C 


17 - cos 5 xsinx + § / cos 4 x dx; use equation (5) 

- nzi r*/ 2 .^n-2 f */ 2 

»0 

sin n_1 x cos x + (n — 1) J sin 

n-l , /„ r _:_n-2 


19 /o" 72 

21 / = — sin” 1 zcosx + (n — 1) / sin” -2 xcos 2 x dx = 


cos” X dx = V- Jo cos " 2xdx = -= !k ^ 1 ^=2 -2 Io'‘ dx 


n — 2 


x dx — (n — 1)7. 


So nl = — sin n 1 x cos x + (n - 1) / sin n ^ x dx. 


23 0,+, 0,0,0, — 25 — | cos 3 x, 0 27 


31 — | cos x, 0 33 JJ 7 x sin x dx = A sin 2 x dx — ► A = 2 

37 p is even 39 p — q is even 41 sec x + C 


. cos 200s \ oq 1 ( sin 2Q0g . sin 2x 

• 2 l 200 ' 


200 


).o 


35 Sum = zero = ~ (left + right) 


47 | tan 3 x — tan x + x -f C 


43 | tan 3 x + C 


45 ^ sec 3 x + C 


55 4\/2 


57 

n/3 


49 In | sin x\ + C 51 ~ + C 53 A = \/2, -\/^sin(x -j- J) 


59 1 — cos s + sin x i /nr 
1 + cos x+sin x ' 


2 cos 3 x 

61 p and q are 10 and 1 
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Section 7.3 Trigonometric Substitutions (page 299) 


1 x = 2 sin 0; f dO = sin 1 § + C 3 x = 2 sin 0; J 4 cos 2 0 dO — 2 sin 1 | + xy^l - y + C 


5 x — sin 0; / sin 2 0 dO = f; sm 


_ l , 
2 ' 


_1 x - ~x\/l- x 2 4- (7 


7 x = tan 0; / cos 2 0 d0 = ~ tan -1 a: + 1 1H f x2 -f <7 
9 i = 5 sec 0; J 5(sec 2 0 — l)d0 = y/x 2 — 25 — 5 sec -1 | + (7 
11 i = sec 0; / cos 0 d0 = 4- <7 13 x = tan 0; / cos 0 dO = 4- (7 

CQ8 0 <jfl I r* — x y 

9 sin 2 0 


15 * = 3 sec «; / = 53k + <? = + C 


§\/x' 1 -9 


17 X = sec 0; f sec 3 0 d0 = | sec 0 tan 0 + | ln(sec 0 4- tan 0) 4- (7 = |x\/z 2 - 1 4- § ln(x + y/x 2 — 1) 4- (7 
19 x =Une;f^f = -^ + C =^f±i + C 


21 f d ° = ~0 4- <7 = — cos 1 x H- (7; with C — | this is sin 


-l 


tan 0 sec a 0 d$ 


sec 2 0 


= — ln(cos 0) 4- (7 = In Vx^~TT 4- C which is | ln(x 2 4- l) 4- C 


27 sin 


-i 


~il — ZL _ ZL 
X J.5 “ o * 


23/ 

25 x = a sin 0; /_^ 2 a2 cos2 ^ d0 = y 1 = area °f semicircle 
29 Like Example 6: x = sin 0 with 0 = f when x = oo, 0 == ^ when x = 2, /J^ 2 y = — 1 4- 

o i q i„ n f*/ 2 3 sec 2 0 d0 0 l 77 / 2 tt q o f J n ' +1 +x n, ~ 1 j„ f n-lj_ x n 

31 x — 3 tan0, J_ n j 2 gse^e ” 3 1-*/ 2 ~ 3 ^3 J s 2 + i dx — J x dx — n 

35 X = sec 0; 4- e _/ ) = |(x 4- y/x 2 -l 4- a . + y a . 2 _ 1 ) = §(® 4- >A 2 — 1 4- x - y/x 2 - 1) = x 

37 x = cosh 0; / d0 = cosh -1 x4*C 

39 x = cosh 0; f sinh 2 0 d0 = |(sinh0 cosh 0 — 0) 4- C = |x\/x 2 — 1 — | ln(x 4- y/x 2 — 1) 4- C 

41 x = tanh 0; / d0 = tanh -1 x-\- C 43 (x — 2) 2 4-4 45 (x — 3) 2 — 9 47 (x 4- l) 2 

49 u = x-2,/y 


2 

>/3 


du 


tan 


- 1 | = |tan-i(^ 2 )+C;« = x-3 ) /^ = Iln^f = iln^ + <7; 


i+r + ci 


2 +4 2 

u = x4-1,/^t = — 

> J u J u 

51 u = x 4- 6; / u 2_^>_|_ c uses u = a sec 0 if b 2 > c, u = a tan 0 if b 2 < c, equals — ~ if 6 2 = c 

53 cos0 is negative (—y/l — x 2 ) from - to y ; then f* — f ± 1 4- /° x y/l — x 2 dx = ir = area of unit circle 
55 Divide y by 4, multiply dx by 4, same / y dx 

57 No sin -1 x for x > l; the square root is imaginary. All correct with complex numbers. 


Section 7.4 Partial Fractions (page 304) 


1 A = —l, B — l, — lnx 4- ln(x - 1) 4- C 3 


2x 


7 ! + i 


15 


9 3 


11 


1 -u 1 

4_ — 


2 , 1Z 2 

x+l ' x+2 


x — 1 


lO-i/e + l/l _I/2 ,1/6 
Ad X • x-1 x — 2 • X — 3 


X-l-1 


T + A + ^f;^ = -i,5=i,C'=0,i) = 


17 Coefficients of y : 0 = —Ab 4- B\ match constants 1 = Ac ; A = B = ^ 

19 A = 1, then B = 2 and C = 1;/ ^ = 

ln(x — 1) 4- ln(x 2 4- x 4- 1) = ln(x — l)(x 2 4- x 4- 1) = ln(x 3 — 1) 

21u = e*;/^=/^-/f = ln(^)+C = ln(^i)+C 

23 « = cos 0; / ^ = -|/4^-|/^I = | ln ( 1 - w )-| M 1 + «) = \ ln + C - We can reach 

2 ln = ln = ln ( csc 9 - cot °) ° r a different way | ln = ln IT^e = “ ln 4^ = 

— In (esc 0 4- cot 0) 

25 u = e x ; du = e*dx - u ds; / + f ihk = _ 2 l n (l - e x ) + ln e* + C = -2 ln(l - e*) + x + C 
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27 x + 1 = u 2 , dx = 2u du ; / = /[2 ~ T+^]^ u = 2u — 2 ln(l + u) + C = 

2 y/x + 1 — 2 ln(l + \/a: + 1) + C 

29 Note Q(a) = 0. Then by definition of derivative. At a double root Q'{a) = 0. 


Section 7.5 Improper Integrals (page 309) 


13 = ^1 Z -2(1 -xyl*\l = 2 5 tan- 1 11 «]»,/*= f 7 |(lnx) 2 ]J = -oo 

9 xlnx — x]g = 0 11 ln(ln(lnx))]Jo 0 = oo 13 |(z + sinxcosx)]§° = oo 

15 ^r^]o° diverges for every p\ 17 Less than “f = 5 
19 Less than + f” ^ = tan" 1 *]J - ^]f = 5 + 2 

21 Less than e~ x dx = greater than — £ 

23 Less than f* e 2 dx + e f^° e~^ x ~^ 7 dx = e 2 + e e - “ du = e 2 + ^ 

25 /„ + J“ smiapte legs than 1 + /” d§ = 2 27 p! = p times (p - 1)!; 1 = 1 times 0! 

29 u = x,dv = xe~ xi dx : -x^-]? + / 0 °° = \y/ir 31 / 0 °° 1000e-“dt = -lO.OOOe- 1 *]^ = $10,000 

33 W = =^\% = S ^ = 2 mv o if «*> = 

ok f°° dx _ fOO —x In 2 j_ _ £^iloo __ 

35 Jo 2* ” Jo C aX ~~ -In 2 JO — In 2 

37 / 0 ^ 2 (sec x — tan x)dx = [ln(sec x + tan x) + ln(cos x)]q ^ 2 = [ln(l + sin x)]£^ 2 = In 2. 

The areas under sec x and tan x separately are infinite 39 Only p = 0 


CHAPTER 8 APPLICATIONS OF THE INTEGRAL 
Section 8.1 Areas and Volumes by Slices (page 318) 

1 x 2 - 3 = 1 gives x = ±2; / 2 2 [(1 — (x 2 - 3)]dz = 

3 y 2 — x — 9 gives y = ±3; /f 3 [9 - y 2 ]dy = 36 

5 x 4 — 2x 2 = 2x 2 gives x = ±2 (or x = 0); / 2 2 [ 2x 2 — (x 4 — 2z 2 )]cfx = 

7 y = x 2 = —x 2 + 18x gives x = 0, 9; [(— x 2 + 18x) — x 2 ]dx = 243 

9 y = cos x = cos 2 x when cos x = 1 or 0, x = 0 or | or • • • (cos x — cos 2 x)dx = 1 — J 

11 e* = e 2 *” 1 gives x = 1; /^[e* - e 2 *'" 1 ]^ = (e - 1) - ( e ~| — ) 

13 Intersections (0, 0), (1, 3), (2, 2); J^x — x]dx + / 2 [4 - x — x\dx = 2 
15 Inside, since 1 — x 2 < V^l ” x 2 ; /^[x/l — x 2 - (1 — z 2 )]dx = f ~ | 

17 7 = /* a ny 2 dx = /® a 7 t 6 2 (1 - fr)dx = around y axis F = 47r g* 6 ; rotating 

x = 2, y = 0 around y axis gives a circle not in the first football 
19 V = Jq 27rxsin x dx — 2?r 2 21 7t(8 — x) 2 dx = f* 2ttx( 8 — x)cix = (same cone tipped over) 

23 /o 7T • l 2 dx - /q 1 7r(x 4 ) 2 dx = £ 2 tt( 1 - z 4 )z dx = ^ 

25 /i% ^(3 2 )dx - / 2 /3 7r(^) 2 dx = / 2 /3 2 ttx( 3 - £)dz = 

27 /q tt[(x 2 / 3 ) 2 — (x 3 / 2 ) 2 ]dx = |f; 2? rx(x 2 / 3 - x 3 / 2 )dx = |f (notice xy symmetry) 

29 x 2 = i? 2 -y 2 ,^ = 7r(i2 2 - y 2 )dy = tt(J2/i 2 - ^ ) 

31 f“ a [2y/a 2 — x 2 ) 2 dx — ^a 3 33 / Q 1 (2- v /l - y) 2 dy = 2 37 / ;4(x)dx or in this case / a[y)dy 

39 Ellipse; Vl — x 2 tan 0; |(l — x 2 ) tan S\ | tan 0 

41 Half of 7rr 2 A; rectangles 43 / * 7r(5 2 — 2 2 )dx = 42 n 45 J * 7r(4 2 — l 2 )dx = 30?r 
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47 Jo ° *(( 4 “ J/) 2 “ o 2 )dy = f (6 3 - 3a 2 6 + 2o 3 ) 49 / Q 2 x(3 - x) 2 dx; /„ 2xy(2)dy + /* 2xy(3 - y)dy 

53 960 x cm 55 | 57 f 

59 2x 61 J* 2xy(2 — y/y)dy = 63 3xe 65 Height 1; / 0 ° 2xx dx = xa 2 ; cylinder 

67 Length of hole is 2y/b 2 — a 2 = 2, so 6 2 — a 2 = 1 and volume is ^ 69 F; T(?) ; F; T 

Section 8.2 Length of a Plane Curve (page 324) 

1 £ y fi + (f S 1 / 2 ) 2 ^ = £[(¥) 3/2 - 1] = 3 Vl + * 2 (* 2 + 2)dx = £( 1 + x 2 )dx = f 

5 /l 3 \Z 1 + (i- 4^) 2diE = /l 3 + 4^)^= ¥ 

7 /* ^/l + (x 1 / 2 - ix- 1 /2)2 cix = J^(*l/3 + I I - 1 / 2 )dx = f 

9 /J^ 2 \/9 cos 4 £ sin 2 £ + 9 sin*T cos 2 £ d£ = f^ 2 3 cos £ sin £ dt = | 

11 J^ 2 yj sin 2 £ + (1 — cos t) 2 dt = J^ 2 y/2 — 2 cost dt = /J^ 2 2 sin | dt = 4 — 2\/2 

13 /o \/£ 2 + 2£-H dt = /o (£ + l)d£ = § 15 J" \/l -f cos 2 x dx = 3.820 17 J* ^1+ £dx = 2.003 

19 Graphs are flat toward (1,0) then steep up to (1,1); limiting length is 2 
21 % = V36sin 2 3£ + 36cos 2 3£ = 6 23 J Q l x/^6 dy = \/26 

25 /i x yj\{* -*-*)* + 1 dy = /j, |(e» + e'»)dy = |(e» - = e - 

Using x = cosh y this is / \A + sinh 2 y dy = / cosh y dy = sinh y]Li = 2 sinh 1 
27 Ellipse; two y’s for the same x 2 9 Carpet length 2 ^ str aight distance \/2 
31 (ds) 2 = (dx) 2 + (dy) 2 + ( dx) 2 ; ds = ^(f ) 2 + ($ ) 2 + (£) 2 dt; 

da = Vsin 2 t + cos 2 1 + ldt = \/2dt; 2x\/2; curve = helix, shadow = circle 
33 L = Vl + 4x 2 dx; / 2 vT+x^dx = \/l + 4u 2 2du = 2L; stretch xy plane by 2 (y = x 2 becomes | = (| 

Section 8.3 Area of a Surface of Revolution (page 327) 

1 J® 2VV^M^dx = J® 2x v ^TIdx = *§* 3 2 fo 2x(7x)\/50dx = 14x\/50 

5 /2 t 2xV4^ 2 v /l+ j^ydx = f\ 4 xdx = 8x 7 / 2 2x1^1 + (2x) 2 dx = f (1 + 4x 2 ) 3 / 2 ] 2 = f [17 3 / 2 - l] 
9 / Q 3 2xx\/2dx = 9x\/2 11 Figure shows radius s times angle 5 = arc 2xi? 

13 2xrAs = x(R + i2')(s — s') = xiZs — xiZ's' because R's — J2s' = 0 

15 Radius a, center at (0, 6); (^f ) 2 + (^) 2 = a 2 , surface area / 2,r 2x(6 + a sin t) a dt = 4x 2 a6 

17 / 2 2xx-^ 1 + ij^jr dx = / 2 -j^=j = x 2 + 2x (write 2x - x 2 = 1 - (x - l) 2 and set x - 1 = sin 0) 

19 A/2 to^/HJdx ( can be done) 

21 Surface area = ff° 2x^^/l + ^dx > /“ = 2xlnx]J° = oo but volume = /“ x(^) 2 dx = x 

23 2x sin tV 2 sin 2 1 + cos 2 t dt = J* 2xsin ty/ 2 — cos 2 1 dt = 2xV2 — u 2 du = 

xu\/2 — u 2 + 2xsin -1 ^]ii = 2x + x 2 

Section 8.4 Probability and Calculus (page 334) 

1 P(X < 4) = l,P(X = 4) = -fe, P(X > 4) = 3 / 0 °° p(x)dx is not 1; p(x) is negative for large x 

5 JT e "*dx = p-; /j 101 c _a! dx « (.01) j 7 p(x) = F(x) = * for 0 < x < x (F = 1 for x > x) 



Answers to Odd-Numbered Problems 


A-15 


9 1.1+ 1.2+ -.+ 1.7 = 4 11 /~^^ T = lln(l+x 2 )]^=+oo 

13 /“ axe—dx = ]g° + / 0 °° = J 

15 So rr ( i+ a 3 ) = f tan_1 *5 Jo e_a!<ia: = 1 - e_x ; Jo* ae- a *da: = 1 - e- ax 17 /“ ie-*/ 10 da: = -e"*/ 10 ]^ = 

19 Exponential better than Poisson: 60 years — ► f Q . 01e~ ,olx dx = 1 — e - ’ 6 = .45 

21 y — three areas « | each because /x — cr to /x is the same as /x to /x + a and areas add to 1 

23 — 2/x / sp(s)ds + /x 2 / p(s)ds = — 2/x • /x + /x 2 • 1 = — /x 2 


l _ 


25 #* = 0 - 1 + 1. 1 + 2 

Also £ « 2 Pn - M 2 = 

27 P = Jo” = 2; ! - Jo 


1; a2 = (0 — l) 2 ' 3 + (l “ l) 2 ' 3 + (2 — l) 2 • 3 = 


0-I + 1-1 + 4. 1-1=1 


dx 


= 1 + <■ 


/5 ]$ - 


-2 


29 Standard deviation (yes - no poll) < = 2 ^* QQ = — Poll showed = || peaceful. 


95% confidence interval is from ~ 


— to “ + — , or 93% to 100% peaceful. 


60 30 1 60 » 

31 95% confidence of unfair if more than QQ = 2% away from 50% heads. 

2% of 2500 = 50. So unfair if more than 1300 or less than 1200. 


33 55 is 1.5cr below the mean, and the area up to /x — 1.5a is about 8% so 24 students fail. 

A grade of 57 is 1.3cr below the mean and the area up to /x — 1.3a is about 10%. 

35 .999; .999 1000 = (1 - j^) 1000 « k e because (1 - ±) n -> 


l 

c 


Section 8.5 Masses and Moments (page 340) 

1 x = y 3 x = | 5 x = 7 x = | = y 9 x = = y 11 x — — y 13 x — , y — 

15 x = ^ = y 21 I = J x 2 p dx — 2t J xp dx + t 2 f p dx\ — —2 f xp dx + 2t f p dx = 0 for t = s 

23 South Dakota 25 2n 2 a 2 b 27 M x = 0, M y = j 29 ^ 31 Moment 

33 J = £m n r 2 ; 2 E m n r n w n;° 35 14*i£; 14xl£; % 

37 solid ball, solid cylinder, hallow ball, hollow cylinder 39 No 
41 T « VT+J by Problem 40 so T « \/L4, \/L5, y/sfi, V* 


Section 8.6 Force, Work, and Energy (page 346) 

1 2.4 ft lb; 2.424 ... ft lb 3 24000 lb/ft; 83§ ft lb 5 10s ft lb; 10s ft lb 7 25000 ft lb; 20000 ft lb 
9 864,000 Nkm 11 5.6 • 10 7 Nkm 13 k = 10 lb/ft; W = 25 ft lb 15 f eOwh dh = 48000iu, 12000w 
17 ^wAH 2 ; ^wAH 2 19 9600tu 21 (1 — ^-)” 3 / 2 23 (800) (9800) kg 25 ± force 


CHAPTER 9 POLAR COORDINATES AND COMPLEX NUMBERS 
Section 9.1 Polar Coordinates (page 350) 


1 (1 , f ) 3 (2, f) 5 ( ^2, te) 7 (0,2) 9 (^10, y/l0) 11 (y/3 } -l) 13 2y/2 

15 y/r 2 + R 2 — 2r#cos(0 — <f>) 

17 0<r<co,— | < 0 < |; 0 < r < oo, ?r < 0 < 27r; y/i < r < y/5 , 0 < 0 < 2tt; 0 < r < oo, — ^ < 0 < J 
19 y = stan 0, r = ssec 0 21 0 = J, all r; r = ; r = cos0 + sin0 


23 x 2 -by 2 — y 


25 x — r sin 0 cos 0, y == r sin 2 0, s 2 + y 2 = y 




sin fl 


cos 0+sin 6 


31 (x 2 + y 2 ) 3 = x 4 
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Section 9.2 Polar Equations and Graphs (page 355) 

1 Line y = 1 S Circle x 2 + y 2 — 2x 5 Ellipse 3a; 2 + 4y 2 = 1 — 2x 7 x, y, r symmetries 
9 x symmetry only 11 No symmetry 13 x, y, r symmetries! 

15 x 2 + y 2 = 6y+8x — (x - 4) 2 + (y - 3) 2 = 5 2 , center (4,3) 17 (2,0), (0,0) 

19r=l-+,0=2f;r = l++,0 = ^;(O,O) 21 r = 2 , 6 = ±J, ±ff , ±f§, 

23 (x, y) = (1, 1) 25 r = cos 5 0 has 5 petals 27 (x 2 + y 2 — x) 2 = x 2 + y 2 

29 (x 2 + y 2 ) 3 = (x 2 - y 2 ) 2 31 cos0 = -§,sin0 = ^ — y = ^f,x = -§ 33x=f,r=-f 35.967 

Section 9.3 Slope, Length, and Area for Polar Curves (page 359) 

1 Area ^ 3 Area 5 Area - 7 Area | \ 9 f "^ 3 ( § cos 2 0 — ^ 

11 Area 8?r 13 Only allow r 2 > 0, then 4 f^ 4 | cos 20 dO = 1 15 2 + j 

17 6 = 0; left points r — 0 = x = — y = 

19 2~]6 4 = 40,000; ^[r/r 2 + c 2 H- c 2 ln(r + \/r 2 -j- c 2 )]^ 4 = 40,000.001 

21 tan tp — tan 0 23 x = 0, y = 1 is on limacon but not circle 25 ~ ln(27r + VT-Mtt 2 ) + ttn/T+Att 2 

27 ^ 29 \ (base) (height) « §(rA0)r 31 ±*\/2 33 2 tt(2 - a/ 2) 35 39 sec 0 

Section 9.4 Complex Numbers (page 364) 

1 Sum = 4, product = 5 5 Angles 7 Real axis; imaginary axis; | axis x > 0; unit circle 

9 cd = 5 4- lOi, § = 11 2 cos 0 } 1; —1, 1 13 Sum = 0, product = -1 15 r 4 e 4ie , \e~ iB , ± e~ 4ie 

17 Evenly spaced on circle around origin 19 e xt , e~ xt 21 e t ,c“ t , e° 23 cos7£,sin7f 
29 t— — y = —e n !^ 31 F; T; at most 2; Re c < 0 33 a: = ^ cos0, y = sin 0\ ±-^e~ xB l 2 

CHAPTER 10 INFINITE SERIES 

Section 10.1 The Geometric Series (page 373) 

1 Subtraction leaves G — xG — 1 or G = 3 |; |; 3^ 521 + 3- 2x + 4- 3x 2 + • • • = 

7 .142857 repeats because the next step divides 7 into 1 again 

9 If q (prime, not 2 or 5) divides 10 N — 10^ then it divides 10^" M — 1 11 This decimal does not repeat 

13§f;|§f 15^ 17 3 +t 19 ife 21 ^ 23 tan -1 (tan x) = x 

25 (1 + x + x 2 + x 3 • • -)(1 - x + x 2 — x 3 • • •) = 1 + x 2 + x 4 + • • • 

27 2(.1234...)is2 = “; 1 - .0123. .. is 1 - = §? 29 f+r = 1 

31 — ln(l — .1) = — In .9 33 § In ^ 35(n+l)! 37 y = 

39 All products like ai&2 are missed; (1 + 1) (1 + 1) ^ 1 + 1 41 Take x = | in (13): In 3 — 1.0986 

43 In 3 seconds the ball goes 78 feet 45 tan z — |; (18) is slower with x = | 
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Section 10.2 Convergence Tests: Positive Series (page 380) 

1 | J H is smaller than 1 + ^-1 

3 a n = s n - S n - 1 = , 3 = 1; a n = 4, 3 = oo; a n = In ^ - In = ln^,a = ln2 

5 No decision on^^n 7 Diverges: ^(1 + § H ) 9 E f ooVn 2 conver S es: Z) ^ is larger 

11 Converges: E ^ is larger 13 Diverges: £ — is smaller 15 Diverges: £ ^ is smaller 
IT Converges: E is larger 19 Converges: E 3 ^ is larger 21 £ = 0 23 £ = 0 25 £ = | 

27 root t 2 ^)” — ► £ = - 29 3 = 1 (only survivor) 31 If y decreases, E 2 y(0 — fi y (*)<** ^ Ei* 2/( l ) 

33 Er e_n ^ /°°° e” x da; = 1; J + H = 35 Converges faster than / 

37 Diverges because / 0 °° = ^ln(z 2 + l)]o° =00 39 Diverges because x e ~ n dx = = 00 

41 Converges (geometric) because f^°(^) x dx <00 43 (b) /^ +1 — > (base 1) (height jj^y) 

45 After adding we have 1+ | + 1- ^ (close to In 2 n); thus originally close to In 2n — In n = In = In 2 

47 fioo° f? = loo “ 1^0 ~ * 0 °9 49 Comparison test: sin a n < a n ; if a n = nn then sin a n = 0 but E a » = 00 

51 a n = ri~ b ! 2 53 a n = ^ 55 Ratios are 1, §, 1, §, • • • (no limit £); (yxr) 1/2fc = yes 

57 Root test jA- -+ £ = 0 59 Root test £ = A 61 Divergence: AT terms add to In — ► 00 

63 Diverge (compare E n ) 65 Root test £ = | 67 Beyond some point < 1 or a n < b n 

Section 10.3 Convergence Tests: All Series (page 384) 

1 Terms don’t approach zero 3 Absolutely 5 Conditionally not absolutely 7 No convergence 

9 Absolutely 11 No convergence 13 By comparison with E |°n| 

15 Even sums | ^ H diverge; a^’s are not decreasing 17 (b) If a n > 0 then 3 n is too large so 3 — s n < 0 

19 $ = l — A below by less than ^ 

21 Subtract 2(yj + y* H ) = |(^ + ) = i 2 from P os i^ ve series to get alternating series 

23 Text proves: If E \ a n\ converges so does E °n 

25 New series = (§) “ 4 + (§) ~ s * * * = 2 ( 1 ” 2 + 3 “ 4 ’) 27 I ln 2 ; a4d ln 2 series to 2 ( ln 2 seri es) 

29 Terms alternate and decrease to zero; partial sums are 1+ | + b - — Inn —► 7 

31 .5403? 33 Hint + comparison test 35 Partial sums a n — 00 ; sum — ao if a n — ► 0 

37 x = 3 but product is not 1 + f + • * • 

2 1 3 0 

39 Write x to base 2, as in 1.0010 which keeps 1 + | and deletes §> 4 > ' ' ' 

41 § + 27 + - • adds to = | and can’t cancel | 

43 = cot | (trig identity) = tan (f ~ §)> 3 = S = ~ 1°8(1 — c ‘) by 10a in Section 10.1; 

take imaginary part 


Section 10.4 The Taylor Series for e I ,sin x, and cos a; (page 390) 


1 1 + 2x + H ; derivatives 2"; 1 + 2 + |^ H 3 Derivatives 1 + ix H 

5 Derivatives 2”n!; 1 + 2x + 4a; 2 + • • • 7 Derivatives — (n — 1)!; —x — ^ ^ — ••• 

9y = 2 — e* = l — x— |j- lly = x- ^-+ - = sin x 13 y= xe x = x + x 2 + §7 + • • • 


15 1 + 2x + x 2 ;4 + 4(x - 1) + (x - l) 2 17 -(x - l) 5 

21 (x - 1) - (z ~ 1}3 -1- = ln(l + (x - 1)) 23 e 

25 x + 2x z + 2x 3 27 § - ^ 29 x - ^ 
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35 oo slope; 1 + |(x -1) 37 s - 39 z + + ^ 41 1 + x + ^ 43 1 + Ox - x 2 

45 cos 6 = g ~*~ 2 e ~ , sin 0 = g — 47 99th powers —1, — i f e 3 ™/ 4 , —i 

49 e“ f7r / 3 and —1; sum zero, product -1 53 if, if + 27 ti 55 2e x 


Section 10.5 Power Series (page 395) 


11 + 4®+ (4x) 2 H \r — \\x = \ 3e(l~x+|^ );r = oo 

5 lne + ln(l + -) = 1+ f — |(|) 2 +-;r = e;x = — e 
7)^1 < lor (-1,3); ^ 9 \x - a\ < 1; - ln(l - (x - a)) 

11 1 + Y\ + It “I » add to 1 at x = 0 13 a 1} a 3 , • are all zero 15 ^ 


17 /^ 8 )(c) = cos c < 1; alternating terms might not decrease (as required) 

i^/ = ^ t^;(i- c ) 4 = !- 1 

21 /( n+1 )(z) = (!_“)*+! , I Jin | < dV- ^mT) 


0 when x = ~ and 1 


c> I 


1 

2 


23 R 2 = f(x) - f{a) - f'(a)(x -a) - §/"(a)(x - a) 2 so R 2 = R^ = R” = 0 at x = = / #,# ; 

Generalized Mean Value Theorem in 3.8 gives a<c<c 2 <c\<x 
25 1 + \x 2 + f(x 2 ) 2 27 (~l) n ; (-l) n (n + 1) 

29 (a) one friend k times, the other n — k times, 0 < k < n; 21 33 (16 — l) 1 / 4 « 1.968 

35 (1 + .l) 1,1 = 1(1. 1)(.1)+ » 1.1105 371+ ^ + ^;r= f 41 x + x 2 + §z 3 + |x 4 

43x 2 -§x 4 + ^x 6 45 l+| + f + ff 47.2727 49-|-§ = -§ 5lr = l,r=f-l 


CHAPTER 11 VECTORS AND MATRICES 
Section 11.1 Vectors and Dot Products (page 405) 

1 (0, 0, 0); (5, 5, 5); 3; —3; cos 0 = — 1 3 2i — j — k; — i — 7j + 8k; 6; —3; cos 6 = — \ 

5 (u 2 , — Vi); (t> 2 , — t/i, 0), (t> 3 , 0, — iq) 7 (0,0);(0,0,0) 9 Cosine of 6 ; projection of w on v 

11 F;T;F 13 Zero; sum = 10 o’clock vector; sum — 8 o’clock vector times 1+ 2 v/3 
15 45° 17 Circle x 2 + y 2 = 4; (x — l) 2 + y 2 = 4; vertical line x = 2; half-line x > 0 

19 v= — 3i + 2j,w = 2i— j;i = 4v— w 21 d = -6; C = i - 2j + k 

23 cos 0 = ^=; cos 0 = cos 0 = ^ 25A(A + B) = 1 + AB=1 + B- A = B*(A + B); equilateral, 60° 

27 a = A • 1, 6 = A * J 29 (cos £, sin £) and (— sin £, cos £); (cos 2£, sin 2£) and (—2 sin 2£, 2 cos 2t) 

3lC = A + B,D = A- B;C D=A A + B A-A B-B B = r 2 -r 2 =0 
33 U + V — W = (2, 5, 8), U -V + W = (0, -1, -2), -U + V + W = (4, 3, 6) 

35 c and yj a 2 + b 2 \ b/a and V a 2 + b 2 + c 2 

37 M x = |A + C,M 2 = A + ^B,M 3 =B + ICjMx +M 2 + M 3 = |(A + B + C) = 0 

39 8 < 3 -3; 2 y/xy < x + y 41 Cancel a 2 c 2 and b 2 d 2 ; then b 2 c 2 + a 2 d 2 > 2abcd because (6c — ad) 2 > 0 

43 F; T; T; F 45 all 2y/2; cos 6 = - 1 

Section 11.2 Planes and Projections (page 414) 

1 (0,0,0) and (2, —1, 0); N = (1,2,3) 3 (0,5,6) and (0,6,7); N - (1,0,0) 

5 (1,1,1) and (1,2,2); N = (1,1,— 1) 7 x + y = 3 9x + 2y + z = 2 
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11 Parallel if N • V = 0; perpendicular if V = multiple of N 

13 i -f- j -h k (vector between points) is not perpendicular to N; V * N is not zero; plane through first three 
is x + y + z = 1; x 4- y — z — 3 succeeds; right side must be zero 

15 ax + by + cz = 0; a(x - x 0 ) + b(y - y 0 ) + c[z - z 0 ) = 0 17 cos 8 = § 

19 — A has length | 21 P = |A has length ||A| 23 P = —A has length |A| 25 P = O 

27 Projection on A = (1,2,2) has length |; force down is 4; mass moves in the direction of F 
29 |P| m in = = distance from plane to origin 31 Distances and both reached at (3 j 3* — 3) 

33 i 4- j 4- k ; t = — f; (|, —3, 3); 

35 Same N = (2, —2, 1); for example Q = (0,0, 1); then Q + |N = (|, — |, ^) is on second plane; ||N| = 

37 3i 4- 4j; (3£,4t) is on the line if 3(3£) 4- 4(4 1) = 10 or t = P = (|^, — r), \P\ = 2 

39 2x 4- 2(-p - §x)(-§) = 0 so x - §§ = §;3x4-4y = 10 gives y = | 

41 Use equations (8) and (9) with N = (a, b) and Q = (xi,yi) 43 t = ^ ; B onto A 
45 aV L = §L/ - §L ///; aVF = |L// 4- §Ljjj 
47 V • L/ = 2 — 1; V * Ljj = — 3 — 1, V * L/// = — 3 — 2; thus V • 2i = 1, V • (i - >/3j) = -4, and V = |i4-^j 


Section 11.3 Cross Products and Determinants (page 423) 


IO 3 3i — 2j — 3k 5 — 2i 4- 3j — 5k 7 27 i 4- 12 j - 17 k 

9 A perpendicular to B; A, B, C mutually perpendicular 11 |AxB| = v^,AxB = j — k 13 A xB = O 

15 |A X B| 2 = (of 4- al)(bl 4- 6 2 ) - (a 1 b 1 4- a 2 b 2 ) 2 = (a x b 2 - a 2 b x ) 2 ; A X B = (a ± b 2 - a 2 b x ) k 
17 F; T; F; T 19N = (2,1,0) or 2i 4- j 21 x-t/ + 2r = 2 soN = i- j4 -k 
23 [(1, 2, 1) - (2, 1 , 1)] x [(1, 1, 2) - (2, 1 , 1)] = N = i + j + k; x + y 4- « - 4 

25 (1, 1, 1) X (a, 6, c) = N = (c — 6)i 4- (a — c)j 4- {b — a)k; points on a line if a = b = c (many planes) 

27 N = i 4“ j, plane x 4- y — constant 29 N = k, plane z = constant 
x y z 

= i- y + 2: = 0 33 i — 3 j ; — i 4- 3j; — 3i — j 35 -1,4, -9 


31 


cl 2 a 3 

“ c 2 

ai a 3 

4- c 3 

cl\ a 2 

63 

61 fc 3 

bi 6 2 

2 = (|a6) 2 + (|ac) 2 + (|£>c) 2 = 

(i|AxB 


39 4 -Ci 

I 1/2 ^3 1 1^1 ^3 1 1^1 ^2 1 

) 2 when A = ai — 6 j, B = ai — ck 
43 A = |(2 • 1 — (— 1 ) 1 ) = §; fourth corner can be ( 3 , 3 ) 

45 aii 4 - a2j and &ii 4 - 62J; \a1b2 ~ a 2 bi |;AxB= V (aib 2 — a 2 b\ )k 

47 A x B; from Eq. (6), (A x B) x i = -(a 3 6i - ai6 3 )k 4 - (aib 2 — a 2 &i)j; (A • i)B - 
ai(&ii 4 “ b^ 4 - 63k) — 6i(aii 4 - 4 - a 3 k) 

49 N = [Q — P) x (R — P) = i 4- j 4- k; area |\/ 3 ; x 4- y 4- z — 2 


(B * i) A = 


Section 11.4 Matrices and Linear Equations (page 433) 


= 5 


1 x = 5,y = 2,D = -2, 

5 x = 2y, y = anything, D = 0, 2y 

- =M - 3 v - J, 

— —8 — 6 >y - d 


' 1 

+ 2 

1 ‘ 



1 

-1 


3 x 

r 2 : 


? -4 : 


‘ 0 ' 

1 

+ y 

-2 

— 

0 


3x = 3,y = 1, 


= 3 


-1 

-3 


,£> = -8 


7 no solution, D = 0 


9s=L 


3 8 
1 0 


= nl = 1 
-8 1 


11 


C3|fcO 
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15 ad — be = —2 so A 1 = 


.5 .5 

.5 -.5 


IT Are parallel; multiple; the same; infinite 
19 Multiples of each other; in the same direction as the columns; infinite 
21 di = .34, d 2 = 4.91 23 .96x + . 02 y = .58, .04x + .98y = 4.92; D = .94, x = .5, y = 5 

25 a — 1 gives any x = — y; a = — 1 gives any x — y 


27 D = -2, A 


-l 


5 -4 

-3 2 

B = -2 (not + 2 ), (—A ) -1 = -A _1 ;B = l ,/" 1 


; D = — 8 , ( 2 A) 1 = ~A 1 ;D=-^,(A x ) 1 = original A; 
I 


29 AB = 
31 AB = 
33 A ” 1 = 


7 5 
5 1 


, BA = 


11 

3 


,BC = 


3 5 
1 3 


,CB = 


4 2 
2 2 


ae + bg af - b bh 
ce + dg cf + dh 


aecf + aedh -j-bgcf + bgdh , , . w . r , 
-afce-afdg - bhee - bhdg = H - bc)(eh - } g) 


1 -2 

0 I 




35 Identity; B l A 


-i j-i 


37 Perpendicular; u = v x w 


39 Line 4 + t, errors — 1 , 2 , —1 

Section 11.5 Linear Algebra 


h -i 

0 1 „ 

41 d\ — 2d 2 + d 3 = 0 43 A -1 can’t multiply O and produce u 



' 1 

2 

2 ' 


X 


■ 0 ' 


1 

2 

3 

5 


y 

= 

0 

3 


0 

2 

2 


z 


8 



(page 443) 

1 -1 0 
0 1 -1 
1 0-1 


x 

y 

z 


5 det A = 0 , add 3 equations — > 0=1 7 5a + lb + 0 c = d, A 1 = 

9bxc;abxc = 0; determinant is zero 11 6, 2, 0; product of diagonal entries 


1 0 0 


0 ± 0 
0 0 ± 


13 A- 1 = 

1 

O (— 1 

-2 

1 

2 

1 

1 

,J3 _1 = 

1 

O O 

1 

CO to 

1 " 
2 

1 


L 0 

0 

3 J 


Li 

0 

0 m 


17 d= ( 1 , — 1 , 0 ); u = ( 1 , 0 , 0 ) or (7,3,1) 19 AB = 


15 Zero; same plane; D is zero 

, det AB = 12 = (det A) times (det B) 


8 4 1 

40 26 0 

18 12 0 


, det (A + C) is not det A + det C 


2 3-3 

21A + C = -1 4 6 

0-1 6 

23 p = (»)(3)-(0)(6) =hq= -(4)(3K(0)(0) = _ 2 25 is always A 

27 -1,-1, l,l,;(y,z,z), (z,y,x),(y,z,x), ( z,x,y ) 29 2! = 2,4! = 24 


= by 


2 ’ * 


31 z = = -|,i = 3; 

33 New second equation 3z = 0 doesn’t contain y; exchange with third equation; there is a solution 
35 Pivots 1,2,4, D = 8 ; pivots 1 , — 1 , 2 , D = —2 37 a\ 2 = 1 , a 2 \ = 0, ]T) dijbjk = row i, column k in AB 

39 G& 11 CZ 22 ~ a i 2 a 2 i 7 ^ 0 ; B = 0 


CHAPTER 12 MOTION ALONG A CURVE 
Section 12.1 The Position Vector (page 452) 

1 v(l) = i + 3j; speed \/l0; 3 jg = = zf=Tt I tangent to circle is perpendicular to | 

5 v = e* i - e -t j = i - j;y - 1 = ~[x - l);xy = 1 
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7 R = (1, 2, 4) + (4, 3, 0)t; R = (1, 2, 4) + (8, 6, 0)t;R = (5, 5, 4) + (8, 6, 0)t 
9 R = (2 + t,3,4 — t);R = (2 + y,3, 4 — y); the same line 
11 Line; y = 2 + 2i, z = 2 + 3t; y = 2 + 4i, z = 2 + 6t 

13 Line; V36 + 9 + 4 = 7; (6, 3, 2); line segment 15 1 ; ^ 17 x — t, y = mt + b 

19 v = i — ^j, |v| = Vl + t -4 , T = v/|v|;v = (cost — tsint)i + (sint + tcost)j; |v| = \/l + 1 2 ; 

T = v/|v|; v = i + + 2k, |v| = 3, T = |v 

21 R = — sint i + cost j + any Ro; same R plus any wt 

23 v = (1 - sin t)i + (l - cost)j; |v| = \/3 - 2sint - 2cost, |v| m i n = \/3 - 2\/2, Mm** = \/3 + 2^2; 

a = —cost i + sint j, |a| = 1; center is on x = t,y = t 
25 Leaves at (f , f );v = (-V2,\/2);R = (^, 4) + v(t - §) 

27R = cos^i + sin^j + ^k 

29 v = sec 2 t i + sec i tan t j; |v| = sec 2 tVl + sin 2 t; a = 2 sec 2 t tan t i + (sec 3 t + sec t tan 2 t) j; 

curve is y 2 — x 2 = 1; hyperbola has asymptote y = x 
31 If T = v then |v| = 1; line R = ti or helix in Problem 27 
no , u\ n\\ (2t, 0) 0 < t < \ (3 — 2t, 1) 1 < t < I 

“ (.(*),»(*))= ,) .<,<? jo,4-2<) | < t < 2 


35 x(t) = 4 cos |,y(t) = 4 sin | 37 F; F; T; T; F 39 J = tan# but J ^ tant 

41 v and w; v and w and u; v and w, v and w and u; not zero 

43 u = (8, 3, 2); projection perpendicular to v = (1, 2, 2) is (6, -1, -2) which has length y/41 

45 x = G(t), y = F(t); y = x 2 / 3 ; t = 1 and t = — 1 give the same x so they would give the same y; y = G(jF'" 1 (x)) 


Section 12.2 Plane Motion: Projectiles and Cycloids (page 457) 

1 (a) T = 16/ g sec, R = 128V3/9 ft, Y = 32/g ft (b) f (c) f , 0, if 3 x = 1.2 or 33.5 

5 y = x- \x 2 = 0 at x = 2; y = ztanx - §(^-^) 2 = 0 at x = R 7 x = 

9t/ 0 » 11.2, tan a « 4.32 11 vo = y/gR = \/980 m/sec; larger 13 v 2 /2 g = 40 meters 

15 Multiply R and H by 4; dR = 2v 2 cos2ada/g i dH = v 2 sin a cos a da/ g 
17t=&f sec; y = 12- ^ * -2.1m; + 2.1m 19 T = 

21 Top of circle 25 ca(l — cos 0), ca sin 0; 0 = 7r, ~ 27 After 0 = tt : x = na + Vot and y = 2a — ^gt 2 29 2; 3 

31 g4 ” a -; Sir 2 a 3 33 x = cos 0 + 0 sin 0, y = sin 0 - 6 cos 0 35 (a = 4) 6 tt 

37 y = 2 sin 0 — sin 20 = 2 sin 0(1 — cos 0); x 2 + y 2 = 4(1 — cos 0) 2 ; r = 2(1 — cos 0) 


Section 12.3 Curvature and Normal Vector (page 463) 


1 (i+JU ' jV* ^2 ® ^ (line) 7 9 (- sin t 2 , cos i 2 ); (— cos t 2 , — sin t 2 ) 

11 (cost, sint); (-sint, - cost) 13 (-| sint, | cost, |); |v| = 5,« = | longer; tan0 = | 


15 2 y/ 2 a>Ji—co&$ 17 K = w , N = i 19 (0, o); (-3, 0) with 1 = 4; (-1, 2) with ^ = 2y/2 


21 Radius i, center (l,±yj^l)for«< 1 23 UV' 25 (sint i - cost j +k) 27 | 

29 N in the plane, B = k, r = 0 31^^ 33 a = 0 T + 5w 2 N 35 a = T + 

37 a = 39 | F 2 + 2(F') 2 - FF"\/(F 2 + p' 2 ) 3 / 2 
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Section 12.4 Polar Coordinates and Planetary Motion (page 468) 

1 j, -i; i + j = u r - Me 3 (2, -1); (1, 2) 5 v = 3e 3 (u r + ue) = 3e 3 (cos3 - sin3)i + 3e 3 (sin3 + cos 3)j 

7 v = — 20 sin 5t i + 20 cos 5t j = 20 T = 20 u# ; a = — 100 cos 5t i — 100 sin 5£ j = 100 N = — 100 u r 
9r# + fi = 0 - f it^S) 11 S = .0004 ndlau/Mc; *-•*§- 40,000 
IS mR Xu; torque 15 T*/>(CM/4r>)‘/» 1T4 uV/T»G 19 hf 
23 Use Problem 15 25 a + c = , a — c = solve for <7, D 

27 Kepler measures area from focus (sun) 29 Line; x = 1 

33 r = 20-2 1,6 = 2^,v = -2u r + (20 - 2t)fgue,;a = (2t - 20)(ff ) 2 u r - 4(f*)u 0 ; / Q 10 |v|dt 

CHAPTER 13 PARTIAL DERIVATIVES 
Section 13.1 Surfaces and Level Curves (page 475) 

3 x derivatives oo, —1, —2, —4e~ 4 (flattest) 5 Straight lines 7 Logarithm curves 
9 Parabolas 11 No: f — (x + y) n or ( ax -f by) n or any function of ax + by 13 f{x,y) = 1 — x 2 — y 2 
15 Saddle 17 Ellipses 4x 2 + y 2 = c 2 19 Ellipses 5x 2 + y 2 = c 2 + 4cx + x 2 

21 Straight lines not reaching (1,2) 23 Center (1,1);/ = x 2 + y 2 - 1 25 Four, three, planes, spheres 

27 Less than 1, equal to 1, greater than 1 29 Parallel lines, hyperbolas, parabolas 

31 ^ : 48x — 3x 2 = 0, x = 16 hours 33 Plane; planes; 4 left and 3 right (3 pairs) 


Section 13.2 Partial Derivatives (page 479) 


13 + 2 xy 2 ; -1 + 2 yx 

o _ x • y 
y a^ + y 2 * x*+y 3 

17 (/=£)/, 


3 3x 2 y 2 2x;2x 3 y e y 5 

11 


— 2x 




(x 3 +y 3 ) 3 ’ (x 3 +y 3 ) 2 


^ 2 -f-y 3 > x 3 +y 3 13 2,3,4 15 6(x + iy ) , 6z(x + zy), 6(x + zy) 

3 — ; fxy = “5^; /yy = 19 ~ a 2 cos axcos 6y, a6 sin ax sin 6y, — 6 2 cos ax cos by 


t ■ - — j r • • ^ t 

21 Omit line x = y; all positive numbers; f x = — 2(x — y) -3 , f y = 2(x — y) -3 
23 Omit z = t\ all numbers; 

25 x > 0, t > 0 and x = 0, £ > 1 and x = — 1, — 2, • • • , t = e, e 2 , • ■ * ; f x = (lnt)x lnt_1 , / t = (In x)t lnx ~ 1 

27 y, x; / = G(x) + H (y) 29 £ = ^^(xy) - yv(xy) 

31 /xxx — Oy 3 , fyyy = 6x 3 , / X xy = /xyx = /yxx = lSxy 2 , /yyx — fyxy — fxyy — 18x 2 y 

33 y(y) = Ae 0 *'/ 7 35 y(y) = Ae 0 *'/ 2 + Bc^l 2 


37 ft = - 2 /, /** = fyy - -e 2t sin x sin y; e 13t sin 2x sin 3y 
39sin(x + t) moves left 41 sin(x - ct), cos(x + ct), e*~ ct 

43 (B - A)h y (C*) = {B - A)[f„(b,C*) - f y (a,C*) ] = {B - A)(b - a)f yx {c*,C*y, continuous f xy and f yx 
45 y converges to 6; inside and stay inside; d n = \/(x n — a) 2 + (y n — fc) 2 — ► zero; d n < e for n > N 
47 e, less than 6 49 f(a, 6); or ( a ,_ 1 ) 1 ( i ,- 2 ) 51 /(0, 0) = l;/(0, 0) = 1; not defined for x < 0 
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Section 13.3 Tangent Planes and Linear Approximations (page 488) 


lz — l = y— l;N=j — k 3 z - 2 = §(z - 6) - §(y - 3);N = ±1 - §j - k 
5 2{x - 1) + 4(y - 2) + 2(z - 1) = 0;N = 2i + 4j + 2k 7z-l = i-l;N = i-k 
9 Tangent plane 2zo(z ~ ^o) — 2xo(z — xo) — 2yo (y — S/o) = 0; (0,0,0) satisfies this equation because 
zfi — x% — t/o = 0 on the surface; cos 6 = (surface is the 45° cone) 

11 dz = 3 dx — 2 dy for both; dz — 0 for both; A z = 0 for 3x — 2y, Az = .00029 for x 3 /y 2 ; tangent plane 
13 z = zo + plane 6(x - 4) + 12(y — 2) + 8 (z — 3) = 0; normal line x = 4 + 6£, y = 2 + 12t, z = 3 + 8£ 
15 Tangent plane 4 (x — 2) + 2(y — 1) + 4(z — 2) = 0; normal line x = 2 + 4t, y = 1 + 2£, z = 2 + 4£; (0, 0, 0) 


at t = — § 


IT dtu = yodx + x 0 dy\ product rule; Aty — dw = (a; - xo)(y — yo) 

19 dl = AOOOdR + .08<fP; dP = $100;/ = (.78)(4100) = $319.80 

loo == 404 * decrease = 100 — 101 ~ = 404 > == y^ x ~~ ^ 


21 Increase = ^ 


23 A0 « -* - ^±* A * 

V* 3 +y 3 


25 Q increases; Q, = -*f,Q t = =f,P» = -.2 Q. = f ,Pt = = 1;Q = 50 - 2§°(s - .4) - §(* - 10) 

27 s = 1, t = 10 gives Q = 40: “ I J * + fj^'+V 0 5 Q. = -20, Qt = ~ § , P. = 20, P t = § 

29 z — 2 = i — 2 + 2(y — 1) and z - 3 = 4(z - 2) — 2(y - l);z = l,y = |,z = 0 
31 Ax = -§, Ay = |;*i = §, yi = -§; line x+ y = 0 

33 3a 2 A® — Ay = —a — a 3 gives Ay = —Ax = ; lemon starts at (l/\/3, — l/>/3) 

— Az + 3o 2 Ay = a + a 3 

35 If x 3 = y then y 3 = z 9 . Then z 9 = z only if z = 0 or 1 or —1 (or complex number) 

37 Az = -z 0 + 1, Ay = -yo + 2, (zi,yi) = (1,2) = solution 

39 G = H = 41 J= ** ° y , Az = — 1 + e - **, Ay = — 1 — (z„ — 1 + e~ Xn )e~ Vn 

43(z 1 ,y 1 ) = (0,|),(-|,|),(| I 0) 


Section 13.4 Directional Derivatives and Gradients (page 495) 


1 grad f = 2xi- 2yj, J9 U / = V$x ~ y, Du/(P) = 

3 grad / = e* cosy i — e*siny j,Du/ = -e*siny, Du/(P) = -1 

5/=v/i 2 + (y-3)2 ( grad/=^i+^j,Du/=^I>u/(P) = ^ 7 grad / = ^ ^ j 

9 grad / = 6xi + 4yj = 6i + 8j = steepest direction at P; level direction — 8i + 6j is perpendicular; 10, 0 
11 T; F (grad / is a vector); F;T 13 u = ( ^J +b , , Ai / = Va 2 +b 2 

15 grad j = (c*-», -«-») = («" 1 , -c" 1 ) at P;u = {fa ^),Z?u/ = v^" 1 

17 grad / = O at maximum; level curve is one point 19 N — (—1, 1, — 1), U = (—1, 1, 2),L = (1, 1, 0) 

21 Direction -U = (-2,0, -4) 23 -U = ( ^_*,: gt , v/l _^_ y2 , 

25 / = (x + 2y) and (x 4- 2y) 2 ;i + 2j; straight lines x + 2y = constant (perpendicular to i + 2j) 

27 grad / = ±(^~, ^|),' grad g = ±(2\/5, >/5), / = ±(^ ~ + C'i ^ = ±(2y/lx -f y/5y) -f C 

29 6 = constant along ray in direction u = ; grad 5 grad 0 = 0 

31 U = (/,,, / y , / 2 + / 2 ) = (-1, -2, 5); -U = (-1, -2, 5); tangent at the point (2,1,6) 

33 grad / toward 2i + j at P, j at Q f -2i + j at R ; (2, |) and (2|, 2); largest upper left, smallest lower right; 
z ma x > 9; z goes from 2 to 8 and back to 6 
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35 f - aV(*- !) 2 + {y- |f)o,o = (^, 5^) 

37 Figure C now shows level curves; |grad /| is varying; / could be xy 
39 x 2 + xy\e x ~ y \ no function has |£ = y and |£ = — s because then / xy ^ / ya; 

41 v - (1, 2t); T = v/Vl + 4i 2 ; f = v • (2t, 2t 2 ) = 2t + 4t 3 ; f t = (2t + 4f 3 )/^/^T4t2 
43 v = (2,3); T = ^=; £ = v • (2x 0 + 4t, -2y 0 - 6t) = 4z 0 - 6 y 0 - 10 1; £ = 

45 v = (e‘, 2e 2t , -«"*); T = grad / = (A, ±, ±) = («"*, a" 2 *, e‘), £ = 1 + 2- 
47 v = (-2sin2t, 2cos2t),T = (-sin2f,cos2t); grad / = (y,a:), £ = -2sin 2 2£ + 2cos 2 2f, £ = i £ ; 
zero slope because / = 1 on this path 

49 * - 1 = 2(x - 4) + 3(y - 5); / = 1 + 2(x - 4) + 3(y - 5) 51 grad / ■ T = 0; T 


Section 13.5 The Chain Rule (page 503) 


1 f y = cf x = ccos(x + cy) 3 f y = 7 f x = 7e x+7y 
dt — 1 (wave moves at speed 1) 


9 42 


5 ^If 7 Moves left at speed 2 


11 d^f( x + i y) = /"(* + ty), £^/(z + iy) =i 2 f"(x + iy) 
so f xx + / ys/ = 0; (x + iy ) 2 = (z 2 - y 2 ) + i{2xy) 

13 I = 2z(l) + 2y(2t) = 2t + 4t 3 15 £ = yf t + z£ = -1 = + 

19 y = i^r 2 h, f + ^ f = 36* 

21 3? = 75TOP( 60 ) + = $ m P h ; $ = 7TOP( 60 ) + * 74 m P h 

23 £ = “iff + u z|f + u 3§£ 25 If = 1 with z and y fixed; £ = 6 

27 ft = f,t + /„(2t); f tt = f xt t +f x + 2 fytt + 2/y = {f xx t + f yx {2t))t + f x + 2(f xy t + f yy (2t))t + 2 f y 

29 If = If If + If If = If cos 6 + If sin ^ is fixed 

31 r *» = y^f+yj “ (* 3 +y J )»/» = (i’+V 1 ) 5 / 1 > ) = r ~ Xr ~ 2 §5 = r ~ 7> = ^ 

33 = ^r_~f;;y )3 ; ( cos *)(if )» = i; first answer » also Vl _) ina , = ^7 

35 f r — f x cos 0 + f y sin 0 , / r 0 = —f x sin 0 + / y cos sin# cos 0) -f / xy (— r sin 2 0 -f r cos 2 #) d-/ yy (rcos 0 sin 0) 
37 Yes (with y constant): |f = ye**, §f = ~ ^ 39 / t = f x x t + / y y t ; /« = f xx x 2 + if^Xtyt + / yy y t 2 

41 (£1) - + dj-dn = a _ = 4t52t = ij 

da; ^ a y ax U 5^>vaa;ls/ 1 dz )* dy dz 5 ° 

43 1 45 / = y 2 so f x = 0, f y = 2y = 2r sin 1 9; / = r 2 so f r = 2r = 2\J x 2 T y 2 , / 0 = 0 

f x%u d - fyVu /a: d" / y 5 fx%v d“ / y !/ti /a; / y j = fxx x u d" fxyVu d" fy X X u ~h / yy y u 

/xa; d" 2/x y d" / yy j fxx-^v d“ fxyVv fyx^v fyyVv ~ fxx 2 f X y d~ fyy Add y uu d" Q vv 49 False 


Section 13.6 Maxima, Minima, and Saddle Points (page 512) 


1 (0,0) is a minimum 3 (3,0) is a saddle point 5 No stationary points 7 (0,0) is a maximum 
9 (0,0,2) is a minimum 11 All points on the line x—y are minima 13 (0,0) is a saddle point 
15 (0,0) is a saddle point; (2,0) is a minimum; (0, —2) is a maximum; (2, —2) is a saddle point 
17 Maximum of area (12 — 3y)y is 12 

1Q 2(x d- y) d- 2(x -f 2y — 5) -h 2(x -f 3y — 4) = 0 x = 2; 


2(x + y) d- 4(z + 2y - 5) + 6(x + 3y - 4) = 0 
21 Minimum at (0, |); (0, 1); (0, 1) 


_ gives 


^ min because E xx E yy = (6)(28) > E% y — 12 2 
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28 dt = 0 wten tant = y/3 ;/ max = 2 at (§, ^),/ m in = -2 at 

25 (ax + 6y)max = V^G 2 + & 2 J (x 2 H- y 2 )min = ^2+53 27 0 < c < 4 

29 The vectors head-to-tail form a 60-60-60 triangle. The outer angle is 120° 312 + V$; 1 + y/3; 1 + 

35 Steiner point where the arcs meet 39 Best point for p = oo is equidistant from corners 

41 grad / = (v^ ^ + ^ + ^, ^ + ^ + ^ ); angles are 90-135-135 

43 Third derivatives all 6; / = |^x 3 + ^x 2 y + ^xy 2 + ^y 3 

45 ( J“) n ( J“) m ln(l “ a:y)] 0 ,o = n!(n “ 1)* for m = n > 0, other derivatives zero; / = — xy — ^ 

47 All derivatives are e 2 at (1,1); / « e 2 [l + (x — 1) + (y — 1) + |(x — l) 2 + (x - l)(y — 1) + |(y — l) 2 ] 

49 x = 1, y = —1 : f x = 2, / y = -2, / xx = 2, / xy = 0, f yy = 2; series must recover x 2 + y 2 
51 Line x — 2 y = constant; x + y = constant 

88 %“ H" %yfxy + ^2 /yy]o,o5 /** ^ 0 and fxxfyy t> / xy at (0,0); fx ~ fy — 0 55 Ax = — 1, Ay = — 1 

57 / = x 2 (12 - 4x) has / max = 16 at (2,4); line has slope -4,y = has slope = -4 

59 If the fence were not perpendicular, a point to the left or right would be closer 

Section 13.7 Constraints and Lagrange Multipliers (page 519) 

1 / = x 2 + (k - 2x) 2 ; £ = 2x - 4(A; - 2x) = 0; (^, |), ^ 3 A = -4, x min = 2, y min = 2 

5 A = 3(4)17® : [ x i y) = (i2 1 / 6 , 0) or (0, +2 1 / 6 ), / m in — 2 1 / 3 ; A = ^ : (x, y) = (±1, =kl), /max = 2 
7 A = J, (x, y) = (2, —3); tangent line is 2x - 3 y = 13 

9 (1 — c) 2 + (—a — c) 2 + (2 — a — b — c) 2 + (2 — b — c) 2 is minimized at a = — b = §, c = | 

11 (1,-1) and (-1,1); A = — | 

13 / is not a minimum when C crosses to lower level curve; stationary point when C is tangent to level curve 

15 Substituting H = |^ = |^=0andL = / min leaves = A 

17 x 2 is never negative; (0,0); 1 = A(— 3y 2 ) but y = 0;y = 0 has a cusp at (0,0) 

19 2x = Ai + A 2 ,4y = Ai,2z = Ai — A 2 ,x + y + z = 0, x — z = 1 gives Ai = 0, A 2 = 1, / min = § at (|,0, -§) 

21 (1,0,0); (0, 1,0); (Ai, A 2 ,0);x = y = 0 23 and §£; A = 0 

25 (1,0,0), (0, 1,0), (0,0, 1); at these points / = 4 and -2 (min) and 5(max) 

27 By increasing fc, more points are available so / max goes up. Then A = > 0 

29 (0, 0); A = 0; / m i n stays at 0 

31 5 = Ai + A 2 , 6 = Ai + A3, A 2 > 0, A3 <0; subtraction 5 — 6 = A 2 — A3 or— 1 > 0 (impossible); 
x = 2004, y = —2000 gives 5x + 6y = -1980 

33 2x = 4Ai + A 2 , 2y = 4Aj + A3, A 2 > 0, A3 > 0,4x + 4y = 40; max area 100 at (10,0)(0,10); min 25 at (5,5) 

CHAPTER 14 MULTIPLE INTEGRALS 
Section 14.1 Double Integrals (page 526) 

1 !> I 52 7 1 9| 11 £, fLi d x dy + /; =2 J» /2 dx dy 

13 /„= 0 /.=- \\ny dx d y 15 SLo f£-y/i d y dx 17 fo 1 ft' 2 dx d y = ft' 2 fL d y dx =\ 

1 9 fo f-y dx d y - /-1 f-x d y dx + fo 1 fx d y dx = 9 21 Jo /»% dx d y + ft ft /a dx d y = fo fl x d y dx = * 

23 fo 1 C dy dx + ft f b o (2 ~ X) dy dx = ft fli b (V ' b) dx dy = b 25 /(a, 6) - /(a, 0) - /( 0, 6) + /( 0, 0) 
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27 fo Jq (2x — 3y + l)dx dy = | 29 f(x)dx = f* J 0 /(l) ldy dx 31 50,000*- 

33 jf fi x 2 dxdy= % 35 2 f Q 1/v ^ f/ V ' 1 dx dy = f 

_ 1 * 1 

37 — , — ) is exact for / = l,x,y,xy 39 Volume 8.5 41 Volumes ln2,21n(l + \/2) 

43 Jo Jo 1 a= y da: dy = /J ^dy = In 2; /J x y dy dx = f* f^jdx = In 2 
45 With long rectangles ^ y< A A = Y1 A A = 1 but f f y dA = | 

Section 14.2 Change to Better Coordinates (page 534) 

1 f^j{ 4 fo r dr d0 = f 3 S = quarter-circle with u > 0 and v > 0; ff du dv 

5 R is symmetric across the y axis; ff f 0 1_ ” u du dv — | divided by area gives (u, v) = (4/3n,4/3n) 

7 2 fi+x ~ X dy dx; xy region R* becomes R in the x*y* plane; dx dy = dx* dy* when region moves 

dx/dr* dx/dd* _ cosd* -r*sin0* _ f 3»/4 f i 

9,7 ~ dy/dr* dy/d6* ~ sin r*cos0* ’-W* Jo 

11 J y = S Sr x * dx d y = Jj// 4 So r ‘ 2 cos2 19 r dr dd = - |; I x = ^ + |; / 0 = f 

13 (0,0), (1,2), (1,3), (0,1); area of parallelogram is 1 

15 x = u, y = u -f 3t> + uv\ then (u, v) = (1,0), (1, 1), (0, 1) give corners (x, y) = (1,0), (1,5), (0,3) 

17 Corners (0,0), (2,1), (3,3), (1,2); sides y = |x, y = 2x - 3, y = |x + §, y = 2x 
19 Corners (1,1), (e 2 , e), (e 3 ,e 3 ), (e, e 2 ); sides x = y 2 ,y = x 2 /e 3 , x = y 2 /e 3 ,y = x 2 
21 Corners (0,0), (1,0), (1,2), (0,1); sides y = 0, x = l,y = 1 + x 2 , x = 0 

23 J= \ J =3, area ff fj 3du dv = 3; J = “ +2v ^ u+2 „ = 3* 3 “+ 3 ”, /J J 0 X 3 C 3 “+ 3y du dt, = 

f 0 1 (e 3+3v - e 3v )dv = |(e 6 - 2e 3 + l) 

25 Corners (x,y) = (0,0), (1,0), (1,/(1)),(0,/(0));(1, 1) gives x = |,y = /(|); 7= = /(«) 

27 B 2 = 2 J^ 4 J 0 1/8in<> e- r ’r dr d0 = J”/ 4 ( e -i/ e - l)d<? 

29 f — ff f2 dr d0/ f f r dr dd = f£ |a 3 cos 3 6 dd/i ro 2 = 31 J^* r 2 r dr d0 = | 

33 Along the right side; along the bottom; at the bottom right corner 

35 f f xy dx dy = f* Jq (u cos a — v sin a) (u sin a H- 1 ; cos a)du dv ~ J (cos 2 a — sin 2 a) 

37 fo" jf r 2 r 2 r dr d0 = ^(5 6 -4 6 ) 39 x = cos a: — sin a, y = sin a 4- cos a goes to tx = 1, v = 1 


Section 14.3 Triple Integrals (page 540) 


1 So 1 So 2 So dx dy dz — | 

30 <y<x<- 2 ;< 1 and all other orders xzy , yzx,zxy } zyx; all six contain (0,0,0); to contain (1,0,1) 

5 /- 1 /- 1 /-l <il£ fy‘k = 8 7 = 4 SJ-lf* Si dy = 3 


» s: si r “ jr 1 " 2 * **<<»=! 78 /» ,/3 j? -2 ” /r”- 2 - * «<* ^ - * 

y dx dy dz = 67T 



dx dy dz ~ i r 


21 Corner of cube at (^, ^); sides area ^ 

23 Horizontal slices are circles of area ?rr 2 = ?r(4 — ^); volume = J Q 7r(4 — 2 r)d 0 = 8tt; centroid 
has x = 0, y = 0,^ = f* zn (4 — z)dz/8w = | 
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25 I = T gives zeros; §£ = /" / dy dz, f£ = J* /* / dx dz, ^ = /* / da: 

27 Ji /^(y 2 + z 2 )dx dy dz = f;fffx 2 dV = f ;3 fff(x - ^) 2 dV = f 

29/ 0 3 /o/o dx dy dz = 6 31 Trapezoidal rule is second-order; correct for 1, x, y, z, xy, xz, yz, xyz 

Section 14.4 Cylindrical and Spherical Coordinates (page 547) 


1 (r, 0, z) = [D, 0, 0); (p, <f>, 0) = ( D , f , 0) 3 (r, 0, z) = (0, any angle, D); [p, <f>, 0) = [D, 0, any angle) 

5 [x,y,z) — (2, -2, 2 y/2)-, ( r,0,z ) = (2y/2,-f,2^) 7 ( x,y,z ) = (0, 0,-1); ( r,0,z ) = (0, any angle, -1) 


9 0 = tan *(£) 11 45° cone in unit sphere: ^(1 — ^-)=) 


17 £ 


19 Hemisphere of radius x : far 4 


near side 


15 \ hemisphere: | 

23 |o 3 tan a (see 8.1.39) 27 % = p — ;° 8 * - hypotenuae 

31 Wedges are not exactly similar; the error is higher order = 

33 Proportional to 1 + ^{y/a 2 + [D — h) 2 — y/a 2 + D 2 ) 
a 

35 J = | b | = a6c; straight edges at right angles 3T 


13 cone without top: 

3 #. 21 ic(R 2 — z 2 )\ Anry/R 2 — r 2 

cos a 

proof is correct 


39 


. 2 
3 > 3 


41 p 3 ; p 2 ; force = 0 inside hollow sphere 


cos 0 
sin 0 
0 


—rain 6 
r cos 6 
0 


= r 


CHAPTER 15 VECTOR CALCULUS 
Section 15.1 Vector Fields (page 554) 


1 f(x,y) = x+2y 3 f(x, y) = sin(x + y) 5 f[x,y) = ln(x 2 + y 2 ) = 21nr 

7 F = xyi + ^-j, f[x, y) = 9 = 0 so / cannot depend on x; streamlines are vertical (y = constant) 


i + 2yj 15F = 2a:i-2yj 


11 F = 3i + j IS F = 

19% = -l;y = -x + C 21^ = -f; 

27 F = + atj 29 F = =^g (xi + y _ f Ii - ffifg rp7T ((x _ i) i + y j) 


: 2 + y 2 = C 


17 F = e* - ® i — e*~ v } 


25 parallel 


St F = ^2*7 i - i S3 &L — ^2 _ l. djf _ _ 2 

X 2 » 1 2 X J 2’ dx y/^3 ~ * 

35 ^ = = ILL*. <LL = ®Lv.. f( r ) = n ff ; veg c ; rc i Gg 

00 dx dr dx dr r » dy dr r ’ J V r l ° gives Circles 

37 T; F (no equipotentials); T; F (not multiple of xi + yj + zk) 

39 F and F + i and 2F have the same streamlines (different velocities) and equipotentials (different potentials). 
But if / is given, F must be grad /. 


Section 15.2 Line Integrals (page 562) 

1 jf \/l 2 + 2 2 dt = x /5;/ 0 1 2dt = 2 3 £ t*y/2dt + J* 1 • (2 - t)dt = f + § 

5 /o *( — 3 sin t)dt = 0 (gradient field); /j" — 9sin 2 1 dt = —9x = — area 
7 No, xy j is not a gradient field; take line x = t,y = t from (0,0) to (1,1) and / t 2 dt ± | 
9 No, for a circle (2?rr) 2 ^ 0 2 + 0 2 11 / = x + ^y 2 ; /(0, l) - /(1,0) = 

13 / = |zV; /(0, 1) - /(l, 0) = 0 15 / = r = N /^ 2 + ^; /( 0, 1) - /( 1, 0) = 0 

17 Gradient for n = 2; after calculation ^ ^ 

’ oy dx r n 

19 x = a cos t, = a sin t y ds = a dt , M — J Q (a + a sin t)a dt = 2xa 2 
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21 x = a cos £, y = a sin £ , ds = a dt, M = f a 3 cos 2 t dt — na 3 , (x, y) = (0, 0) by symmetry 
23 T = = 7 ^fr;F = 3 Z i + 4j =6ti + 4j > da = 2vT+«»* > P-Tda= (6M + 4 j) •(^{ y )2VTTt 5 'dt 

20 1 dt; F • dR = (6ti + 4j) • (2 titi + 2t dt j) = 20t dt; work = J 2 20 1 dt = 30 
j£ then M = ay + b, N = ax + c, constants a, b, c 

27 F = 4sj (work = 4 from (1,0) up to (1,1)) 29 / = [x - 2y]^,o) = _1 31 / = I^lfo.o) = 1 

33 Not conservative; f*(t i — tj) ■ (dt i + tit j) = / 0 dt = 0; (t 2 i — tj) • (dt i + 2t tit j) = f 1 —t 2 dt = — | 

35 w = ax > = 2 * + 6 - 30 « = 2, ■ b is arbitrary 37 SM. = 2 ye- = f = -y 2 e~* 

39 a_M_ == ^ = M..f = r=V ^ T ^=\ xi + y j\ 

41 F = (x - y)i + (x + y)j has ^ = -1, ^ = 1, no / 43 2x;0;0 

Section 15.3 Green’s Theorem (page 571) 

1 Jq K (a cos t)a cos t dt — 7ra 2 ; N x — M y = 1, f f dx dy = area no 2 

3 J x dx -f x dx = 0, N x — M y = 0, f f 0 dx dy = 0 

5 J x 2 y dx = / 0 2 *(acos£) 2 (asin£)(-asin£ dt) = — — / Q 2,r (sin 2£) 2 d£ = — 

N x — M y = —x 2 , / f(—x 2 )dx dy = J 2 * J Q a — r 2 cos 2 9 r dr d9 — — ~~ 

7 f x dy — y dx = f*{ cos 2 £ + sin 2 £) dt = ir; f J (1 + l)dx dy — 2 (area) = tt; f x 2 dy — xy dx = | -f 1; 

/o /o x M x dy ~ I 

9 | J 2?r (3 cos 4 t sin 2 £ + 3 sin 4 t cos 2 £)d£ = | / Q 2,r 3 cos 2 £ sin 2 tdt = |- (see Answer 5) 

11 J F ■ dR == 0 around any loop; F = + jfj and f F • dR = [— sin £ cos £ + sin £ cos £]d£ = 0; 

^ = Hr 8 ives J / 0 til dy 

13 x = cos 2f, y = sin 2t, t from 0 to 2x; J^ n —2 sin 2 2f dt = — 2x = —2 (area); 

J Q 2,r —2dt = — 4x = —2 times Example 7 

15 J M dy — Ndx = J 2 * 2 sin t cos t dt — 0; f f (M x + N y )dx dy — J J 0 dx dy = 0 

17 M = f,N = Mdy-Ndx = cos 2 t+sin 2 t)tit = 2x; //(M x + jVj,)tix tiy = f f (^ - fr.+ ; ~ £)dx dy 

If \dx dy = f f dr dd = 2x 

19 f Mdy - JVtiz = / -z 2 y dx = f° -x 2 (l - x)tix = f* / 0 1_y x 2 tix dy = ± 

21 ff(M x + N y )dx dy — If div F dx dy — 0 between the circles 
23 Work: f a dx + b dy = f f (§~ ~ §^)dx dy; Flux: same integral 

25 g = tan -1 (^) = 9 is undefined at (0,0) 27 Test M y — N x : x 2 dx + y 2 dy is exact = d(|x 3 -f |y 3 ) 

29 div F = 2y — 2y = 0; g = xy 2 31 div F = 2x + 2y; no g 33 div F = 0; g = e x sin y 
35 div F = 0; g = 2 ^- 

37 N x - M y — -2x, -6xy, 0, 2x - 2 y, 0, — 2e x+y ; in 31 and 33 / = |(x 3 -f y 3 ) and f = e x cos y 
39 F = (3x 2 — 3y 2 )i — 6xy j;div F = 0 41 / = x 4 — 6x 2 y 2 + y 4 ; g = 4 x 3 y — 4xy 3 

43 F = c* cos y i — e x sin y j; g = e x sin y 

45 JV = /(re), / Mtix + Ntiy = f(l)dy + f° f(0)dy = /( 1) - /( 0); //(iV x - M y )tix tiy = 
f f §xd x dy — Jq §£dx (Fundamental Theorem of Calculus) 
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Section 15.4 Surface Integrals (page 581) 


1 N = -2xi - 2yj + k; dS = dx dy; f 2n f 2 Vi + 4 r 2 r dr dd = *(17 3 / 2 - 1) 

3 N = — i + j + k; dS = \/3 dx dy, area V^n 

6 N - + * <“ = 1 J « ' ! ° ~ r(2 " ^ 

7 N = — 7j + k; dS = 5\/2 dx dy, area 5-\/2A 

9 N = (y 2 — x 2 )i — 2xyj + k; dS = \/l + (y 2 - x 2 ) 2 + 4x 2 y 2 dx dy = \/l + (y 2 + x 2 ) 2 dx dy; 

/o * /o 1 VT+r^ r dr dO = -J 2 + "M^+v' 2 ) 

11 N = 2i + 2j + k; 45 = 3dx dy; 3 (area of triangle with 2x + 2y < 1) = | 

13 nay/d 2 + h? 15 f £ y xy(\/3 dx dy) = 

17 sin 2 <f > cos <f > sin 9 cos 0(sin^ d<f> d6) =0 19 A = i+j+2k; B = j+k; N = — i— j+k; dS = \/3 du dv 

21 A = — sin u(cos v i -f sin v j) + cos uk;B = -(3 + cos u) sin v i + (3 + cos u) cos v j; 

N = —(3 + cos u) (cos u cos v i + cos u sin v j -f- sin u k); dS = (3 + cos u)du dv 

23 ~ N §5 + p ) dx d y = // (~ 2 * 2 - *y 2 + *) dx d y = // ~ r2 ( r * de ) = - 8 * 

25 F • N = — xH-y + z==0on plane 

27 N = -i - j + k, F = (v + u) i - u j,//F • NdS = // -v du dv = 0 
29 f J dS = Jq* / Q 27r (3 + cos u)du dv = 12?r 2 31 Yes 33 No 

35 A = i + /' cos 0 j + f 9 sin0 k;B = — /sin0 j + / cos 0 k; N = ff'i — / cos 0 j — /sin0 k; dS = |N|d:r dd = 

/(xJvT+7 5 dx 40 


Section 15.5 The Divergence Theorem (page 588) 

ldivF=l JffdV=f 3 divF = 2x + 2y+2z,///divFdV = 0 5 div F = 3, // 3dP = | = \ 

7 F • N = p 2 , / / p _ 0 p 2 dS = 4jto 4 9 div F = 2 z, J 2 " J^ 2 2 p cos <j>[p 2 sin </> dp d<j> dd) = |7ro 4 

11 Jq Jo Jq (2x + l)dx dy dz = o 4 + a 3 ; — 2a 2 + 2a 2 + 0 + a 4 + 0 + a 3 

13 div F = f,fff fdV = 0;F • n = x, // xdS = 0 15 div F = 1; /// 1 dV = f; /// 1 dP = | 

17 div (p 1 ) = ^8= + e, . grad^ = fr ~ • gradp 

19 Two spheres, n radial out, n radial in, n = k on top, n = — k on bottom, n = / 1+a ^ on side; 

v* 3 +y 3 

n = -i, - j, — k, i -f 2j + 3k on 4 faces; n = k on top, n = + Jfj - k) on cone 

21 V = cylinder, // / div F dV = //(^ + ^ )dx dy (z integral = l); //F • n dS = 

/ Mdy — Ndx y z integral = 1 on side, F ■ n = 0 top and bottom; Green’s flux theorem. 

23 div F = = — 4 ttG; at the center; F = 2R inside, F = 2(^) 3 R outside 

25 div u r = |, $ = ^, // E • n dS = // IdS = 4 tt 27 F (div F = 0); F; T(F * n < 1); F 

29 Plane circle; top half of sphere; div F = 0 


Section 15.6 Stokes’ Theorem and the Curl of F (page 595) 

1 curl F = i + j + k 3 curl F = 0 5 curl F = 0 T / = ^(rc + y-|- z ) 2 

9 curl x m i — 0; x n j has zero curl if n = 0 11 curl F = 2yi; n = j on circle so // F • ndS = 0 

13 curl F = 2i + ?j, n = i, // curl F • ndS = // 2 dS = 2% 
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15 Both integrals equal / F -dR; Divergence Theorem, V = region between S and T, always div curl F = 0 
17 Always div curl F = 0 19 / = xz + y 21/ = e x ~ z 23 F = yk 

25 curl F = (0362 — <*263)1 + (ai&3 — «3&i)j + (<*261 - ai& 2 )k 27 curl F = 2wk; curl F ■ — = 2 cj /\/3 
29 F = x(a 3 z + a 2 y)i + y(<*ix + a 3 z ) j + z(a x x + a 2 y)k 

31 curl F = —2k, / f —2k • RdS — f 2 * J^ 2 —2 cos <^(sin <j> d<f> dd) = — 2tt; / y dx — x dy = 

J 2 *(— sin 2 1 — cos 2 t)dt = — 2tt 

33 curl F = 2a, 2 / /(aix + a 2 y + a 3 z)dS = 0 + 0 + 2a 3 J 2 * f^ 2 cos ^ sin <f> d<f> dd = 27ra 3 

35 curl P = -i, n = // F • ndS = r 2 

37 <7 = 2^ — ^ = stream function; zero divergence 

39 div F = div (V + W) = div Y so y = div V so V = ^-j (has zero curl). Then W = F — Y = xyi — 

41 curl (curl F) = curl (— 2yk) = — 2i; grad (div F) = grad 2x = 2i; F xx -f F yy + F zz = 4i 
43 curl E = — = a sin t so E = | (a x R) sin t 

45 n = j so fMdx + Pdz = ff{^f - £)dx dz 47 M y * = M y + M z f y + P y f x + P*/ y / x + P/ xy 

49 fFdR = // curl F • n dS;ff¥ndS = ///div F dY 
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Mdet [V 4 — A 
»*=[-’ 1 ] 


= A 2 - 5A = 0 if A = 0 or A = 5; A 



= 0 


2 

-1 



33 F if 6 ^ 0; T; T; F (e A * is not a vector); T 


Section 16.2 Differential Equations (page 610) 

1 3 Be 3 * — Be 3 * = 8e 3 * gives B = 4 : y = 4e 3 * 3 y = 3 — 2 t + t 2 5 At * + 4e 3 * = 7at£ = 0ifA=3 

7 Add y = Ae"* because y' + y = 0; choose A = — 1 so — e -t + 3 — 2t + £ 2 = 2 at £ = 0 

9 y = y = t; by l’Hopital lim * - = lim = t 

11 Substitute y = At 1 + Bte * + C cos t + 2? sin $ in equation: B = 1,C=|,B = — |, any A 

13 Particular solution y = Ate* + Be*; \f = Ate* + (A + B)e* = c(Aie* + Be*) + te* 
gives A = cA + 1, A + B = cB, A = y^, B = 

15 A 2 e A * -f 6Ae A * + 5e A * = 0 gives A 2 + 6A + 5 = 0, (A -f 5) (A + 1) = 0, A = — 1 or —5 
(both negative so decay); y = Ae"* + Be" 5 * 

17 (A 2 + 2A + 3)e A * = 0, A = — 1 ± y/^2 has imaginary part and negative real part; 
y = Ae^" 1 " 1- ^ 1 )* + Be^" 1 "'/ 2 *)*; y — Ce~ l cos y/2 1 + Be"* sin y/2 1 

19 d = 0 no damping; d = 1 underdamping; d = 2 critical damping; d = 3 overdamping 

21 A = — | ± v ^~ 4c is repeated when 6 2 = 4c and A = — |; (tA 2 + 2A)e A * + 6(tA + l)e A * + cte A * = 0 
when A 2 + 6A + c = 0 and 2A + b = 0 

23 —a cos t — b sin t — a sin t + 6 cos t + a cos t + 6 sin t = cos t if a = 0, 6 = 1, y = sin t 

25 y = A cos 3t + B cos 5i; y" + 9 y = — 25B cos 5t + 9B cos 5t = cos 5t gives B = =1; 
yo = 0 gives A = y£ 

27 y = A(cosa;t — cosa;o*)> y n = — Aa; 2 coswt + Aw 2 coswot, y ,; + w 2 y = coswt gives A(—u> 2 + w 2 ) = 1; 
breaks down when w 2 = u 2 

29 y = Be 5 *; 25B + 3B = 1, B = ^ 31 y = A + Bt = § + |t 

33 j/ ; — 25y = e 6 *; y n + y = sin t; y" = 1 + t; right side solves homogeneous equation so particular 
solution needs extra factor t 

35 e*,e"*,e**, t~ xt 37 y = e" 2 * + 2te" 2 *; y(27r) = (1 + 4?r)e" 4,r « 0 

39 y = (4e" r * - r 2 e" 4 */ r )/( 4 - r 2 ) — 1 as r -> 0 43 h < 2; fe < 2.8 


Section 16.3 Discrete Mathematics (page 615) 

1 Two then two then last one; go around hexagon 3 Six (each deletes one edge) 

5 Connected: there is a path between any two nodes; connecting each new node requires an edge 
13 Edge lengths 1,2,4 

15 No; 1,3,4 on left connect only to 2,3 on right; 1,3 on right connect only to 2 on left 17 4 
19 Yes 21 F (may loop); T 25 16 
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l/ u n dx — a “ n+1 j except for f ~ x f — All the integrals 1-17 involve u = ax + b 

2/*“"** except for / ^ = f - ^ and / ^ ^ 

3/^ te = Mf-2iu + 6 I ln|u|) 4/*i£ = £(„-2Mn|u|-£) 5 / = *(ln |tx| + £ - £) 

6/S = F ln l=l 2/ & = + p ln l*l 8/ = sb “ ik ln lil 9 / irSr = ~T^f + F ln lil 

10 /v/S<f*=£« 3 /’ u/rr^S ix = 2^a« J /« j 2 / x»^S <fc = MiS2!*irJ||h±!Sa u 3/2 

i3i^dx=2 y /u+ti^ z i 4 /^ = a^a v ^ ii)f = ' 1!, '.';,y >l> ’ l vi 
16/ * = (‘ > 0) ” $(*<<>) 17/ gdz= -£ + | / ^ 


be— ad 111 1 qz-f& I 


z dz 1 / b 


20/ Vx 2 ± a 2 dx = §\/x 2 ± a 2 ± ^ ln |x + \/x 2 ± a 2 | 


= E=sa(: ln l ai + 6 I - £ ln l CI + d l) 


2 1 / = ln I* + ±a2 l 


^±<il <*3. 

Z 


2 ±a 2 2 


= \/x 2 + a 2 - aln (“±yS±Z^ 


— |\/x 2 zb a 2 =p ~ ln lx + \/x 2 db a 2 


dx 


^x 2 — a 2 — a sec 1 - 

a 


/z 2 ±q 2 j \/z 2 ±q 2 


+ In lx + \/x 2 ± a 2 1 


26/ x Vx 2 ± a 2 dx=l (2x 2 ± a 2 )\/x 2 ± a 2 - £ ln |x + >/x 2 ± a 2 | 


nn f £i — -r vx*±a 

**•’ x 2 \Zx*±a* ' a 2 ® 


28/ (x 2 ± a 2 ) 3 / 2 dx — |(2x 2 ± 5a 2 ) V x 2 ± a 2 + ^ ln |x + \/x 2 ± a 2 | 29/ 

30 /Vo 2 - x 2 dx = f Va 2 - x 2 + ^ sin -1 | 31 / 7 ** 2 = sin -1 g 32/ 


dx 

(z 2 ±a 2 ) 3 / 


dbz 

a 2 \Zz 2 ±a 2 


#= = sin" 1 f 

i 2_ t 2 a 


dz 

— 


33/ x - dx = y/a 2 - x 2 - aln | ° +vf ^ — | 34/ i 2 \/a 2 - x 2 dx = |(2x 2 - a 2 )Va 2 - x 2 + y sin ** 


oe f dz — _ Va 2 -x 2 

~2+/^2 _~2 a 2 z 


^ 36f^dx = -^ - sin- 1 ! 37/^p = -iln|^^| 
v/a 2 - x 2 + ^sin -1 ^ 39/(o 2 - x 2 ) 3 / 2 dx = | (5a 2 - 2x 2 ) \/a 2 - x 2 + ^ sin - 1 ^ 

fe* >’<<’ 43/ I ^ = lta„(f + f) 

5/13 ‘a" _I \/fe Utl ¥ . j! > » ! 45/ n/fci = i ta » ¥ 

4-r, ln| s . tico...tyg=»l.ina i| j! <c i 47/ i_ c t ., = ~; cot f 


= -Mn 

,2_ r 2 a 


i 1 n 1 a+v a 2 — z 2 


■ f v a 2 — x 2 + sin” 1 - 


dz _ 

b+c sin ax aN 


in f dx _ -1 

6+csinqz ay /^2~^2 

A A f 2 

6 +CCO 8 qz a> / 52_ c 2 


6+ccos qz 


c+6 sin qz+y/c 2 — 6 2 cos qz l2 

6+c sin az | > 

“~‘k/fe tani ?l' * ! >‘ J 


1 I c+fr cos qz+v c 2 -6 2 sin ax I l2 ^ .2 

aZp m I 6+ccosa* l> 0 < c 


ax _ 1 tan ax 

l+cosoi ~ a lan 2 


dx _ _ 1 rr . t ax 

l-cos ax ~~ a LO '' 2 


48/ sin 1 ax dx = x sin 1 ax + ^Vl - a 2 x 2 49/ x n sin 1 ax dx = sin 1 ax - ^ 

50/ tan -1 ax dx= x tan -1 ax - ^ ln(l + a 2 x 2 ) 51/ x n tan -1 ax dx = tan -1 ax - ^ 

52/ e ai dx = ^ 53/ xe ai dx = ^(ax - 1) 54/ * 2 e ai dx = ^(a 2 x 2 - 2ax + 2) (6“ is e^ 1 " 6 ) 1 ) 

55/ ^ = !n I In ox | Not elementary: / e*’ dx, / e* ln x dx, / ^ dx, / ^ dx, / ^dx 



Exponentials and Logarithms 

y = b x *-*x = log b y y = e* <-► x = In y 
e = lim(l+±)“ = £~ Q i = 2 . 71828 - ■■ 

«* = lim(l + * ) n = E“=o m 

In y = !if In 1 = 0 lne = 1 

In xy = In x + In y In x n = n In x 

loga y = (log a b) (log b y) log a 6 = 1/ log b a 

gX+y _ e x e y b x = e xlnb gin y _ y 


Equations and Their Solutions 


y' = cy 
y' = cy + s 
y' = cy- by 2 

y" = -A 2 y 

my" + dj/ + ky = 0 
2/ n +i = ay n 
y n +i = ay n + s 


y 0 e 

y 0 e ct + f(e cl - 1) 

-C- j _ f-fyo. 
6+de’ ct ° “ j/o 

cos A£ and sin At 
e Alt and e Aat or te Xlt 
a n Vo 

«"yo + s^fr 


Vectors and Determinants 


Matrices and Inverses 


A = aii + a 2 j + 03k 

|A| 2 = A ♦ A = a\ + c&2 4- a 3 (length squared) 

A • B = ai&i + <1262 4- 0363 = |A||B|cos 0 
| A • B| < |A||B| (Schwarz inequality: | cos 6\ < 1) 
|A 4- B| < |A| 4- |B| (triangle inequality) 

|A x B| = |A||B||sin0| (cross product) 

i j k 1(&2^3 ~ ^3^2) 

AxB= ai a 2 a 3 = 4-j(a 3 6i - ai6 3 ) 

b± 62 & 3 4-k(ai& 2 — cL 2 bi ) 

Right hand rule i x j = k, j x k = i, kxi=j 
Parallelogram area = |ai& 2 — a 2 &i | = |Det| 
Triangle area = \\cLib 2 - a 2 6i| = ||Det| 

Box volume = |A • (B x C)| = (Determinant | 


Ax = combination of columns = b 
Solution x — A~~ l b if A~ 1 A = I 


Least squares A T Ax = A T b 
Ax — Ax (A is an eigenvalue) 


' a 6 ‘ 

-1 

1 

d -6 ' 

1 

ad— be 


c a 


— c a 


{AB)- 1 = B^A" 1 , (AB) t = B T A T 


‘ 

-1 

b x c 

a b c 

_ 1 
~ £> 

c x a 

a x b 


cl 1 cl 2 a 3 

b 1 62 ^3 

ci c 2 c 3 


4* <^ 162^3 4" Ct2^3 Cl 4- CL^b\C 2 
— aifc 3 C2 — Q-2^1 C3 — a 3 6 2 c 1 



SI 

Units 

Symbols 

From 

To 

Multiply by 

length 


meter 

m 

degrees 

radians 

.01745 

mass 


kilogram 

kg 

calories 

joules 

4.1868 

time 


second 

s 

BTU 

joules 

1055.1 

current 


ampere 

A 

foot-pounds 

joules 

1.3558 

frequency 


hertz 

Hz ~ 1/s 

feet 

meters 

.3048 

force 


newton 

N ~ kg-m/s 2 

miles 

km 

1.609 

pressure 


pascal 

Pa ~ N/m 2 

feet/sec 

km/hr 

1.0973 

energy, work 


joule 

J ~ N-m 

pounds 

kg 

.45359 

power 


watt 

W ~ J/s 

ounces 

kg 

.02835 

charge 


coulomb 

C ~ A-s 

gallons 

liters 

3.785 

temperature 


kelvin 

K 

horsepower 

watts 

745.7 

Speed of light 

c 

= 2.9979 xlO 8 m/s 

Radius at Equator R = 

: 6378 km = 3964 miles 

Gravity 

G 

= 6.6720 x 10" 11 Nm 2 /kg 2 

Acceleration 

9 = 

9.8067 m/s 2 = 32.174 ft/s 2 



Sums and Infinite Series 

l + x + ■ — h x n ~ 1 = 

1 + nx + + • • • + x n = (1 + s) n 

1 + 2 + b n = §n(n + 1) « ~ 

l 2 + 2 2 + . . . + n 2 = a Is +Ui22AU . * f 

1+| + b ^ « In n — + oo (harmonic) 

1— 2 + | — ••• == ln 2 (alternating) 

+ V' _I_ — il v J- — e! 

A 3 ~ 5 4 Z-r „•> 6 Z^ n < 90 

= 1 + x + z 2 + • • • (geometric: |x| < 1) 

= 1 + 2x + 3x 2 + - • • = ^(j^) 

= 1 — x + re 2 — • * • (geometric for —x) 

ln(l + *)=*-£ + £- ■■■ = / ^ 

sin x = x — x 3 /6 + x 5 / 120 — • • • (all x) 

cos x = 1 — x 2 /2 + x 4 /24 — * • • (all x) 

e at = l + x + |j‘ + t - < (e=l+l+^ + ' t ') 

e tx — cos x -Vi sin x (Euler’s formula) 

cosh x = |(e x + e“ a: ) = 1 + |y -f • * • 

sinhx = |(e x - e“ x ) = x + ~ H 

(cos 0 -H i sin $) n — cos nO -f- i sin n$ 

/(x) = /( 0) + f'(0)x + /"(0) ~y + •••• (Taylor) 

/ 1 2 -? y) = / "h x fx + yfy “I - |r /** x vfxy + • • • 

Polar and Spherical 


x = r cos 0 and y = r sin 0 


r = y/x 2 4- y 2 and tan 0 = y/x 
x -V iy = r(cos 0 + * sin 0) = re t0 
Area f ~r 2 d$ Length f \Jr 2 e T r 2 dS 
x = p sin <f> cos 5 , y = p sin <j> sin 0, z = p cos <f> 
Area dA = dx dy = r dr d6 = J du dv 
Volume r dr d0 dz = p 2 sin <f> dp d<j> d$ 

Stretching factor J = = u v 

d{u ' v) y u y„ 


Area - Volume - Length - Mass - Moment 

Circle 7rr 2 Ellipse nab Wedge of circle r 2 0/ 2 
Cylinder side 2nrh Volume n r 2 h Shell dV = 2nrh dr 
Sphere surface Anr 2 Volume |7rr 3 Shell dV = 47 rr 2 dr 
Cone or pyramid Volume i (base area) (height) 

Length of curve / ds = J y/l + (dy/dx) 2 dx 

Area between curves /(v(x) — w(x))dx 

Surface area of revolution f 2 nr ds(r = x or r = y) 

Volume of revolution: Slices / ny 2 dx Shells f 2nxh dx 
Area of surface z(x, y) : ff yjl + z 2 + z 2 dx dy 

Mass M = ffpdA Moment M y — f f px dA 
x = M y /M,y = M x /M Moment of Inertia I y = f f px 2 dA 

Work W = /* F{x)dx = V{b) - V (a) Force F = dV/dx 

Partial Derivatives of z — f(x,y) 

Tangent plane z - z 0 ~ (f£)(s - x 0 ) + (f£)(y - yo) 
Approximation A z « (|£)Ax + (|^-)A y 

Normal N = (/„ /„, -l) or (F X) F y , F z ) 

Gradient V/ = §£ i + gj 

Directional derivative: = V/ • u = f x u\ -h f y u 2 

Chain rule: f + 


Vector field F(x, y, 2) = M i + JV j + P k 


Work fF dR Flux / M dy - Ndx 
Divergence of F = V • F = ^ ^ + ir- 


Curl of F = V x F = 


i j k 

d/dx d/dy d/dz 

MNP 


Conservative F = V/ = gradient of / if curl F = O 
Green's Theorem fMdx-j-Ndy=ff (^ - ~d) dx dy 
Divergence Theorem // F • n dS = /// div F dV 
Stokes’ Theorem f F ■ dR = J J (curl F) • n dS 


An additional table of integrals is included just after the index. 



